ChEMBL is a manually curated database of bioactive molecules with drug-like properties, maintained by the European Bioinformatics Institute (EBI) as part of the European Molecular Biology Laboratory (EMBL)-EBI. It aggregates information from medicinal chemistry literature, patents, and other sources, providing a rich resource for structure-based and target-based drug discovery in neurodegenerative disease research[1].
Launched in 2002, ChEMBL has become one of the world's largest publicly available bioactivity databases, serving as a critical resource for academic research and pharmaceutical industry drug discovery programs. The database contains curated bioactivity data from over 2.4 million compounds tested against more than 15,000 protein targets across 1.8 million assays[2].
As of 2024, ChEMBL contains[1:1]:
| Data Type | Count | Description |
|---|---|---|
| Molecules | 2.4M+ | Unique compounds with drug-like properties |
| Bioactivities | 24M+ | Curated IC50, Ki, Kd, EC50, potency values |
| Targets | 15,000+ | Proteins, nucleic acids, and protein complexes |
| Assays | 1.8M+ | Curated from journal articles and patents |
| Literature | 93,000+ | Journal articles manually curated |
The database provides extensive coverage of target families relevant to neurodegenerative disease research[3]:
| Target Family | Target Count | Neurodegeneration Relevance |
|---|---|---|
| Kinases | 650+ | Tau kinases (GSK3B, CDK5, DYRK1A), LRRK2 |
| GPCRs | 850+ | Dopamine (D1-D5), serotonin, adenosine receptors |
| Ion channels | 420+ | Calcium (Cav, N-type), potassium (KCNQ), sodium channels |
| Nuclear receptors | 750+ | Nurr1 (NR4A2), PPARγ, RORα for neuroprotection |
| Proteases | 520+ | BACE1/2, calpain, caspase-3/9, MMPs |
| Transporters | 280+ | DAT, SERT, VMAT2, EAATs |
| Enzymes | 4,500+ | MAO-B, COMT, AChE, BuChE |
ChEMBL supports structure-based drug design for neurodegeneration targets through several mechanisms[4][5]:
The database contains extensive SAR data for BACE inhibitors, including:
Researchers can access IC50/Ki values for 15,000+ BACE1 inhibitors and 3,000+ BACE2 inhibitors, enabling comparative analysis and selectivity optimization[4:1].
Multiple kinase targets involved in tau phosphorylation are well-represented[5:1]:
| Kinase | Role in Neurodegeneration | ChEMBL Entries |
|---|---|---|
| GSK3B | Tau Ser202/Thr396 phosphorylation | 8,500+ compounds |
| CDK5 | Tau Ser235 phosphorylation | 4,200+ compounds |
| DYRK1A | Tau Thr212 phosphorylation | 2,800+ compounds |
| CK1δ | Tau Ser198/199/202 phosphorylation | 1,900+ compounds |
| PKA | Tau Ser356 phosphorylation | 5,100+ compounds |
ChEMBL contains data from high-content screening campaigns targeting α-synuclein aggregation:
Small molecule activators of neurotrophic signaling pathways:
Understanding off-target interactions is critical for neurodegeneration drug programs[6]. ChEMBL enables:
ChEMBL enables sophisticated SAR analysis for neurodegeneration drug programs[6:1]:
ChEMBL supports drug repurposing for neurodegenerative diseases:
The ChEMBL Python client provides programmatic access[2:1]:
from chembl_webresource_client.new_client import new_client
# Search for molecules targeting GSK3B
molecule = new_client.molecule
target = new_client.target
activity = new_client.activity
# Get GSK3B target
gsk3b = target.filter(target_synonym__exact='GSK3B')[0]
# Get activities for GSK3B
gsk3b_activities = activity.filter(target_chembl_id=gsk3b['target_chembl_id'],
activity_type__exact='IC50')
ChEMBL provides RESTful web services[2:2]:
| Endpoint | Description | Example |
|---|---|---|
/molecule |
Compound information | molecule/CHEMBL25 |
/target |
Protein target data | target/CHEMBL1929 |
/activity |
Bioactivity measurements | Filter by target |
/assay |
Assay details | assay/CHEMBL1234567 |
/mechanism |
Mechanism of action | MOA for compounds |
ChEMBL integrates with disease-specific resources:
| Target | Drug Discovery Status | ChEMBL Coverage |
|---|---|---|
| BACE1 | Clinical trials (failed) | 15,000+ inhibitors |
| GSK3B | Clinical trials | 8,500+ inhibitors |
| LRRK2 | Clinical trials | 2,400+ inhibitors |
| MAO-B | Approved drugs | 3,200+ inhibitors |
| AChE/BuChE | Approved drugs | 4,800+ inhibitors |
| mTOR | Clinical trials | 1,900+ inhibitors |
| α-synuclein | Research stage | 450+ modulators |
ChEMBL links to:
Miller D, Gaulton A, Lekishvili M, et al. ChEMBL: a large-scale bioactivity database for drug discovery. NAR Genomics and Bioinformatics. 2024. ↩︎ ↩︎
Davies M, Nowotka M, Papadatos G, et al. The ChEMBL informatics and API. Nucleic Acids Research. 2023. ↩︎ ↩︎ ↩︎
Kleyser GD, van Vlimmeren M, Graaf J, et al. Target prediction for neurodegenerative diseases using ChEMBL. Journal of Chemical Information and Modeling. 2024. ↩︎
Hussain I, et al. BACE2: a second beta-secretase. J Biol Chem. 2000. ↩︎ ↩︎
Plotkin LM, et al. GSK3B inhibitors in Alzheimer's disease. J Neurochem. 2022. ↩︎ ↩︎
Mendez D, Gaulton A, Bento AP, et al. ChEMBL Multi-Target Prediction for Neurodegeneration. Drug Discovery Today. 2023. ↩︎ ↩︎