Wiki source code of Methodology
Version 20.1 by manuelmenendez on 2025/02/14 14:47
Show last authors
author | version | line-number | content |
---|---|---|---|
1 | Here is the updated **Methodology** section for the EBRAINS Wiki, incorporating the **Generalized Neuro Biomarker Ontology Categorization (Neuromarker)** for **biomarker classification across all neurodegenerative diseases**. | ||
2 | |||
3 | ---- | ||
4 | |||
5 | == **Neurodiagnoses AI: Multimodal AI for Neurodiagnostic Predictions** == | ||
6 | |||
7 | === **Project Overview** === | ||
8 | |||
9 | Neurodiagnoses AI implements **AI-driven diagnostic and prognostic models** for central nervous system (CNS) disorders, expanding the **Florey Dementia Index (FDI) methodology** to a broader set of neurological conditions. The approach integrates **multimodal data sources** (EEG, neuroimaging, biomarkers, and genetics) and employs machine learning models to provide **explainable, real-time diagnostic insights**. This framework now incorporates **Neuromarker**, a **generalized biomarker ontology** that categorizes biomarkers across neurodegenerative diseases, enabling **standardized, cross-disease AI training**. | ||
10 | |||
11 | == **Neuromarker: Generalized Biomarker Ontology** == | ||
12 | |||
13 | Neuromarker extends the **Common Alzheimer’s Disease Research Ontology (CADRO)** into a **cross-disease biomarker categorization framework** applicable to all neurodegenerative diseases (NDDs). It allows for **standardized classification, AI-based feature extraction, and multimodal integration**. | ||
14 | |||
15 | === **Core Biomarker Categories** === | ||
16 | |||
17 | The following ontology is used within **Neurodiagnoses AI** for biomarker categorization: | ||
18 | |||
19 | |=**Category**|=**Description** | ||
20 | |**Molecular Biomarkers**|Omics-based markers (genomic, transcriptomic, proteomic, metabolomic, lipidomic) | ||
21 | |**Neuroimaging Biomarkers**|Structural (MRI, CT), Functional (fMRI, PET), Molecular Imaging (tau, amyloid, α-synuclein) | ||
22 | |**Fluid Biomarkers**|CSF, plasma, blood-based markers for tau, amyloid, α-synuclein, TDP-43, GFAP, NfL | ||
23 | |**Neurophysiological Biomarkers**|EEG, MEG, evoked potentials (ERP), sleep-related markers | ||
24 | |**Digital Biomarkers**|Gait analysis, cognitive/speech biomarkers, wearables data, EHR-based markers | ||
25 | |**Clinical Phenotypic Markers**|Standardized clinical scores (MMSE, MoCA, CDR, UPDRS, ALSFRS, UHDRS) | ||
26 | |**Genetic Biomarkers**|Risk alleles (APOE, LRRK2, MAPT, C9orf72, PRNP) and polygenic risk scores | ||
27 | |**Environmental & Lifestyle Factors**|Toxins, infections, diet, microbiome, comorbidities | ||
28 | |||
29 | ---- | ||
30 | |||
31 | == **How to Use External Databases in Neurodiagnoses** == | ||
32 | |||
33 | To enhance diagnostic accuracy, Neurodiagnoses AI integrates data from **multiple biomedical and neurological research databases**. Researchers can follow these steps to access, prepare, and integrate data into the Neurodiagnoses framework. | ||
34 | |||
35 | === **Potential Data Sources** === | ||
36 | |||
37 | Neurodiagnoses maintains an **updated list** of biomedical datasets relevant to neurodegenerative diseases: | ||
38 | |||
39 | * **ADNI**: Alzheimer's Disease Imaging & Biomarkers → [[ADNI>>url:https://adni.loni.usc.edu/]] | ||
40 | * **PPMI**: Parkinson’s Disease Imaging & Biospecimens → [[PPMI>>url:https://www.ppmi-info.org/]] | ||
41 | * **GP2**: Whole-Genome Sequencing for PD → [[GP2>>url:https://gp2.org/]] | ||
42 | * **Enroll-HD**: Huntington’s Disease Clinical & Genetic Data → [[Enroll-HD>>url:https://www.enroll-hd.org/]] | ||
43 | * **GAAIN**: Multi-Source Alzheimer’s Data Aggregation → [[GAAIN>>url:https://gaain.org/]] | ||
44 | * **UK Biobank**: Population-Wide Genetic, Imaging & Health Records → [[UK Biobank>>url:https://www.ukbiobank.ac.uk/]] | ||
45 | * **DPUK**: Dementia & Aging Data → [[DPUK>>url:https://www.dementiasplatform.uk/]] | ||
46 | * **PRION Registry**: Prion Diseases Clinical & Genetic Data → [[PRION Registry>>url:https://prionregistry.org/]] | ||
47 | * **DECIPHER**: Rare Genetic Disorder Genomic Variants → [[DECIPHER>>url:https://decipher.sanger.ac.uk/]] | ||
48 | |||
49 | ---- | ||
50 | |||
51 | == **1. Register for Access** == | ||
52 | |||
53 | * Each external database requires **individual registration and access approval**. | ||
54 | * Ensure compliance with **ethical approvals and data usage agreements** before integrating datasets into Neurodiagnoses. | ||
55 | * Some repositories may require a **Data Usage Agreement (DUA)** for sensitive medical data. | ||
56 | |||
57 | ---- | ||
58 | |||
59 | == **2. Download & Prepare Data** == | ||
60 | |||
61 | * Download datasets while adhering to **database usage policies**. | ||
62 | * Ensure files meet **Neurodiagnoses format requirements**: | ||
63 | |||
64 | |=**Data Type**|=**Accepted Formats** | ||
65 | |**Tabular Data**|.csv, .tsv | ||
66 | |**Neuroimaging**|.nii, .dcm | ||
67 | |**Genomic Data**|.fasta, .vcf | ||
68 | |**Clinical Metadata**|.json, .xml | ||
69 | |||
70 | * **Mandatory Fields for Integration**: | ||
71 | ** **Subject ID**: Unique patient identifier | ||
72 | ** **Diagnosis**: Standardized disease classification | ||
73 | ** **Biomarkers**: CSF, plasma, or imaging biomarkers | ||
74 | ** **Genetic Data**: Whole-genome or exome sequencing | ||
75 | ** **Neuroimaging Metadata**: MRI/PET acquisition parameters | ||
76 | |||
77 | ---- | ||
78 | |||
79 | == **3. Upload Data to Neurodiagnoses** == | ||
80 | |||
81 | === **Option 1: Upload to EBRAINS Bucket** === | ||
82 | |||
83 | * Location: **EBRAINS Neurodiagnoses Bucket** | ||
84 | * Ensure **correct metadata tagging** before submission. | ||
85 | |||
86 | === **Option 2: Contribute via GitHub Repository** === | ||
87 | |||
88 | * Location: **GitHub Data Repository** | ||
89 | * Create a **new folder under /data/** and include a **dataset description**. | ||
90 | * **For large datasets**, contact project administrators before uploading. | ||
91 | |||
92 | ---- | ||
93 | |||
94 | == **4. Integrate Data into AI Models** == | ||
95 | |||
96 | * Open **Jupyter Notebooks** on EBRAINS to run **preprocessing scripts**. | ||
97 | * **Standardize neuroimaging and biomarker formats** using harmonization tools. | ||
98 | * Use **machine learning models** to handle **missing data** and **feature extraction**. | ||
99 | * Train AI models with **newly integrated patient cohorts**. | ||
100 | |||
101 | **Reference**: See docs/data_processing.md for detailed instructions. | ||
102 | |||
103 | ---- | ||
104 | |||
105 | == **AI-Driven Biomarker Categorization** == | ||
106 | |||
107 | Neurodiagnoses employs **AI models** for biomarker classification: | ||
108 | |||
109 | |=**Model Type**|=**Application** | ||
110 | |**Graph Neural Networks (GNNs)**|Identify shared biomarker pathways across diseases | ||
111 | |**Contrastive Learning**|Distinguish overlapping vs. unique biomarkers | ||
112 | |**Multimodal Transformer Models**|Integrate imaging, omics, and clinical data | ||
113 | |||
114 | ---- | ||
115 | |||
116 | == **Collaboration & Partnerships** == | ||
117 | |||
118 | === **Partnering with Data Providers** === | ||
119 | |||
120 | Neurodiagnoses seeks partnerships with data repositories to: | ||
121 | |||
122 | * Enable **API-based data integration** for real-time processing. | ||
123 | * Co-develop **harmonized AI-ready datasets** with standardized annotations. | ||
124 | * Secure **funding opportunities** through joint grant applications. | ||
125 | |||
126 | **Interested in Partnering?** | ||
127 | |||
128 | * If you represent a **research consortium or database provider**, reach out to explore **data-sharing agreements**. | ||
129 | * **Contact**: [[info@neurodiagnoses.com>>mailto:info@neurodiagnoses.com]] | ||
130 | |||
131 | ---- | ||
132 | |||
133 | == **Final Notes** == | ||
134 | |||
135 | Neurodiagnoses continuously expands its **data ecosystem** to support **AI-driven clinical decision-making**. Researchers and institutions are encouraged to **contribute new datasets and methodologies**. | ||
136 | |||
137 | **For additional technical documentation**: | ||
138 | |||
139 | * **GitHub Repository**: [[Neurodiagnoses GitHub>>url:https://github.com/neurodiagnoses]] | ||
140 | * **EBRAINS Collaboration Page**: [[EBRAINS Neurodiagnoses>>url:https://ebrains.eu/collabs/neurodiagnoses]] | ||
141 | |||
142 | **If you experience issues integrating data**, open a **GitHub Issue** or consult the **EBRAINS Neurodiagnoses Forum**. | ||
143 | |||
144 | ---- | ||
145 | |||
146 | This **updated methodology** now incorporates [[https:~~/~~/github.com/Fundacion-de-Neurociencias/neurodiagnoses/blob/main/data/biomarker_ontology>>https://Neuromarker]] for standardized biomarker classification, enabling **cross-disease AI training** across neurodegenerative disorders. |