Changes for page Methodology
Last modified by manuelmenendez on 2025/03/14 08:31
From version 20.1
edited by manuelmenendez
on 2025/02/14 14:47
on 2025/02/14 14:47
Change comment:
There is no comment for this version
To version 19.1
edited by manuelmenendez
on 2025/02/14 13:57
on 2025/02/14 13:57
Change comment:
There is no comment for this version
Summary
-
Page properties (1 modified, 0 added, 0 removed)
Details
- Page properties
-
- Content
-
... ... @@ -1,146 +1,207 @@ 1 - Here is the updated**Methodology** sectionfor theEBRAINSWiki, incorporatingthe**GeneralizedNeuroBiomarker Ontology Categorization(Neuromarker)** for **biomarkerclassificationacross all neurodegenerative diseases**.1 +**# Neurodiagnoses AI: Multimodal AI for Neurodiagnostic Predictions** 2 2 3 ----- 3 +## **Project Overview** 4 +Neurodiagnoses AI implements AI-driven diagnostic and prognostic models for central nervous system (CNS) disorders, adapting the Florey Dementia Index (FDI) methodology to a broader set of neurological conditions. The approach integrates **multimodal data sources** (EEG, neuroimaging, biomarkers, and genetics) and employs **machine learning models** to provide **explainable, real-time diagnostic insights**.## 4 4 5 -== **Neurodiagnoses AI: Multimodal AI for Neurodiagnostic Predictions** == 6 +## **How to Use External Databases in Neurodiagnoses** 7 +To enhance diagnostic accuracy, Neurodiagnoses integrates data from multiple biomedical and neurological research databases. Researchers can follow these steps to access, prepare, and integrate data into the Neurodiagnoses framework.## 6 6 7 -=== **Project Overview** === 9 +### **Potential Data Sources** 10 +Neurodiagnoses maintains an updated list of potential biomedical databases relevant to neurodegenerative diseases. ## 8 8 9 -Neurodiagnoses AI implements **AI-driven diagnostic and prognostic models** for central nervous system (CNS) disorders, expanding the **Florey Dementia Index (FDI) methodology** to a broader set of neurological conditions. The approach integrates **multimodal data sources** (EEG, neuroimaging, biomarkers, and genetics) and employs machine learning models to provide **explainable, real-time diagnostic insights**. This framework now incorporates **Neuromarker**, a **generalized biomarker ontology** that categorizes biomarkers across neurodegenerative diseases, enabling **standardized, cross-disease AI training**. 12 +**Reference: List of Potential Databases** 13 +- **ADNI**: Alzheimer's Disease data ([ADNI](https://adni.loni.usc.edu)) 14 +- **PPMI**: Parkinson’s Disease Imaging and biospecimens ([PPMI](https://www.ppmi-info.org)) 15 +- **GP2**: Whole-genome sequencing for PD ([GP2](https://gp2.org)) 16 +- **Enroll-HD**: Huntington’s Disease Clinical and genetic data ([Enroll-HD](https://www.enroll-hd.org)) 17 +- **GAAIN**: Multi-source Alzheimer’s data aggregation ([GAAIN](https://gaain.org)) 18 +- **UK Biobank**: Population-wide genetic, imaging, and health records ([UK Biobank](https://www.ukbiobank.ac.uk)) 19 +- **DPUK**: Dementia and Aging data ([DPUK](https://www.dementiasplatform.uk)) 20 +- **PRION Registry**: Prion Diseases clinical and genetic data ([PRION Registry](https://prionregistry.org)) 21 +- **DECIPHER**: Rare genetic disorder genomic variants ([DECIPHER](https://decipher.sanger.ac.uk)) 10 10 11 -== **Neuromarker: Generalized Biomarker Ontology** == 23 +### **1. Register for Access** 24 +- Each external database requires **individual registration** and access approval. 25 +- Ensure compliance with **ethical approvals** and **data usage agreements** before integrating datasets into Neurodiagnoses. 26 +- Some repositories may require a **Data Usage Agreement (DUA)** for sensitive medical data.## 12 12 13 -Neuromarker extends the **Common Alzheimer’s Disease Research Ontology (CADRO)** into a **cross-disease biomarker categorization framework** applicable to all neurodegenerative diseases (NDDs). It allows for **standardized classification, AI-based feature extraction, and multimodal integration**. 28 +### **2. Download & Prepare Data** 29 +- Download datasets while adhering to database usage policies. 30 +- Ensure files meet **Neurodiagnoses format requirements**: 31 + - **Tabular Data**: `.csv`, `.tsv` 32 + - **Neuroimaging Data**: `.nii`, `.dcm` 33 + - **Genomic Data**: `.fasta`, `.vcf` 34 + - **Clinical Metadata**: `.json`, `.xml`## 14 14 15 -=== **Core Biomarker Categories** === 36 +- **Mandatory Fields for Integration**: 37 + - **Subject ID**: Unique patient identifier 38 + - **Diagnosis**: Standardized disease classification 39 + - **Biomarkers**: CSF, plasma, or imaging biomarkers 40 + - **Genetic Data**: Whole-genome or exome sequencing 41 + - **Neuroimaging Metadata**: MRI/PET acquisition parameters 16 16 17 -The following ontology is used within **Neurodiagnoses AI** for biomarker categorization: 43 +### **3. Upload Data to Neurodiagnoses** 44 +**Option 1: Upload to EBRAINS Bucket** 45 +- Location: **EBRAINS Neurodiagnoses Bucket** 46 +- Ensure correct **metadata tagging** before submission.## 18 18 19 -|=**Category**|=**Description** 20 -|**Molecular Biomarkers**|Omics-based markers (genomic, transcriptomic, proteomic, metabolomic, lipidomic) 21 -|**Neuroimaging Biomarkers**|Structural (MRI, CT), Functional (fMRI, PET), Molecular Imaging (tau, amyloid, α-synuclein) 22 -|**Fluid Biomarkers**|CSF, plasma, blood-based markers for tau, amyloid, α-synuclein, TDP-43, GFAP, NfL 23 -|**Neurophysiological Biomarkers**|EEG, MEG, evoked potentials (ERP), sleep-related markers 24 -|**Digital Biomarkers**|Gait analysis, cognitive/speech biomarkers, wearables data, EHR-based markers 25 -|**Clinical Phenotypic Markers**|Standardized clinical scores (MMSE, MoCA, CDR, UPDRS, ALSFRS, UHDRS) 26 -|**Genetic Biomarkers**|Risk alleles (APOE, LRRK2, MAPT, C9orf72, PRNP) and polygenic risk scores 27 -|**Environmental & Lifestyle Factors**|Toxins, infections, diet, microbiome, comorbidities 48 + **Option 2: Contribute via GitHub Repository** 49 +- Location: **GitHub Data Repository** 50 +- Create a new folder under `/data/` and include a **dataset description**. 51 +- For large datasets, contact project administrators before uploading. 28 28 29 ----- 53 +### **4. Integrate Data into AI Models** 54 +- Open **Jupyter Notebooks** on EBRAINS to run **preprocessing scripts**. 55 +- Standardize **neuroimaging and biomarker formats** using harmonization tools. 56 +- Use **machine learning models** to handle missing data and feature extraction. 57 +- Train AI models with **newly integrated patient cohorts**.## 30 30 59 +**Reference**: See `docs/data_processing.md` for detailed instructions. 60 + 61 +## **Collaboration & Partnerships**## 62 +# **Partnering with Data Providers** 63 +Neurodiagnoses seeks partnerships with data repositories to: 64 +- Enable **API-based data integration** for real-time processing. 65 +- Co-develop **harmonized AI-ready datasets** with standardized annotations. 66 +- Secure **funding opportunities** through joint grant applications. 67 + 68 +**Interested in Partnering?** 69 +- If you represent a research consortium or database provider, reach out to explore data-sharing agreements. 70 +- **Contact**: info@neurodiagnoses.com 71 + 72 +## **Final Notes** 73 +Neurodiagnoses continuously expands its data ecosystem to support AI-driven clinical decision-making. Researchers and institutions are encouraged to contribute **new datasets and methodologies**.## 74 + 75 +For additional technical documentation: 76 +- **GitHub Repository**: [Neurodiagnoses GitHub](https://github.com/neurodiagnoses) 77 +- **EBRAINS Collaboration Page**: [EBRAINS Neurodiagnoses](https://ebrains.eu/collabs/neurodiagnoses) 78 + 79 +If you experience issues integrating data, **open a GitHub Issue** or consult the **EBRAINS Neurodiagnoses Forum**. 80 + 31 31 == **How to Use External Databases in Neurodiagnoses** == 32 32 33 -To enhance diagnostic accuracy, NeurodiagnosesAIintegrates data from**multiple biomedical and neurological research databases**.Researcherscanfollow these steps to access, prepare, and integrate data into the Neurodiagnoses framework.83 +To enhance the accuracy of our diagnostic models, Neurodiagnoses integrates data from multiple biomedical and neurological research databases. If you are a researcher, follow these steps to access, prepare, and integrate data into the Neurodiagnoses framework. 34 34 35 35 === **Potential Data Sources** === 36 36 37 -Neurodiagnoses maintains an **updated list**of biomedical datasets relevant to neurodegenerative diseases:87 +Neurodiagnoses maintains an updated list of potential biomedical databases relevant to neurodegenerative diseases. 38 38 39 -* **ADNI**: Alzheimer's Disease Imaging & Biomarkers → [[ADNI>>url:https://adni.loni.usc.edu/]] 40 -* **PPMI**: Parkinson’s Disease Imaging & Biospecimens → [[PPMI>>url:https://www.ppmi-info.org/]] 41 -* **GP2**: Whole-Genome Sequencing for PD → [[GP2>>url:https://gp2.org/]] 42 -* **Enroll-HD**: Huntington’s Disease Clinical & Genetic Data → [[Enroll-HD>>url:https://www.enroll-hd.org/]] 43 -* **GAAIN**: Multi-Source Alzheimer’s Data Aggregation → [[GAAIN>>url:https://gaain.org/]] 44 -* **UK Biobank**: Population-Wide Genetic, Imaging & Health Records → [[UK Biobank>>url:https://www.ukbiobank.ac.uk/]] 45 -* **DPUK**: Dementia & Aging Data → [[DPUK>>url:https://www.dementiasplatform.uk/]] 46 -* **PRION Registry**: Prion Diseases Clinical & Genetic Data → [[PRION Registry>>url:https://prionregistry.org/]] 47 -* **DECIPHER**: Rare Genetic Disorder Genomic Variants → [[DECIPHER>>url:https://decipher.sanger.ac.uk/]] 89 +* Reference: [[List of Potential Databases>>url:https://github.com/Fundacion-de-Neurociencias/neurodiagnoses/blob/main/data/sources/list_of_potential_databases]] 48 48 49 - ----91 +=== **1. Register for Access** === 50 50 51 - ==**1.RegisterforAccess**==93 +Each external database requires individual registration and access approval. Follow the official guidelines of each database provider. 52 52 53 -* Each external database requires **individual registration and access approval**. 54 -* Ensure compliance with **ethical approvals and data usage agreements** before integrating datasets into Neurodiagnoses. 55 -* Some repositories may require a **Data Usage Agreement (DUA)** for sensitive medical data. 95 +* Ensure that you have completed all ethical approvals and data access agreements before integrating datasets into Neurodiagnoses. 96 +* Some repositories require a Data Usage Agreement (DUA) before downloading sensitive medical data. 56 56 57 - ----98 +=== **2. Download & Prepare Data** === 58 58 59 - ==**2.Download&PrepareData**==100 +Once access is granted, download datasets while complying with data usage policies. Ensure that the files meet Neurodiagnoses’ format requirements for smooth integration. 60 60 61 -* Download datasets while adhering to **database usage policies**. 62 -* Ensure files meet **Neurodiagnoses format requirements**: 102 +==== **Supported File Formats** ==== 63 63 64 -|=**Data Type**|=**Accepted Formats** 65 -|**Tabular Data**|.csv, .tsv 66 -|**Neuroimaging**|.nii, .dcm 67 -|**Genomic Data**|.fasta, .vcf 68 -|**Clinical Metadata**|.json, .xml 104 +* Tabular Data: .csv, .tsv 105 +* Neuroimaging Data: .nii, .dcm 106 +* Genomic Data: .fasta, .vcf 107 +* Clinical Metadata: .json, .xml 69 69 70 -* **Mandatory Fields for Integration**: 71 -** **Subject ID**: Unique patient identifier 72 -** **Diagnosis**: Standardized disease classification 73 -** **Biomarkers**: CSF, plasma, or imaging biomarkers 74 -** **Genetic Data**: Whole-genome or exome sequencing 75 -** **Neuroimaging Metadata**: MRI/PET acquisition parameters 109 +==== **Mandatory Fields for Integration** ==== 76 76 77 ----- 111 +|=Field Name|=Description 112 +|Subject ID|Unique patient identifier 113 +|Diagnosis|Standardized disease classification 114 +|Biomarkers|CSF, plasma, or imaging biomarkers 115 +|Genetic Data|Whole-genome or exome sequencing 116 +|Neuroimaging Metadata|MRI/PET acquisition parameters 78 78 79 -== **3. Upload Data to Neurodiagnoses** == 118 +=== **3. Upload Data to Neurodiagnoses** === 80 80 81 - === **Option1:Upload to EBRAINSBucket**===120 +Once preprocessed, data can be uploaded to EBRAINS or GitHub. 82 82 83 -* Location: **EBRAINS Neurodiagnoses Bucket**84 -* Ensure**correctmetadatatagging** beforesubmission.122 +* ((( 123 +**Option 1: Upload to EBRAINS Bucket** 85 85 86 -=== **Option 2: Contribute via GitHub Repository** === 125 +* Location: [[EBRAINS Neurodiagnoses Bucket>>url:https://wiki.ebrains.eu/bin/view/Collabs/neurodiagnoses/Bucket]] 126 +* Ensure correct metadata tagging before submission. 127 +))) 128 +* ((( 129 +**Option 2: Contribute via GitHub Repository** 87 87 88 -* Location: **GitHub Data Repository**89 -* Create a **new folder under /data/**and includea **dataset description**.90 - * **For large datasets**, contact project administrators before uploading.131 +* Location: [[GitHub Data Repository>>url:https://github.com/Fundacion-de-Neurociencias/neurodiagnoses/tree/main/data]] 132 +* Create a new folder under /data/ and include dataset description. 133 +))) 91 91 92 - ----135 +//Note: For large datasets, please contact the project administrators before uploading.// 93 93 94 -== **4. Integrate Data into AI Models** == 137 +=== **4. Integrate Data into AI Models** === 95 95 96 -* Open **Jupyter Notebooks** on EBRAINS to run **preprocessing scripts**. 97 -* **Standardize neuroimaging and biomarker formats** using harmonization tools. 98 -* Use **machine learning models** to handle **missing data** and **feature extraction**. 99 -* Train AI models with **newly integrated patient cohorts**. 139 +Once uploaded, datasets must be harmonized and formatted before AI model training. 100 100 101 -** Reference**:See docs/data_processing.mdfordetailedinstructions.141 +==== **Steps for Data Integration** ==== 102 102 143 +* Open Jupyter Notebooks on EBRAINS to run preprocessing scripts. 144 +* Standardize neuroimaging and biomarker formats using harmonization tools. 145 +* Use machine learning models to handle missing data and feature extraction. 146 +* Train AI models with newly integrated patient cohorts. 147 +* Reference: [[Detailed instructions can be found in docs/data_processing.md>>url:https://github.com/Fundacion-de-Neurociencias/neurodiagnoses/blob/main/docs/data_processing.md]]. 148 + 103 103 ---- 104 104 105 -== ** AI-DrivenBiomarkerCategorization** ==151 +== **Database Sources Table** == 106 106 107 - Neurodiagnosesemploys**AImodels** for biomarkerclassification:153 +=== **Where to Insert This** === 108 108 109 -|=**Model Type**|=**Application** 110 -|**Graph Neural Networks (GNNs)**|Identify shared biomarker pathways across diseases 111 -|**Contrastive Learning**|Distinguish overlapping vs. unique biomarkers 112 -|**Multimodal Transformer Models**|Integrate imaging, omics, and clinical data 155 +* GitHub: [[docs/data_sources.md>>url:https://github.com/Fundacion-de-Neurociencias/neurodiagnoses/blob/main/docs/data_sources.md]] 156 +* EBRAINS Wiki: Collabs/neurodiagnoses/Data Sources 113 113 158 +=== **Key Databases for Neurodiagnoses** === 159 + 160 +|=Database|=Focus Area|=Data Type|=Access Link 161 +|ADNI|Alzheimer's Disease|MRI, PET, CSF, cognitive tests|ADNI 162 +|PPMI|Parkinson’s Disease|Imaging, biospecimens|[[PPMI>>url:https://www.ppmi-info.org/]] 163 +|GP2|Genetic Data for PD|Whole-genome sequencing|[[GP2>>url:https://gp2.org/]] 164 +|Enroll-HD|Huntington’s Disease|Clinical, genetic, imaging|[[Enroll-HD>>url:https://enroll-hd.org/]] 165 +|GAAIN|Alzheimer's & Cognitive Decline|Multi-source data aggregation|[[GAAIN>>url:https://www.gaain.org/]] 166 +|UK Biobank|Population-wide studies|Genetic, imaging, health records|[[UK Biobank>>url:https://www.ukbiobank.ac.uk/]] 167 +|DPUK|Dementia & Aging|Imaging, genetics, lifestyle factors|[[DPUK>>url:https://www.dementiasplatform.uk/]] 168 +|PRION Registry|Prion Diseases|Clinical and genetic data|[[PRION Registry>>url:https://www.prionalliance.org/]] 169 +|DECIPHER|Rare Genetic Disorders|Genomic variants|DECIPHER 170 + 171 +If you know a relevant dataset, submit a proposal in [[GitHub Issues>>url:https://github.com/Fundacion-de-Neurociencias/neurodiagnoses/issues]]. 172 + 114 114 ---- 115 115 116 116 == **Collaboration & Partnerships** == 117 117 177 +=== **Where to Insert This** === 178 + 179 +* GitHub: [[docs/collaboration.md>>url:https://github.com/Fundacion-de-Neurociencias/neurodiagnoses/blob/main/docs/collaboration.md]] 180 +* EBRAINS Wiki: Collabs/neurodiagnoses/Collaborations 181 + 118 118 === **Partnering with Data Providers** === 119 119 120 -Neurodiagnoses seeks partnerships with data repositories to: 184 +Beyond using existing datasets, Neurodiagnoses seeks partnerships with data repositories to: 121 121 122 -* Enable **API-based data integration**for real-time processing.123 -* Co-develop **harmonized AI-ready datasets**with standardized annotations.124 -* Secure **funding opportunities**through joint grant applications.186 +* Enable direct API-based data integration for real-time processing. 187 +* Co-develop harmonized AI-ready datasets with standardized annotations. 188 +* Secure funding opportunities through joint grant applications. 125 125 126 -**Interested in Partnering?** 190 +=== **Interested in Partnering?** === 127 127 128 -* If you represent a **research consortium or database provider**, reach out to explore **data-sharing agreements**. 129 -* **Contact**: [[info@neurodiagnoses.com>>mailto:info@neurodiagnoses.com]] 192 +If you represent a research consortium or database provider, reach out to explore data-sharing agreements. 130 130 194 +* Contact: [[info@neurodiagnoses.com>>mailto:info@neurodiagnoses.com]] 195 + 131 131 ---- 132 132 133 133 == **Final Notes** == 134 134 135 -Neurodiagnoses continuously expands its **data ecosystem**to support**AI-driven clinical decision-making**. Researchers and institutions are encouraged to**contribute new datasets and methodologies**.200 +Neurodiagnoses continuously expands its data ecosystem to support AI-driven clinical decision-making. Researchers and institutions are encouraged to contribute new datasets and methodologies. 136 136 137 - **For additional technical documentation**:202 +For additional technical documentation: 138 138 139 -* **GitHub Repository**: [[Neurodiagnoses GitHub>>url:https://github.com/neurodiagnoses]]140 -* **EBRAINS Collaboration Page**: [[EBRAINS Neurodiagnoses>>url:https://ebrains.eu/collabs/neurodiagnoses]]204 +* [[GitHub Repository>>url:https://github.com/Fundacion-de-Neurociencias/neurodiagnoses]] 205 +* [[EBRAINS Collaboration Page>>url:https://wiki.ebrains.eu/bin/view/Collabs/neurodiagnoses/]] 141 141 142 -**If you experience issues integrating data**, open a **GitHub Issue** or consult the **EBRAINS Neurodiagnoses Forum**. 143 - 144 ----- 145 - 146 -This **updated methodology** now incorporates [[https:~~/~~/github.com/Fundacion-de-Neurociencias/neurodiagnoses/blob/main/data/biomarker_ontology>>https://Neuromarker]] for standardized biomarker classification, enabling **cross-disease AI training** across neurodegenerative disorders. 207 +If you experience issues integrating data, open a [[GitHub Issue>>url:https://github.com/Fundacion-de-Neurociencias/neurodiagnoses/issues]] or consult the EBRAINS Neurodiagnoses Forum.