Changes for page Methodology
Last modified by manuelmenendez on 2025/03/14 08:31
From version 4.3
edited by manuelmenendez
on 2025/01/29 19:11
on 2025/01/29 19:11
Change comment:
There is no comment for this version
To version 17.1
edited by manuelmenendez
on 2025/02/09 13:01
on 2025/02/09 13:01
Change comment:
There is no comment for this version
Summary
-
Page properties (1 modified, 0 added, 0 removed)
Details
- Page properties
-
- Content
-
... ... @@ -1,109 +1,154 @@ 1 -== =**Overview** ===1 +== **Overview** == 2 2 3 - This section describes the step-by-step process used in the **Neurodiagnoses**project todevelop a novel diagnostic framework forneurologicaldiseases.The methodologyintegrates artificial intelligence(AI), biomedicalontologies,and computationalneurosciencetocreatea structured,interpretable, andscalablediagnosticsystem.3 +Neurodiagnoses develops a **tridimensional diagnostic framework** for **CNS diseases**, incorporating **AI-powered annotation tools** to improve **interpretability, standardization, and clinical utility.** 4 4 5 +This methodology integrates **multi-modal data**, including: 6 +**Genetic data** (whole-genome sequencing, polygenic risk scores). 7 +**Neuroimaging** (MRI, PET, EEG, MEG). 8 +**Neurophysiological data** (EEG-based biomarkers, sleep actigraphy). 9 +**CSF & Blood Biomarkers** (Amyloid-beta, Tau, Neurofilament Light). 10 + 11 +By applying **machine learning models**, Neurodiagnoses generates **structured, explainable diagnostic outputs** to assist **clinical decision-making** and **biomarker-driven patient stratification.** 12 + 5 5 ---- 6 6 7 -== =**1.Data Integration** ===15 +== **Data Integration & External Databases** == 8 8 9 -=== =**DataSources** ====17 +=== **How to Use External Databases in Neurodiagnoses** === 10 10 11 -* **Biomedical Ontologies**: 12 -** Human Phenotype Ontology (HPO) for phenotypic abnormalities. 13 -** Gene Ontology (GO) for molecular and cellular processes. 14 -* **Neuroimaging Datasets**: 15 -** Example: Alzheimer’s Disease Neuroimaging Initiative (ADNI), OpenNeuro. 16 -* **Clinical and Biomarker Data**: 17 -** Anonymized clinical reports, molecular biomarkers, and test results. 19 +Neurodiagnoses integrates data from multiple **biomedical and neurological research databases**. Researchers can follow these steps to **access, prepare, and integrate** data into the Neurodiagnoses framework. 18 18 21 +**Potential Data Sources** 22 +**Reference:** [[List of Potential Databases>>url:https://github.com/Fundacion-de-Neurociencias/neurodiagnoses/blob/main/data/sources/list_of_potential_databases]] 19 19 20 -=== =**Data Preprocessing** ====24 +=== **Register for Access** === 21 21 22 - 1.**Standardization**:Ensure alldata sourcesare normalizedto acommonformat.23 - 1.**FeatureSelection**: Identify relevantfeaturesfor diagnosis (e.g., biomarkers,imagingscores).24 - 1.**DataCleaning**: Handlemissingvalues and removeduplicates.26 +Each **external database** requires **individual registration** and approval. 27 +✔️ Follow the official **data access guidelines** of each provider. 28 +✔️ Ensure compliance with **ethical approvals** and **data-sharing agreements (DUAs).** 25 25 26 - ----30 +=== **Download & Prepare Data** === 27 27 28 - ===**2. AI-BasedAnalysis**===32 +Once access is granted, download datasets **following compliance guidelines** and **format requirements** for integration. 29 29 30 - ====**ModelDevelopment**====34 +**Supported File Formats** 31 31 32 -* ** Embedding Models**: Use pre-trained models like BioBERT orBioLORDfor text data.33 -* ** ClassificationModels**:34 -* *Algorithms:Random Forest, Support Vector Machines (SVM),or neural networks.35 -* *Purpose: Predict thelikelihood of specificneurologicalconditions based oninput data.36 +* **Tabular Data**: .csv, .tsv 37 +* **Neuroimaging Data**: .nii, .dcm 38 +* **Genomic Data**: .fasta, .vcf 39 +* **Clinical Metadata**: .json, .xml 36 36 37 - ====**DimensionalityReductionandInterpretability**====41 +**Mandatory Fields for Integration** 38 38 39 -* Leverage [[DEIBO>>https://drive.ebrains.eu/f/8d7157708cde4b258db0/]] (Data-driven Embedding Interpretation Based on Ontologies) to connect model dimensions to ontology concepts. 40 -* Evaluate interpretability using metrics like the Area Under the Interpretability Curve (AUIC). 43 +|=**Field Name**|=**Description** 44 +|**Subject ID**|Unique patient identifier 45 +|**Diagnosis**|Standardized disease classification 46 +|**Biomarkers**|CSF, plasma, or imaging biomarkers 47 +|**Genetic Data**|Whole-genome or exome sequencing 48 +|**Neuroimaging Metadata**|MRI/PET acquisition parameters 41 41 50 +=== **Upload Data to Neurodiagnoses** === 51 + 52 +**Option 1:** Upload to **EBRAINS Bucket** → [[Neurodiagnoses Data Storage>>url:https://wiki.ebrains.eu/bin/view/Collabs/neurodiagnoses/Bucket]] 53 +**Option 2:** Contribute via **GitHub Repository** → [[GitHub Data Repository>>url:https://github.com/Fundacion-de-Neurociencias/neurodiagnoses/tree/main/data]] 54 + 55 +**For large datasets, please contact project administrators before uploading.** 56 + 57 +=== **Integrate Data into AI Models** === 58 + 59 +Use **Jupyter Notebooks** on EBRAINS for **data preprocessing.** 60 +Standardize data using **harmonization tools.** 61 +Train AI models with **newly integrated datasets.** 62 + 63 +**Reference:** [[Data Processing Guide>>url:https://github.com/Fundacion-de-Neurociencias/neurodiagnoses/blob/main/docs/data_processing.md]] 64 + 42 42 ---- 43 43 44 -== =**3.DiagnosticFramework** ===67 +== **AI-Powered Annotation & Machine Learning Models** == 45 45 46 - ====**Axes ofDiagnosis**====69 +Neurodiagnoses applies **advanced machine learning models** to classify CNS diseases, extract features from **biomarkers and neuroimaging**, and provide **AI-powered annotation.** 47 47 48 - Theframeworkorganizesdiagnostic data into three axes:71 +=== **AI Model Categories** === 49 49 50 -1. **Etiology**: Genetic and environmental risk factors. 51 -1. **Molecular Markers**: Biomarkers such as amyloid-beta, tau, and alpha-synuclein. 52 -1. **Neuroanatomical Correlations**: Results from neuroimaging (e.g., MRI, PET). 73 +|=**Model Type**|=**Function**|=**Example Algorithms** 74 +|**Probabilistic Diagnosis**|Assigns probability scores to multiple CNS disorders.|Random Forest, XGBoost, Bayesian Networks 75 +|**Tridimensional Diagnosis**|Classifies disorders based on Etiology, Biomarkers, and Neuroanatomical Correlations.|CNNs, Transformers, Autoencoders 76 +|**Biomarker Prediction**|Predicts missing biomarker values using regression.|KNN Imputation, Bayesian Estimation 77 +|**Neuroimaging Feature Extraction**|Extracts patterns from MRI, PET, EEG.|CNNs, Graph Neural Networks 78 +|**Clinical Decision Support**|Generates AI-driven diagnostic reports.|SHAP Explainability Tools 53 53 54 - ====**RecommendationSystem** ====80 +**Reference:** [[AI Model Documentation>>url:https://github.com/Fundacion-de-Neurociencias/neurodiagnoses/blob/main/docs/models.md]] 55 55 56 -* Suggests additional tests or biomarkers if gaps are detected in the data. 57 -* Prioritizes tests based on clinical impact and cost-effectiveness. 82 +---- 58 58 84 +== **Clinical Decision Support & Tridimensional Diagnostic Framework** == 85 + 86 +Neurodiagnoses generates **structured AI reports** for clinicians, combining: 87 + 88 +**Probabilistic Diagnosis:** AI-generated ranking of potential diagnoses. 89 +**Tridimensional Classification:** Standardized diagnostic reports based on: 90 + 91 +1. **Axis 1:** **Etiology** → Genetic, Autoimmune, Prion, Toxic, Vascular. 92 +1. **Axis 2:** **Molecular Markers** → CSF, Neuroinflammation, EEG biomarkers. 93 +1. **Axis 3:** **Neuroanatomoclinical Correlations** → MRI atrophy, PET. 94 + 95 +**Reference:** [[Tridimensional Classification Guide>>url:https://github.com/Fundacion-de-Neurociencias/neurodiagnoses/blob/main/docs/classification.md]] 96 + 59 59 ---- 60 60 61 -== =**4.ComputationalWorkflow** ===99 +== **Data Security, Compliance & Federated Learning** == 62 62 63 -1. **Data Loading**: Import data from storage (Drive or Bucket). 64 -1. **Feature Engineering**: Generate derived features from the raw data. 65 -1. **Model Training**: 66 -1*. Split data into training, validation, and test sets. 67 -1*. Train models with cross-validation to ensure robustness. 68 -1. **Evaluation**: 69 -1*. Metrics: Accuracy, F1-Score, AUIC for interpretability. 70 -1*. Compare against baseline models and domain benchmarks. 101 +✔ **Privacy-Preserving AI**: Implements **Federated Learning**, ensuring that patient data **never leaves** local institutions. 102 +✔ **Secure Data Access**: Data remains **stored in EBRAINS MIP servers** using **differential privacy techniques.** 103 +✔ **Ethical & GDPR Compliance**: Data-sharing agreements **must be signed** before use. 71 71 105 +**Reference:** [[Data Protection & Federated Learning>>url:https://github.com/Fundacion-de-Neurociencias/neurodiagnoses/blob/main/docs/security.md]] 106 + 72 72 ---- 73 73 74 -== =**5. Validation** ===109 +== **Data Processing & Integration with Clinica.Run** == 75 75 76 - ====**InternalValidation**====111 +Neurodiagnoses now supports **Clinica.Run**, an **open-source neuroimaging platform** for **multimodal data processing.** 77 77 78 -* Test the system using simulated datasets and known clinical cases. 79 -* Fine-tune models based on validation results. 113 +=== **How It Works** === 80 80 81 -==== **External Validation** ==== 115 +✔ **Neuroimaging Preprocessing**: MRI, PET, EEG data is preprocessed using **Clinica.Run pipelines.** 116 +✔ **Automated Biomarker Extraction**: Extracts volumetric, metabolic, and functional biomarkers. 117 +✔ **Data Security & Compliance**: Clinica.Run is **GDPR & HIPAA-compliant.** 82 82 83 -* Collaborate with research institutions and hospitals to test the system in real-world settings. 84 -* Use anonymized patient data to ensure privacy compliance. 119 +=== **Implementation Steps** === 85 85 121 +1. Install **Clinica.Run** dependencies. 122 +1. Configure **Clinica.Run pipeline** in clinica_run_config.json. 123 +1. Run **biomarker extraction pipelines** for AI-based diagnostics. 124 + 125 +**Reference:** [[Clinica.Run Documentation>>url:https://github.com/Fundacion-de-Neurociencias/neurodiagnoses/blob/main/docs/clinica_run.md]] 126 + 86 86 ---- 87 87 88 -== =**6.Collaborative Development** ===129 +== **Collaborative Development & Research** == 89 89 90 - Theproject isopentocontributionsfrom researchers, clinicians,anddevelopers.Keytoolsinclude:131 +**We Use GitHub to Develop AI Models & Store Research Data** 91 91 92 -* **Jupyter Notebooks**: For data analysis and pipeline development. 93 -** Example: [[probabilistic imputation>>https://drive.ebrains.eu/f/4f69ab52f7734ef48217/]] 94 -* **Wiki Pages**: For documenting methods and results. 95 -* **Drive and Bucket**: For sharing code, data, and outputs. 96 -* **Collaboration with related projects: **For instance: [[//Beyond the hype: AI in dementia – from early risk detection to disease treatment//>>https://www.lethe-project.eu/beyond-the-hype-ai-in-dementia-from-early-risk-detection-to-disease-treatment/]] 133 +* **GitHub Repository:** AI model training scripts. 134 +* **GitHub Issues:** Tracks ongoing research questions. 135 +* **GitHub Wiki:** Project documentation & user guides. 97 97 137 +**We Use EBRAINS for Data & Collaboration** 138 + 139 +* **EBRAINS Buckets:** Large-scale neuroimaging and biomarker storage. 140 +* **EBRAINS Jupyter Notebooks:** Cloud-based AI model execution. 141 +* **EBRAINS Wiki:** Research documentation and updates. 142 + 143 +**Join the Project Forum:** [[GitHub Discussions>>url:https://github.com/Fundacion-de-Neurociencias/neurodiagnoses/discussions]] 144 + 98 98 ---- 99 99 100 - ===**7. ToolsandTechnologies**===147 +**For Additional Documentation:** 101 101 102 -* **Programming Languages**: Python for AI and data processing. 103 -* **Frameworks**: 104 -** TensorFlow and PyTorch for machine learning. 105 -** Flask or FastAPI for backend services. 106 -* **Visualization**: Plotly and Matplotlib for interactive and static visualizations. 107 -* **EBRAINS Services**: 108 -** Collaboratory Lab for running Notebooks. 109 -** Buckets for storing large datasets. 149 +* **GitHub Repository:** [[Neurodiagnoses AI Models>>url:https://github.com/Fundacion-de-Neurociencias/neurodiagnoses]] 150 +* **EBRAINS Wiki:** [[Neurodiagnoses Research Collaboration>>url:https://wiki.ebrains.eu/bin/view/Collabs/neurodiagnoses/]] 151 + 152 +---- 153 + 154 +**Neurodiagnoses is Open for Contributions – Join Us Today!**