Changes for page SGA2 SP3 UC002 KR3.2 - Slow Wave Analysis Pipeline
                  Last modified by robing on 2022/03/25 09:55
              
      Summary
- 
          Page properties (1 modified, 0 added, 0 removed)
- 
          Objects (1 modified, 0 added, 0 removed)
Details
- Page properties
- 
      - Content
-   ... ... @@ -2,36 +2,24 @@ 2 2 ((( 3 3 (% class="container" %) 4 4 ((( 5 -= (% style="--darkreader-inline-color:inherit; color:inherit" %)Slow Wave Analysis Pipeline SWAP)(%%) =5 += (% style="--darkreader-inline-color:inherit; color:inherit" %)Slow Wave Analysis Pipeline(%%) = 6 6 7 7 (% class="wikigeneratedid" id="HUseCaseSGA2-SP3-002:IntegratingmultiscaledataA0inareproducibleandadaptablepipeline" %) 8 -(% style="--darkreader-inline-color:inherit; color:inherit; font-size:24px" %)**Use Case SGA2-SP3-002 KR3.2: Integrating multi-scale dataand the output of simulationsin a reproducible and adaptable pipeline**8 +(% style="--darkreader-inline-color:inherit; color:inherit; font-size:24px" %)**Use Case SGA2-SP3-002: Integrating multi-scale data in a reproducible and adaptable pipeline** 9 9 10 - RobinGutzen^^1^^,Giulia De Bonis^^2^^, ElenaPastorelli^^2,3^^,CristianoCapone^^2^^,10 +To be discussed Author orders, contributions,...: 11 11 12 - Chiara DeLuca^^2,3^^, Michael Denker^^1^^,Sonja Grün^^1,4^^,12 +Experiments: ...? 13 13 14 - Pier StanislaoPaolucci^^2^^,AndrewDavison^^5^^14 +Implementation: Robin Gutzen^^1^^, Elena Pastorelli^^2^^, ... 15 15 16 - Experiments:Anna LetiziaAllegra Mascaro^^6,7^^,FrancescoResta^^6^^,FrancescoSaverio Pavone^^6^^,Maria-VictoriaSanchez-Vives^^8,9^^16 +Lead: Michael Denker^^1^^, Sonja Grün^^1^^, Pier Stanislao Paolucci^^2^^, Andrew Davison? 17 17 18 18 ,,1) Institute of Neuroscience and Medicine (INM-6) and Institute for Advanced Simulation (IAS-6) and JARA-Institute Brain Structure-Function Relationships (INM-10), Jülich Research Centre, Jülich, Germany,, 19 19 20 -,,2) IstitutoNazionaledi FisicaNucleare(INFN),Sezione di Roma,Rome,Italy,,20 +,,2) Dipartimento di Fisica, Università di Cagliari and INFN Sezione di Roma, Italy,, 21 21 22 -,,3) Ph.D. Program in Behavioural Neuroscience, “Sapienza” University of Rome, Rome, Italy,, 23 - 24 -,,4) Theoretical Systems Neurobiology, RWTH Aachen University, Aachen, Germany,, 25 - 26 -,,5) Unité de Neurosciences, Information et Complexité, Neuroinformatics Group, CNRS FRE 3693, Gif-sur-Yvette, France,, 27 - 28 -,,6) European Laboratory for Non-linear Spectroscopy (LENS), (% style="--darkreader-inline-color:inherit; color:inherit" %)University of Florence, Florence, Italy(%%),, 29 - 30 -,,7) Istituto di Neuroscienze, CNR, Pisa, Italy,, 31 - 32 -,,8) Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain,, 33 - 34 -,,9) Institució Catalana de Recerca i Estudis Avanc ̨ats (ICREA), Barcelona, Spain,, 22 + 35 35 ))) 36 36 ))) 37 37 ... ... @@ -41,12 +41,12 @@ 41 41 ((( 42 42 == Flexible workflows to generate multi-scale analysis scenarios == 43 43 44 -This collabllustratesthe usageofthe[[Neo>>https://neo.readthedocs.io/en/stable/]]and[[Elephant>>https://elephant.readthedocs.io/en/latest/]] toolsinperforming data analysiswith regardstothe SGA2-SP3-UC002 KR3.2,investigating sleep, anesthesia, andthe transition to wakefulness: see Chapter1 and Figure 2 of SGA2[[ Deliverable D3.2.1.>>https://drive.ebrains.eu/smart-link/17ac0d6e-e050-4a49-8ca2-e223b70a3121/]],for an overview of thescientific motivationsanda descriptionoftheUseCase workflow; Chapter 2 (samedocument) for anintroductionto KR3.2; Chapter3,fora descriptionof the mice ECoGdatasets; Chapter 5, about the Slow Wave AnalysisPipeline and Chapter 6 for the mice wide-fieldGECI data). For detailsonthedatasets used in this collab, please see the References below.32 +This Collab is aimed at experimental and computational neuroscientists interested in the usage of the Neo and Elephant tools in performing data analysis of spiking data. 45 45 46 -== How the pipeline works ==34 +== How the Pipeline works == 47 47 48 -The design of the pipeline aims at interfacing a variety of general and specific analysis and processing steps in a flexible modular manner. Hence, it enables the pipeline to adapt to diverse types of data (e. g.,electrical ECoG, or optical Calcium Imaging recordings) and to different analysis questions. This makes the analyses a) more reproducible and b) comparable amongst each other since they rely on the same stack of algorithms and any differences in the analysis are fully transparent.49 -The individual processing and analysis steps ( **blocks**//, //see// //the arrow-connected elements below) are organized in sequential**stages**//(//see the columns below//). //Following along the stages,the analysis becomes more specific but also allows to branch off at after any stage,as each stage yields useful intermediate resultsandis autonomous so that it can be reused and recombined. Within each stage, there is a collection of blocks from which the user can select and arrange the analysis via a config file. Thus, the pipeline can be thought of as a curated database of methods on which an analysis can be constructed by drawing a path along the blocks and stages.36 +The design of the pipeline aims at interfacing a variety of general and specific analysis and processing steps in a flexible modular manner. Hence, it enables the pipeline to adapt to diverse types of data (e.b. electrical EEG, or optical Calcium Imaging recordings) and to different analysis questions. This makes the analyses a) more reproducible and b) comparable amongst each other since they rely on the same stack of algorithms and any differences in the analysis are fully transparent. 37 +The individual processing and analysis steps (//blocks, //see// //the arrow-connected elements below) are organized in sequential //stages (//see the columns below//). //Following along the stages the analysis becomes more specific but also allows to branch off at after any stage as each stage yields useful intermediate results is autonomous so that it can be reused and recombined. Within each stage, there is a collection of blocks from which the user can select and arrange the analysis via a config file. Thus, the pipeline can be thought of as a curated database of methods on which an analysis can be constructed by drawing a path along the blocks and stages. 50 50 51 51 (% class="wikigeneratedid" id="H" %) 52 52 [[image:pipeline_flowchart.png]] ... ... @@ -53,111 +53,78 @@ 53 53 54 54 == Executing the pipeline == 55 55 56 - Thereare two ways of gettingstarted and testing thepipeline, i) online using thecollabdrive and jupyter hub, or ii) downloading thecode and datafrom GitHub and the collab storage and running it locally.44 +=== In the collab (beta) === 57 57 58 -=== i) In the collab === 59 - 60 60 * ((( 61 -**Copy the collab drive to your personaldrive space**47 +**Copy the collab drive to your drive space** 62 62 63 -* Open the Drive from the left menu 64 -* Select the folders //pipeline// and //datasets,// 65 -and the notebook// run_snakemake_in_collab.ipynb// 49 +* 50 + 51 +Open the Drive from the left menu 52 + 53 +* Select the folders 'pipeline' and 'datasets' 66 66 * Select 'Copy', and then 'My Library' from the dropdown 'Other Libraries' 67 67 68 68 ))) 69 69 * **Start a Jupyter Hub instance ** 70 -In another browser tab, open [[https:~~/~~/lab.ebrains.eu>>https://lab.ebrains.eu]]71 - 58 +In another browser, open [[https:~~/~~/lab.ebrains.eu>>https://lab.ebrains.eu]] 59 + 72 72 * **Edit the config files** 73 -Each stage has config files (//pipeline/<stage_name>/configs/config_<profile>.yaml//) to specify which analysis/processing blocks to execute and which parameters to use. General and specific information about the blocks and parameters can be found in the README and config files of each stage. There are preset configuration profiles for the benchmark datasets IDIBAPS ([[ECoG, anesthetized mouse>>https://kg.ebrains.eu/search/?facet_type[0]=Dataset&q=sanchez-vives#Dataset/2ead029b-bba5-4611-b957-bb6feb631396]]) and LENS ([[Calcium Imaging, anesthetized mouse>>https://kg.ebrains.eu/search/instances/Dataset/71285966-8381-48f7-bd4d-f7a66afa9d79]]). 61 +Each stage has a config file to specify which analysis/processing blocks to execute and which parameters to use. General and specific information about the blocks and parameters can found in the README and config files. The default values are set for an example dataset (ECoG, anesthetized mouse, IDIBAPS [ref]). 62 +** stage01_data_entry: [README.md](), [config.yaml]() 63 +** stage02_preprocessing: [README.md](), [config.yaml]() 64 +** stage03_trigger_detection: [README.md](), [config.yaml]() 65 +** stage04_wavefront_detection: [README.md](), [config.yaml]() 66 +** stage05_wave_characterization: [README.md](), [config.yaml]() 74 74 75 75 * **Run the notebook** 76 -In the jupyter hub, navigate to //drive/My Libraries/My Library/pipeline/showcase_notebooks/run_snakemake_in_collab.ipynb//, or where you copied the //pipeline// folder to. 77 -Follow the notebook to install the required packages into your Python kernel, set the output path, and execute the pipeline with snakemake 69 +In the jupyter hub, navigate to `drive/My Libraries/My Library/pipeline/showcase_notebooks/run_snakemake_in_collab.ipynb`, or where you copied the 'pipeline' folder to. 70 +* Follow the notebook to install the required packages into your Python kernel, set the output path, and execute the pipeline with snakemake. 71 + 72 +* **Coming soon** 73 +** Use of KnowledgeGraph API 74 +** Provenance Tracking 75 +** HPC support 78 78 79 -=== ii)Local execution ===77 +=== Local execution === 80 80 81 -//tested only with Linux!// 82 - 83 83 * **Get the code** 84 -The source code of the pipeline is available via Github: [ [INM-6/wavescalephant>>https://github.com/INM-6/wavescalephant]]and can be cloned to your machine ([[how togetstartedwithGithub>>https://guides.github.com/activities/hello-world/]]).80 +The source code of the pipeline is available via Github: [INM-6/wavescalephant]('https:~/~/github.com/INM-6/wavescalephant') and can be cloned to your machine ([how to use Github]()). 85 85 86 -* ((( 87 -**Build the Python environment** 88 -In the wavescalephant git repository, there is an environment file ([[pipeline/environment.yaml>>https://drive.ebrains.eu/smart-link/1a0b15bb-be87-46ee-b838-4734bc320d20/]]) specifying the required packages and versions. To build the environment, we recommend using conda ([[how to get started with conda>>https://docs.conda.io/projects/conda/en/latest/user-guide/getting-started.html]]). 89 -##conda env create ~-~-file environment.yaml 90 -conda activate wavescalephant_env## 82 +* **Build the Python environment** 83 +In the wavescalephant repository, there is an environment file (`pipeline/envs/wavescalephant_env.yaml`) specifying the required packages and versions. To build the environment, we recommend using *conda* ([how to get started with conda]()). 84 +`conda env create ~-~-file /envs/wavescalephant_env.yml`. 91 91 92 -))) 93 93 * **Edit the settings** 94 -The settings file specifies the path to the output folder, where results are saved to. Open the template file //[[pipeline/settings_template.py>>https://drive.ebrains.eu/lib/905d7321-a16b-4147-8cca-31d710d1f946/file/pipeline/settings_template.py]]//, set the##output_path##to the desired path, and save it as//pipeline/settings.py//.87 +The settings file specifies the path to the output folder, where results are saved to. Open the template file `pipeline/settings_template.py`, set the `output_path` to the desired path, and save it as `pipeline/settings.py`. 95 95 96 96 * **Edit the config files** 97 -Each stage uses a config file to specify which analysis/processing blocks to execute and which parameters to use. Edit the config template files //pipeline/stageXX_<stage_name>/configs/config_template.yaml// according to your dataset and analysis goal, and save them as //pipeline/stageXX_<stage_name>/configs/config_<profile>.yaml//. A detailed description of the available parameter settings and their meaning is commented in the template files, and a more general description of the working mechanism of each stage can be found in the respective README file //pipeline/stageXX_<stage_name>/README.md//. 98 -//Links are view-only// 99 -** full pipeline: [[README.md>>https://drive.ebrains.eu/smart-link/d2e93a2a-09f6-4dce-982d-0370953a4da8/]], [[config.yaml>>https://drive.ebrains.eu/smart-link/7948fbb3-bf8a-4785-9b28-d5c15a1aafa7/]] 100 -** stage01_data_entry: [[README.md>>https://drive.ebrains.eu/smart-link/896f8880-a7d1-4a30-adbf-98759860fed5/]], [[config.yaml>>https://drive.ebrains.eu/smart-link/9bef8f59-1007-48c4-b5ba-30de4ff18f34/]] 101 -** stage02_processing: [[README.md>>https://drive.ebrains.eu/smart-link/01f21fa5-94f7-4883-8388-cc50957f9c81/]], [[config.yaml>>https://drive.ebrains.eu/smart-link/7e75caf6-e2d6-4393-a97c-4f481c908cf8/]] 102 -** stage03_trigger_detection: [[README.md>>https://drive.ebrains.eu/smart-link/18d276cd-a691-4ee1-81c6-7978cef9c1b4/]], [[config.yaml>>https://drive.ebrains.eu/smart-link/dfa375c0-cc80-4f95-b3ed-40140acbd96b/]] 103 -** stage04_wavefront_detection: [[README.md>>https://drive.ebrains.eu/smart-link/a8e80096-06a0-4ff4-b645-90e134e46ac5/]], [[config.yaml>>https://drive.ebrains.eu/smart-link/3a54be8c-b9f4-4698-a85d-6ad97990b40a/]] 104 -** stage05_wave_characterization: [[README.md>>https://drive.ebrains.eu/smart-link/3009a214-a11f-424c-8a6e-13e7506545eb/]], [[config.yaml>>https://drive.ebrains.eu/smart-link/83f68955-0ca8-4123-9734-6e93349ca3e3/]] 90 +Each stage has a config file to specify which analysis/processing blocks to execute and which parameters to use. Edit the config template files `pipeline/stageXX_<stage_name>/config_template.yaml` according to your dataset and analysis goal, and save them as `pipeline/stageXX_<stage_name>/config.yaml`. A detailed description of the available parameter settings and their meaning is commented in the template files, and a more general description of the working mechanism of each stage can be found in the respective README file `pipeline/stageXX_<stage_name>/README.md`. 105 105 106 106 * **Enter a dataset** 107 -There are two test datasets in the collab drive (IDIBAPS and LENS) for which there are also corresponding config files and scripts in the data_entry stage. So, these datasets are ready to be used and analyzed. 108 -For adding new datasets see //[[pipeline/stage01_data_entry/README.md>>https://drive.ebrains.eu/smart-link/d2e93a2a-09f6-4dce-982d-0370953a4da8/]]// 93 +see `pipeline/stage01_data_entry/README.md` 109 109 110 110 * **Run the pipeline (-stages)** 111 -To run the pipeline with [[snakemake>>https://snakemake.readthedocs.io/en/stable/]]),activate the Python environment##conda activate wavescalephant_env,##make sure you are in the working directory(//pipeline/)//, and call##snakemake##to run the entire pipeline.112 - Fora moredetailedexecuted guide andhowtoexecute individual stagesandblocksseethe pipeline[[Readme>>https://drive.ebrains.eu/smart-link/3009a214-a11f-424c-8a6e-13e7506545eb/]].96 +To run the pipeline with snakemake ([intro to snakemake]()) activate the Python environment `conda activate wavescalephant_env`, make sure you are in the working directory `pipeline/`, and call `snakemake` to run the entire pipeline. 97 +To (re-)execute an individual stage, you can navigate to the corresponding stage folder and call the `snakemake` command there. For running an individual stage, you may need to manually set the path for input file for the stage (i.e. the output file of the previous stage) in the config file `INPUT: /path/to/file`. 113 113 114 114 == Accessing and using the results == 115 115 116 -All results are stored in the path specified in the //settings.py// file. The folder structure reflects the structuring of the pipeline into stages and blocks. All intermediate results are stored as //.nix// files using the [[Neo data format>>https://neo.readthedocs.io/en/stable/]] and can be loaded with ##neo.NixIO('/path/to/file.nix').read_block()##. Additionally, most blocks produce a figure, and each stage a report file, to give an overview of the execution log, parameters, intermediate results, and to help with debugging. The final stage (//stage05_wave_characterization//) stores the results as[[ //pandas.DataFrames//>>https://pandas.pydata.org/]] in //.csv// files, separately for each measure as well as in a combined dataframe for all measures. 101 +All results are stored in the path specified in the `settings.py` file. The folder structure reflects the structuring of the pipeline into stages and blocks. All intermediate results are stored as `.nix` files using the Neo data format ([Neo]()) and can be loaded with `neo.NixIO('/path/to/file.nix').read_block()` ([documentation]()). 102 +Additionally, most blocks produce a figure, and each stage a report file, to give an overview of the execution log, parameters, intermediate results, and to help with debugging. 103 +The final stage (*stage05_wave_characterization*) stores the results as pandas.DataFrames ([pandas]()) in `.csv` files, separately for each measure as well as in a combined dataframe for all measures. 117 117 118 -**Examples of the output figures (for IDIBAPS dataset)** 119 - 120 -* Stage 01 - [[example signal traces and metadata>>https://drive.ebrains.eu/smart-link/cf2fa914-260d-4d61-a2da-03ea07b7f9be/]] 121 -* Stage 02 - [[background substraction>>https://drive.ebrains.eu/smart-link/586d2f3c-591b-4dfb-94ee-8c0e28050dc4/]] 122 -* Stage 02 - [[logMUA estimation>>https://drive.ebrains.eu/smart-link/c92e4b0c-0938-44e8-9f8d-00522796b2fd/]] 123 -* Stage 02 - [[processed signal trace>>https://drive.ebrains.eu/smart-link/26ed27c6-de56-4b48-a57b-f70aab629197/]] 124 -* Stage 03 - [[amplitude distribution>>https://drive.ebrains.eu/smart-link/8ba80293-ba75-4a37-8a8f-05d44cf6f65c/]] 125 -* Stage 03 - [[UP state detection>>https://drive.ebrains.eu/smart-link/ab172be0-178e-4153-a3e6-b4bace32dd50/]] 126 -* Stage 04 - [[trigger clustering>>https://drive.ebrains.eu/smart-link/4a1f0169-8b43-49ce-80c8-f2fa0f4d50d3/]] 127 -* Stage 05 - [[planar velocities>>https://drive.ebrains.eu/smart-link/f4de8073-cb40-47a7-bc82-f97d36dbae25/]] 128 -* Stage 05 - [[directionality>>https://drive.ebrains.eu/smart-link/5485032d-0121-4cde-9ea2-3e0af3f12178/]] 129 - 130 -=== Outlook === 131 - 132 -* Using the **KnowledgeGraph API **to insert data directly from the Knowledge Graph into the pipeline and also register and store the corresponding results as Analysis Objects. Such Analysis Objects are to incorporate **Provenance Tracking, **using [[fairgraph>>https://github.com/HumanBrainProject/fairgraph]],** **to record the details of the processing and analysis steps. 133 -* Adding support for the pipeline to make use of **HPC** resources when running on the collab. 134 -* Further extending the available **methods** to address a wider variety of analysis objectives and support the processing of other datatypes. Additional documentation and guides should also make it easier for non-developers to contribute new method blocks. 135 -* Extending the **application** of the pipeline to the analysis of other types of activity waves and oscillations. 136 -* Integrating and co-developing new features of the underlying **software tools **[[Elephant>>https://elephant.readthedocs.io/en/latest/]], [[Neo>>https://neo.readthedocs.io/en/stable/]], [[Nix>>https://github.com/G-Node/nix]], [[Snakemake>>https://snakemake.readthedocs.io/en/stable/]]. 137 - 138 138 == References == 139 139 140 -* [[Celotto, Marco, et al. "Analysis and Model of Cortical Slow Waves Acquired with Optical Techniques." //Methods and Protocols// 3.1 (2020): 14.>>https://doi.org/10.3390/mps3010014]] 141 -* [[De Bonis, Giulia, et al. "Analysis pipeline for extracting features of cortical slow oscillations." //Frontiers in Systems Neuroscience// 13 (2019): 70.>>https://doi.org/10.3389/fnsys.2019.00070]] 142 -* [[Resta, F., Allegra Mascaro, A. L., & Pavone, F. (2020). "Study of Slow Waves (SWs) propagation through wide-field calcium imaging of the right cortical hemisphere of GCaMP6f mice" //EBRAINS//>>https://doi.org/10.25493/3E6Y-E8G]]// // 143 -* [[Sanchez-Vives, M. (2020). "Propagation modes of slow waves in mouse cortex". //EBRAINS//>>https://doi.org/10.25493/WKA8-Q4T]] 144 -* [[Sanchez-Vives, M. (2019). "Cortical activity features in transgenic mouse models of cognitive deficits (Fragile X Syndrome).//" EBRAINS//>>https://doi.org/10.25493/ANF9-EG3]] 145 145 146 - Codedevelopedat:108 +== License (to discuss) == 147 147 148 - [[image:https://github.githubassets.com/images/modules/logos_page/GitHub-Mark.png||height="35"width="35"]][[INM-6/wavescalephant>>https://github.com/INM-6/wavescalephant]]110 +All text and example data in this collab is licensed under Creative Commons CC-BY 4.0 license. Software code is licensed under a modified BSD license. 149 149 150 -== License == 151 - 152 -Text is licensed under the Creative Commons CC-BY 4.0 license. LENS data is licensed under the Creative Commons CC-BY-NC-ND 4.0 license. IDIBAPS data is licensed under the Creative Commons CC-BY-NC-SA 4.0 license. Software code is licensed under GNU General Public License v3.0. 153 - 154 154 [[image:https://i.creativecommons.org/l/by/4.0/88x31.png||style="float:left"]] 155 155 156 - [[image:https://licensebuttons.net/l/by-nc-sa/4.0/88x31.png||alt="https://i.creativecommons.org/l/by/4.0/88x31.png"style="float:left"]]114 +== == 157 157 158 -[[image:https://licensebuttons.net/l/by-nc-nd/4.0/88x31.png||alt="https://i.creativecommons.org/l/by/4.0/88x31.png" style="float:left"]] 159 - 160 - 161 161 == Acknowledgments == 162 162 163 163 This open source software code was developed in part or in whole in the Human Brain Project, funded from the European Union’s Horizon 2020 Framework Programme for Research and Innovation under the Specific Grant Agreement No. 785907 (Human Brain Project SGA2). ... ... @@ -167,12 +167,12 @@ 167 167 ))) 168 168 169 169 170 -== == 125 +== Executing the pipeline == 171 171 172 172 (% class="col-xs-12 col-sm-4" %) 173 173 ((( 174 174 {{box title="**Contents**"}} 175 -{{toc depth="3"/}}130 +{{toc/}} 176 176 {{/box}} 177 177 178 178 
 
- Collaboratory.Apps.Collab.Code.CollabClass[0]
-   - Description
-   ... ... @@ -1,1 +1,1 @@ 1 - Use CaseSGA2-SP3-002KR3.2: Integratingmulti-scaledata and the outputofsimulationsin areproducible and adaptablepipeline1 +Space for developing and hosting a showcase pipeline for performing reproducible and adaptable analysis with the focus of slow cortical waves.