Attention: The EBRAINS drive will be unavailable for most of the weekend starting the 25th October. Although the Lab is availble while the Drive is down, files that are stored in the Drive will not be loaded and you will be unable to save documents directly on the Lab.


Last modified by robing on 2022/03/25 09:55

From version 14.1
edited by robing
on 2020/01/20 13:41
Change comment: There is no comment for this version
To version 50.1
edited by debonisg
on 2020/04/24 12:58
Change comment: There is no comment for this version

Summary

Details

Page properties
Title
... ... @@ -1,1 +1,1 @@
1 -Slow Wave Analysis Pipeline
1 +SGA2 SP3 UC002 KR3.2 - Slow Wave Analysis Pipeline
Author
... ... @@ -1,1 +1,1 @@
1 -XWiki.robing
1 +XWiki.debonisg
Content
... ... @@ -2,9 +2,34 @@
2 2  (((
3 3  (% class="container" %)
4 4  (((
5 -= (% style="color:inherit" %)Slow Wave Analysis Pipeline(%%) =
5 += (% style="--darkreader-inline-color:inherit; color:inherit" %)Slow Wave Analysis Pipeline (SWAP)(%%) =
6 6  
7 -= (% style="color:inherit; font-size:24px" %)Integrating multiscale data in a reproducible and adaptable pipeline(%%) =
7 +(% class="wikigeneratedid" id="HUseCaseSGA2-SP3-002:IntegratingmultiscaledataA0inareproducibleandadaptablepipeline" %)
8 +(% style="--darkreader-inline-color:inherit; color:inherit; font-size:24px" %)**Use Case SGA2-SP3-002 KR3.2: Integrating multi-scale data and the output of simulations in a reproducible and adaptable pipeline**
9 +
10 +Robin Gutzen^^1^^, Giulia De Bonis^^2^^, Elena Pastorelli^^2,3^^, Cristiano Capone^^2^^,
11 +
12 +Chiara De Luca^^2,3^^, Michael Denker^^1^^, Sonja Grün^^1^^,
13 +
14 +Pier Stanislao Paolucci^^2^^, Andrew Davison^^4^^
15 +
16 +Experiments: Anna Letizia Allegra Mascaro^^5,6^^, Francesco Resta^^5^^, Francesco Saverio Pavone^^5^^, Maria-Victoria Sanchez-Vives^^7,8^^
17 +
18 +,,1) Institute of Neuroscience and Medicine (INM-6) and Institute for Advanced Simulation (IAS-6) and JARA-Institute Brain Structure-Function Relationships (INM-10), Jülich Research Centre, Jülich, Germany,,
19 +
20 +,,2) Istituto Nazionale di Fisica Nucleare (INFN), Sezione di Roma, Rome, Italy,,
21 +
22 +,,3) Ph.D. Program in Behavioural Neuroscience, “Sapienza” University of Rome, Rome, Italy,,
23 +
24 +,,4) Unité de Neurosciences, Information et Complexité, Neuroinformatics Group, CNRS FRE 3693, Gif-sur-Yvette, France,,
25 +
26 +,,5) European Laboratory for Non-linear Spectroscopy (LENS), (% style="color:inherit" %)University of Florence, Florence, Italy(%%),,
27 +
28 +,,6) Istituto di Neuroscienze, CNR, Pisa, Italy,,
29 +
30 +,,7) Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain,,
31 +
32 +,,8) Institució Catalana de Recerca i Estudis Avanc ̨ats (ICREA), Barcelona, Spain,,
8 8  )))
9 9  )))
10 10  
... ... @@ -12,77 +12,107 @@
12 12  (((
13 13  (% class="col-xs-12 col-sm-8" %)
14 14  (((
15 -= What can I find here? =
40 +== Flexible workflows to generate multi-scale analysis scenarios ==
16 16  
17 -...
42 +This Collab is aimed at experimental and computational neuroscientists interested in the usage of the [[Neo>>https://neo.readthedocs.io/en/stable/]] and [[Elephant>>https://elephant.readthedocs.io/en/latest/]] tools in performing data analysis of spiking data.
43 +Here, the collab illustrates the tool usage with regards to KR3.2, investigating sleep, anesthesia, and the transition to wakefulness.
18 18  
19 -= Who has access? =
45 +== How the Pipeline works ==
20 20  
21 -Describe the audience of this collab.
47 +The design of the pipeline aims at interfacing a variety of general and specific analysis and processing steps in a flexible modular manner. Hence, it enables the pipeline to adapt to diverse types of data (e.g., electrical ECoG, or optical Calcium Imaging recordings) and to different analysis questions. This makes the analyses a) more reproducible and b) comparable amongst each other since they rely on the same stack of algorithms and any differences in the analysis are fully transparent.
48 +The individual processing and analysis steps (**blocks**//, //see// //the arrow-connected elements below) are organized in sequential **stages**// (//see the columns below//). //Following along the stages the analysis becomes more specific but also allows to branch off at after any stage as each stage yields useful intermediate results is autonomous so that it can be reused and recombined. Within each stage, there is a collection of blocks from which the user can select and arrange the analysis via a config file. Thus, the pipeline can be thought of as a curated database of methods on which an analysis can be constructed by drawing a path along the blocks and stages.
22 22  
50 +(% class="wikigeneratedid" id="H" %)
51 +[[image:pipeline_flowchart.png]]
52 +
23 23  == Executing the pipeline ==
24 24  
25 -[[image:pipeline_flowchart.png]]
55 +There are two ways of getting started and testing the pipeline, i) online using the collab drive and jupyter hub, or ii) downloading the code and data from GitHub and the collab storage and running it locally.
26 26  
27 -=== in the collab (beta) ===
57 +=== i) In the collab ===
28 28  
29 -* **Edit the config files**
30 -Each stage has a config file to specify which analysis/processing blocks to execute and which parameters to use. General and specific information about the blocks and parameters can found in the README and config files. The default values are set for an example dataset (ECoG, anesthetized mouse, IDIBAPS [ref]).
31 -** stage01_data_entry: [README.md](), [config.yaml]()
32 -** stage02_preprocessing: [README.md](), [config.yaml]()
33 -** stage03_trigger_detection: [README.md](), [config.yaml]()
34 -** stage04_wavefront_detection: [README.md](), [config.yaml]()
35 -** stage05_wave_characterization: [README.md](), [config.yaml]()
59 +* (((
60 +**Copy the collab drive to your personal drive space**
61 +
62 +* Open the Drive from the left menu
63 +* Select the folders //pipeline// and //datasets,//
64 +and the notebook// run_snakemake_in_collab.ipynb//
65 +* Select 'Copy', and then 'My Library' from the dropdown 'Other Libraries'
36 36  
37 -* **Start a Jupyter Hub instance**
38 -[[jupyterhub-preview.apps-dev.hbp.eu>>jupyterhub-preview.apps-dev.hbp.eu]]
67 +)))
68 +* **Start a Jupyter Hub instance **
69 +In another browser tab, open [[https:~~/~~/lab.ebrains.eu>>https://lab.ebrains.eu]]
39 39  
40 -* **Follow the notebook**
41 -In the jupyter hub, navigate to `drive/Shared with groups/Slow Wave Analysis Pipeline/pipeline/showcase_notebooks/run_snakemake_in_collab.ipynb`.
71 +* **Edit the config files**
72 +Each stage has a config file (//pipeline/<stage_name>/config.yaml//) to specify which analysis/processing blocks to execute and which parameters to use. General and specific information about the blocks and parameters can found in the README and config files of each stage. The default values are set for an example dataset (ECoG, anesthetized mouse, [[IDIBAPS>>https://kg.ebrains.eu/search/?facet_type[0]=Dataset&q=sanchez-vives#Dataset/2ead029b-bba5-4611-b957-bb6feb631396]]]).
73 +
74 +* **Run the notebook**
75 +In the jupyter hub, navigate to //drive/My Libraries/My Library/pipeline/showcase_notebooks/run_snakemake_in_collab.ipynb//, or where you copied the //pipeline// folder to.
42 42  Follow the notebook to install the required packages into your Python kernel, set the output path, and execute the pipeline with snakemake.
77 +
43 43  * **Coming soon**
44 44  ** Use of KnowledgeGraph API
45 45  ** Provenance Tracking
46 46  ** HPC support
47 47  
48 -=== locally ===
83 +=== ii) Local execution ===
49 49  
50 50  * **Get the code**
51 -The source code of the pipeline is available via Github: [INM-6/wavescalephant]('https:~/~/github.com/INM-6/wavescalephant') and can be cloned to your machine ([how to use Github]()).
86 +The source code of the pipeline is available via Github: [[INM-6/wavescalephant>>https://github.com/INM-6/wavescalephant]] and can be cloned to your machine ([[how to Github>>https://guides.github.com/activities/hello-world/]]).
52 52  
53 -* **Build the Python environment**
54 -In the wavescalephant repository, there is an environment file (`pipeline/envs/wavescalephant_env.yaml`) specifying the required packages and versions. To build the environment, we recommend using *conda* ([how to get started with conda]()).
55 -`conda env create ~-~-file /envs/wavescalephant_env.yaml`.
88 +* (((
89 +**Build the Python environment**
90 +In the wavescalephant git repository, there is an environment file ([[pipeline/envs/wavescalephant_env.yaml>>https://drive.ebrains.eu/f/efe2ecf0874d4402bb11/]]) specifying the required packages and versions. To build the environment, we recommend using conda ([[how to get started with conda>>https://docs.conda.io/projects/conda/en/latest/user-guide/getting-started.html]]).
91 +##conda env create ~-~-file /envs/wavescalephant_env.yml##
56 56  
93 +)))
57 57  * **Edit the settings**
58 -The settings file specifies the path to the output folder, where results are saved to. Open the template file `pipeline/settings_template.py`, set the `output_path` to the desired path, and save it as `pipeline/settings.py`.
95 +The settings file specifies the path to the output folder, where results are saved to. Open the template file //[[pipeline/settings_template.py>>https://drive.ebrains.eu/f/b6dbd9f15e4f4d97af17/]]//, set the ##output_path## to the desired path, and save it as //pipeline/settings.py//.
59 59  
60 60  * **Edit the config files**
61 -Each stage has a config file to specify which analysis/processing blocks to execute and which parameters to use. Edit the config template files `pipeline/stageXX_<stage_name>/config_template.yaml` according to your dataset and analysis goal, and save them as `pipeline/stageXX_<stage_name>/config.yaml`. A detailed description of the available parameter settings and their meaning is commented in the template files, and a more general description of the working mechanism of each stage can be found in the respective README file `pipeline/stageXX_<stage_name>/README.md`.
98 +Each stage has a config file to specify which analysis/processing blocks to execute and which parameters to use. Edit the config template files //pipeline/stageXX_<stage_name>/config_template.yaml// according to your dataset and analysis goal, and save them as //pipeline/stageXX_<stage_name>/config.yaml//. A detailed description of the available parameter settings and their meaning is commented in the template files, and a more general description of the working mechanism of each stage can be found in the respective README file //pipeline/stageXX_<stage_name>/README.md//.
99 +//Links are view-only//
100 +** full pipeline:[[ README.md>>https://drive.ebrains.eu/f/ec474df6919a4089832e/]], config.yaml
101 +** stage01_data_entry: [[README.md>>https://drive.ebrains.eu/f/b46ffe259b3a4a51a277/]], [[config.yaml>>https://drive.ebrains.eu/f/8de751f48d7d47edaec1/]]
102 +** stage02_processing: [[README.md>>https://drive.ebrains.eu/f/7f19d89913624425bf63/]], [[config.yaml>>https://drive.ebrains.eu/f/b1607671f6f2468aa43c/]]
103 +** stage03_trigger_detection: [[README.md>>https://drive.ebrains.eu/f/94d12860dde84bbab7b1/]], [[config.yaml>>https://drive.ebrains.eu/f/6dfb712d5fa24f4f9fcf/]]
104 +** stage04_wavefront_detection: [[README.md>>https://drive.ebrains.eu/d/9c53abd5eaf543b28615/]], [[config.yaml>>https://drive.ebrains.eu/f/9534e46c4fae41c78f17/]]
105 +** stage05_wave_characterization: [[README.md>>https://drive.ebrains.eu/f/4d79f3e314474c22a781/]], [[config.yaml>>https://drive.ebrains.eu/f/1689dda03be04251b85f/]]
62 62  
63 63  * **Enter a dataset**
64 -see `pipeline/stage01_data_entry/README.md`
108 +There are two test datasets in the collab drive (IDIBAPS and LENS) for which there are also corresponding config files and scripts in the data_entry stage. So, these datasets are ready to be used and analyzed.
109 +For adding new datasets see //[[pipeline/stage01_data_entry/README.md>>https://drive.ebrains.eu/f/b46ffe259b3a4a51a277/]]//
65 65  
66 66  * **Run the pipeline (-stages)**
67 -To run the pipeline with snakemake ([intro to snakemake]()) activate the Python environment `conda activate wavescalephant_env`, make sure you are in the working directory `pipeline/`, and call `snakemake` to run the entire pipeline.
68 -To (re-)execute an individual stage, you can navigate to the corresponding stage folder and call the `snakemake` command there. For running an individual stage, you may need to manually set the path for input file for the stage (i.e. the output file of the previous stage) in the config file `INPUT: /path/to/file`.
112 +To run the pipeline with snakemake ([intro to snakemake]()) activate the Python environment ##conda activate wavescalephant_env,## make sure you are in the working directory `pipeline/`, and call ##snakemake## to run the entire pipeline.
113 +To (re-)execute an individual stage, you can navigate to the corresponding stage folder and call the ##snakemake## command there. For running an individual stage, you may need to manually set the path for input file for the stage (i.e. the output file of the previous stage) in the config file ##INPUT: /path/to/file##.
69 69  
70 70  == Accessing and using the results ==
71 71  
72 -All results are stored in the path specified in the `settings.py` file. The folder structure reflects the structuring of the pipeline into stages and blocks. All intermediate results are stored as `.nix` files using the Neo data format ([Neo]()) and can be loaded with `neo.NixIO('/path/to/file.nix').read_block()` ([documentation]()).
73 -Additionally, most blocks produce a figure, and each stage a report file, to give an overview of the execution log, parameters, intermediate results, and to help with debugging.
74 -The final stage (*stage05_wave_characterization*) stores the results as pandas.DataFrames ([pandas]()) in `.csv` files, separately for each measure as well as in a combined dataframe for all measures.
117 +All results are stored in the path specified in the //settings.py// file. The folder structure reflects the structuring of the pipeline into stages and blocks. All intermediate results are stored as //.nix// files using the [[Neo data format>>https://neo.readthedocs.io/en/stable/]] and can be loaded with ##neo.NixIO('/path/to/file.nix').read_block()##. Additionally, most blocks produce a figure, and each stage a report file, to give an overview of the execution log, parameters, intermediate results, and to help with debugging. The final stage (//stage05_wave_characterization//) stores the results as[[ //pandas.DataFrames//>>https://pandas.pydata.org/]] in //.csv// files, separately for each measure as well as in a combined dataframe for all measures.
75 75  
76 76  == References ==
77 77  
78 78  
122 +== License (to discuss) ==
123 +
124 +All text and example data in this collab is licensed under Creative Commons CC-BY 4.0 license. Software code is licensed under a modified BSD license.
125 +
126 +[[image:https://i.creativecommons.org/l/by/4.0/88x31.png||style="float:left"]]
127 +
128 +== ==
129 +
79 79  == Acknowledgments ==
80 80  
81 -This open source software code was developed in part or in whole in the Human Brain Project, funded from the European Union’s Horizon 2020 Framework Programme for Research and Innovation
82 -under the Specific Grant Agreement No. 785907 (Human Brain Project SGA2).
132 +This open source software code was developed in part or in whole in the Human Brain Project, funded from the European Union’s Horizon 2020 Framework Programme for Research and Innovation under the Specific Grant Agreement No. 785907 (Human Brain Project SGA2).
133 +
134 +
135 +[[image:logos_sga2_sp3_uc002.png||alt="Logos SP3 Use Case 2"]]
83 83  )))
84 84  
85 85  
139 +== Executing the pipeline ==
140 +
86 86  (% class="col-xs-12 col-sm-4" %)
87 87  (((
88 88  {{box title="**Contents**"}}
logos_sga2_sp3_uc002.png
Author
... ... @@ -1,0 +1,1 @@
1 +XWiki.denker
Size
... ... @@ -1,0 +1,1 @@
1 +1.1 MB
Content
Collaboratory.Apps.Collab.Code.CollabClass[0]
Description
... ... @@ -1,1 +1,1 @@
1 -Space for developing and hosting a showcase pipeline for performing reproducible and adaptable analysis steps.
1 +Space for developing and hosting a showcase pipeline for performing reproducible and adaptable analysis with the focus of slow cortical waves.