Changes for page data-curation-copy

Last modified by eapapp on 2023/07/04 16:46

From version 31.1
edited by ingrreit
on 2023/03/26 07:15
Change comment: There is no comment for this version
To version 118.1
edited by spieschnik
on 2023/05/08 13:19
Change comment: There is no comment for this version

Summary

Details

Page properties
Author
... ... @@ -1,1 +1,1 @@
1 -XWiki.ingrreit
1 +XWiki.spieschnik
Content
... ... @@ -1,4 +1,4 @@
1 -== Publishing data, models and software via EBRAINS ==
1 +== Publishing neuroscience data, models and software via EBRAINS ==
2 2  
3 3  The aim of this collab is to provide you with all the information you need to publish your experimental data, simulations, computational models, and software via EBRAINS. Have you already published your data somewhere else? You can increase the exposure and impact of your shared dataset by also listing it on EBRAINS.
4 4  
... ... @@ -17,14 +17,15 @@
17 17   Search existing data, models and software in [[the EBRAINS Knowledge Graph Search>>https://kg.ebrains.eu/search/?facet_type[0]=Dataset]]
18 18  
19 19  
20 -(% style="color:#e74c3c" %)[EDIT required]
21 -
22 -(% style="color:#e74c3c" %)Sharing your data, models or code (research products) via EBRAINS makes it discoverable amongst other research products available in the EBRAINS Knowledge Graph>>(%%)[[(% style="color:#e74c3c" %)https:~~/~~/kg.ebrains.eu/>>https://kg.ebrains.eu/]]]]. This is made possible by the highly flexible metadata framework describing neuroscience data in detail. EBRAINS is gradually implementing interconnected tools and analysis workflows developed in the Human Brain Project (HBP) to further enhance the output from adding your dataset to the database.
23 -
24 24  ----
25 25  
26 26  == **The EBRAINS curation process** ==
27 27  
24 +(% class="box successmessage" %)
25 +(((
26 +EBRAINS accepts **experimental data,** of all modalities and from all species, **models**, **software**, **web services **and **metadata models**. You'll find detailed information about each research product below.
27 +)))
28 +
28 28  In EBRAINS, multimodal and heterogeneous neuroscience data, models and software are categorised and described in a standardised manner so that they can be effectively searched, compared, and analysed. This effort is referred to as curation. 
29 29  
30 30  >The EBRAINS curation process involves organising and annotating neuroscientific data to make the data discoverable and reusable.
... ... @@ -31,14 +31,20 @@
31 31  
32 32  Behind this process is the EBRAINS Curation team. Our mandate is to support you in sharing your data in line with the [[**FAIR principles**>>https://www.go-fair.org/fair-principles/]], whether you choose to describe only the key aspects of your data, or can invest in adding more detailed metadata.
33 33  
34 -(% class="box floatinginfobox" %)
35 +Curated data, models and software are made available in [[the EBRAINS Knowledge Graph>>https://kg.ebrains.eu/]]. This makes the data and metadata discoverable in the [[Knowledge Graph Search>>url:https://search.kg.ebrains.eu/]] and programmatically via the [[Knowledge Graph API>>url:https://docs.kg.ebrains.eu/8387ccd27a186dea3dd0b949dc528842/api_endpoints.html]]. The data, models and software are integrated in the EBRAINS Knowledge Graph by interoperable metadata schemas as defined in [[openMINDS>>url:https://github.com/HumanBrainProject/openMINDS/wiki]]. Data and models are linked to and discoverable via the species-specific [[EBRAINS siibra atlas viewer>>url:https://ebrains.eu/services/atlases/brain-atlases]] by using interoperable metadata schemas as defined in [[SANDS>>url:https://github.com/HumanBrainProject/SANDS/wiki]].
36 +
37 +
38 +(% class="box infomessage" %)
35 35  (((
36 -We strongly recommend to start preparing for data sharing as early as possible. With a structured data repository and adequate notes on how the data was acquired, you greatly minimize the effort required to publish your data. The time it takes to share data on EBRAINS heavily depends on on the engagement from the researcher and how well the data and metadata is prepared before-hand. **[[Contact us to prepare for sharing>>mailto:curation-support@ebrains.eu]]. **
40 +**We strongly recommend that you start preparing for data sharing as early as possible.** With a structured data repository and adequate notes on how the data was acquired, you greatly reduce the effort required to publish your data. The time it takes to share data on EBRAINS depends heavily on the engagement of the researcher and on how well the data and metadata are prepared beforehand. **[[Contact us for personalised guidance on how to prepare for sharing>>mailto:curation-support@ebrains.eu]]. **
37 37  )))
38 38  
39 -=== ===
43 +(% class="box successmessage" %)
44 +(((
45 +**Particular needs? Contact us! **The workflows for sharing can be modified for researchers or research groups aiming to frequently publish larger numbers of their research products through EBRAINS. Please contact the curation service team in such cases.
46 +)))
40 40  
41 -=== ===
48 +----
42 42  
43 43  === Step by step - Experimental data ===
44 44  
... ... @@ -45,6 +45,8 @@
45 45  
46 46  [[image:image-20230326054341-1.png]]
47 47  
55 +==== ====
56 +
48 48  ==== **1. Provide some general information about your dataset** ====
49 49  
50 50  The [[Curation request form>>https://nettskjema.no/a/277393#/]] collects preliminary information about your data, allowing us to assess whether the dataset fits within the scope of EBRAINS. The submission generates a curation ID allowing us to track the case.
... ... @@ -56,9 +56,9 @@
56 56  
57 57  EBRAINS offers secure, long-term storage at [[CSCS Swiss National Supercomputing Centre>>url:https://www.cscs.ch/]], with currently no upper limit of storage capacity. The data must be consistently structured prior to upload. 
58 58  
59 -For smaller datasets with a reasonable amount of files, we recommend using the **Collab-Bucket solution (drag-and-drop)**. A Collab Bucket must first be assigned to a dataset, which happens when a datasets is accepted for sharing.
68 +For smaller datasets with a manageable number of files, we recommend using the Collab-Bucket solution (drag-and-drop). A Collab Bucket must first be assigned to a dataset, which happens when a dataset is accepted for sharing.
60 60  
70 +For larger datasets or datasets with a large number of files, we recommend using a programmatic approach. The [[python script>>https://github.com/eapapp/ebrains-data-storage/tree/main/data-proxy]] is interactive and does not require any additional programming.
70 +For larger datasets or datasets with a large amount of files, we recommend using a programmatic approach. The [[python script>>https://github.com/eapapp/ebrains-data-storage/tree/main/data-proxy]] is interactive and does not require any additional programming.
62 62  
63 63  
64 64  If a data collection is already uploaded elsewhere, we may link to the already existing repository.
... ... @@ -68,7 +68,7 @@
68 68  
69 69  Easily submit openMINDS-compatible metadata via our [[metadata wizard>>https://ebrains-metadata-wizard.apps.hbp.eu/]]. This form covers all the required metadata for sharing data via EBRAINS. When you're ready to 'Submit', the metadata and all uploaded files will be sent to the Curation team.
70 70  
71 -For power-users interested in exploring the full span of the openMINDS framework, please check out the [[openMINDS GitHub>>https://github.com/HumanBrainProject/openMINDS]] to learn more about how to programmatically gather your metadata. A stable version of the openMINDS package can be found on [[PyPi>>https://pypi.org/project/openMINDS/]]. We accept openMINDS metadata as JSON-LD (share these with us via curation-support@ebrains.eu). Additional documentation of openMINDS metadata submodules and schemas can be found on [[the openMINDS GitHub Wiki>>https://humanbrainproject.github.io/openMINDS/]].
80 +For power-users interested in exploring the full span of the openMINDS framework, please check out the [[openMINDS GitHub>>https://github.com/HumanBrainProject/openMINDS]] to learn more about how to programmatically gather your metadata. A stable version of the openMINDS package can be found on [[PyPi>>https://pypi.org/project/openMINDS/]]. We accept openMINDS metadata as JSON-LD (share these with us via curation-support@ebrains.eu). Additional documentation of openMINDS metadata submodules and schemas can be found on [[the openMINDS GitHub Wiki>>https://humanbrainproject.github.io/openMINDS/]]. We have prepared [[a list of the metadata properties that are required>>https://drive.ebrains.eu/lib/47995dbc-f576-4008-a76c-eefbfd818529/file/ebrains-minimum-required-metadata.xlsx]] for publishing data on EBRAINS.
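
As a rough illustration of what a JSON-LD metadata file might look like, the following minimal Python sketch writes one record using only the standard library. It does not use the openMINDS package, and it is not the Curation team's tooling. The vocabulary IRI, the type IRI, the property names and the helper functions `build_dataset_version` and `to_jsonld` are assumptions made for this example; please check them against the openMINDS documentation and the list of required metadata properties before relying on them.

```python
import json

# Assumption: this vocabulary IRI follows the pattern used by openMINDS v3.
# Verify it against the openMINDS documentation before use.
OPENMINDS_VOCAB = "https://openminds.ebrains.eu/vocab/"


def build_dataset_version(local_id, full_name, description):
    """Return a dict for one hypothetical DatasetVersion record.

    The property names (fullName, description) and the @id pattern are
    illustrative assumptions, not a complete or validated openMINDS instance.
    """
    return {
        "@context": {"@vocab": OPENMINDS_VOCAB},
        "@id": f"http://localhost/datasetVersion/{local_id}",
        "@type": "https://openminds.ebrains.eu/core/DatasetVersion",
        "fullName": full_name,
        "description": description,
    }


def to_jsonld(record):
    """Serialise the record as an indented JSON-LD string."""
    return json.dumps(record, indent=2, ensure_ascii=False)


if __name__ == "__main__":
    record = build_dataset_version(
        "example-001",
        "Example dataset",
        "Illustrative description only.",
    )
    print(to_jsonld(record))
```

The printed text can be saved with a .jsonld extension. A real submission needs considerably more metadata than this sketch shows, which is why the metadata wizard remains the simplest route for most users.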
72 72  
73 73  
74 74  ==== **4. Write a Data Descriptor ** ====
... ... @@ -109,8 +109,7 @@
109 109  
110 110  **Data users** must request access to the data (via their EBRAINS account) and will receive access provided they actively accept the [[EBRAINS Access Policy>>https://ebrains.eu/terms#access-policy]], the [[EBRAINS General Terms of Use>>https://ebrains.eu/terms#general-terms-of-use]], and the [[EBRAINS Data Use Agreement>>https://ebrains.eu/terms#data-use-agreement]]. Account holders also have to accept that information about their requests for, and access to, specific data under HDG is tracked and stored.
111 111  \\**Data owners** must be aware that sharing under the HDG affects the legal responsibilities for the data. They must agree to joint control of the data (see the [[Data Provision Protocol v1>>url:https://strapi-prod.sos-ch-dk-2.exo.io/EBRAINS_Data_Provision_Protocol_dfe0dcb104.pdf]], section 1.4 - 1.5) and the Data Protection Officers of the responsible institutions must have accepted that the data can be shared under HDG.
112 -\\**Human Data Gateway, Background**
113 -HDG was introduced in February 2021 and developed across multiple teams in the HBP. The initiative to create the service and the initial design originated from EBRAINS Curation in close collaboration with the Data compliance team and the HBP Data Governance Working Group. HDG is a response to the needs of multiple data providers who are bringing data of human origin to EBRAINS. HDG covers the sharing of a limited range of data of human origin, i.e., data without direct identifiers and with very few indirect identifiers (strongly pseudonymized, de-identified). It is an extension of the existing services and does not replace the future EBRAINS Service for sensitive data (planned for 2024) which is outside the domain of the current EBRAINS Data and Knowledge services.
121 +\\The Human Data Gateway (HDG) was introduced in February 2021 and developed across multiple teams in the HBP. The initiative to create the service and the initial design originated from EBRAINS Curation in close collaboration with the Data compliance team and the HBP Data Governance Working Group. HDG is a response to the needs of multiple data providers who are bringing data of human origin to EBRAINS. HDG covers the sharing of a limited range of data of human origin, i.e., data without direct identifiers and with very few indirect identifiers (strongly pseudonymized, de-identified). It is an extension of the existing services and does not replace the future EBRAINS Service for sensitive data (planned for 2024) which is outside the domain of the current EBRAINS Data and Knowledge services.
114 114  
115 115  
116 116  ----
... ... @@ -146,68 +146,126 @@
146 146  
147 147  ----
148 148  
149 -=== Output / result, when you've completed the curation process, what do you get ===
157 +=== Webservices and metadata models ===
150 150  
151 -Curated data, models and software are made available in the [[the EBRAINS Knowledge Graph>>https://kg.ebrains.eu/]]. This makes the data and metadata discoverable in the [[Knowledge Graph Search>>url:https://search.kg.ebrains.eu/]] and programmatically via the [[Knowledge Graph API>>url:https://docs.kg.ebrains.eu/8387ccd27a186dea3dd0b949dc528842/api_endpoints.html]]. The data, models and software are integrated in the EBRAINS Knowledge Graph by interoperable metadata schemas as defined in [[openMINDS>>url:https://github.com/HumanBrainProject/openMINDS/wiki]].
159 +(% class="wikigeneratedid" id="HContact...." %)
160 +(% style="color:#e74c3c; font-size:16px" %)Contact....(% style="color:#4a5568; font-size:16px" %)
152 152  
153 -Data and models are linked to and discoverable via the species-specific [[EBRAINS Interactive Atlas Viewer>>url:https://ebrains.eu/services/atlases/brain-atlases]] by using interoperable metadata schemas as defined in [[SANDS>>url:https://github.com/HumanBrainProject/SANDS/wiki]].
162 +----
154 154  
164 +== **The curation team: meet the curators** ==
165 +
166 +**Located in Norway:**
167 +
168 +
169 +|(% style="width:303px" %)(((
170 +[[image:My project2.jpg||height="209" width="167"]]
171 +
172 +**Archana Golla**
173 +
174 +(% class="small" %)Curation Scientist
175 +Neuroscience (PhD)(%%)
176 +Behavioral neuroscience and microscopy
177 +)))|(% style="width:303px" %)(((
178 +[[image:My project1.jpg||height="209" width="167"]]
179 +
180 +**Sophia Pieschnik**
181 +
182 +(% class="small" %)Curation Scientist
183 +Neurocognitive Psychology (M. Sc.)(%%)
184 +Neuroimaging
185 +)))|(% style="width:303px" %)(((
186 +[[image:My project (1).jpg||height="209" width="167"]]
187 +
188 +**Ingrid Reiten**
189 +
190 +(% class="small" %)Curation Scientist,
191 +Phd Student
192 +Neuroscience (M. Sc.)(%%)
193 +Neuroanatomy and structural connectivity
194 +)))|(% style="width:303px" %)(((
195 +[[image:My project.jpg||height="209" width="167"]]
196 +
197 +**Camilla H. Blixhavn**
198 +
199 +(% class="small" %)Curation Scientist,
200 +Phd Student
201 +Neuroscience (M. Sc.)(%%)
202 +Neuroanatomy and data integration
203 +)))
204 +
205 +
206 +
207 +
208 +**Located in Germany:**
209 +
210 +[[image:My project (2).jpg||height="209" width="167"]]
211 +
212 +**Jan Gündling**
213 +
214 +(% class="small" %)Curation Scientist
215 +Sensors and Cognitive Psychology (M. Sc.)(%%)
216 +Human-Computer Interaction
217 +
155 155  ----
156 156  
157 -== **Resources for researchers looking to share data** ==
220 +== **Information and resources for researchers looking to share data** ==
158 158  
159 159  Below you can find some resources that can come in handy if you are looking to share data via EBRAINS, or in general.
160 160  
161 161  ----
162 162  
163 -=== **Why should I share data?** ===
226 +=== **Improve your research product ** ===
164 164  
165 -By sharing your data via EBRAINS, you gain access to the following benefits:
228 +==== **Add a tutorial or learning resource to your research product ** ====
166 166  
167 -[[image:image-20230324170841-3.png]]
230 +==== (% style="color:#e74c3c" %)- Learning resource [information](%%) ====
168 168  
169 169  
233 +==== **Create a workflow for .... ? ** ====
170 170  
171 -We support you to better follow the FAIR^^ ^^guiding principles for data management and stewardship{{footnote}}Wilkinson, M., Dumontier, M., Aalbersberg, I. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data 3, 160018 (2016). https://doi.org/10.1038/sdata.2016.18 {{/footnote}}.  Publishing data, models or code via EBRAINS will provide you with a citeable [[DataCite DOI>>https://www.doi.org/the-identifier/resources/handbook/]] for your research product.
235 +==== (% style="color:#e74c3c" %)- Workflows [information](%%) ====
172 172  
173 -
174 174  ----
175 175  
176 -(% style="font-family:inherit" %) (% style="color:#1a202c; font-family:inherit; font-size:26px" %)**What can I share on EBRAINS? **
239 +=== **Integrate your data, models or software with other services** ===
177 177  
178 -(% class="wikigeneratedid" id="H" %)
179 -[[image:image-20230324170829-2.png]]
241 +EBRAINS supports further integration for a variety of data, and is continuously looking to increase the number of interoperable services.
180 180  
243 +* Integrate image data with //the Mio viewer//: EBRAINS Multi-Image OpenSeadragon viewer provides an intuitive way of navigating high-resolution 2D image series. It has browser-based classic pan and zoom capabilities. A collection can be displayed as a filmstrip (Filmstrip Mode) or as a table (Collection Mode) with an adjustable number of rows and columns. See [[Mio viewer links available for this dataset>>https://search.kg.ebrains.eu/?category=Dataset&q=nr2f1#9677359c-73fa-4425-b8fa-3de794e9017a]] as an example. The Mio viewer user manual is found [[here>>https://multi-image-osd.readthedocs.io/en/latest/index.html]].
244 +* Integrate atlas-registered 2D image data with //the LocaliZoom viewer//: The EBRAINS LocaliZoom serial section viewer displays series of registered 2D section images with atlas overlay, allowing the users to zoom into high-resolution images and have information about the brain regions. See the [[LocaliZoom links available for this dataset>>https://doi.org/10.25493/T686-7BX]] as an example. LocaliZoom user manual is found [[here>>https://localizoom.readthedocs.io/en/latest/index.html]].
245 +* Add your data, models or software to a// Live paper//: (% style="color:#e74c3c" %)[description]
246 +* Integrate your data to //the Siibra//-explorer: The siibra-explorer is used for visualizing volumetric brain data in all the brain atlases provided by EBRAINS (Human, Monkey, Rat and Mouse). The siibra-explorer viewer uses siibra-api to enable navigation of brain region hierarchies, maps in different coordinate spaces, and linked regional data features. Furthermore, it is connected with the siibra toolsuite providing several analytical workflows. To learn more about how to integrate your data to atlases, check out the [[Atlas services>>https://ebrains.eu/services/atlases#Integratedatatoanatlas]] on ebrains.eu.
181 181  
182 182  ----
183 183  
184 -=== **Useful information about sharing of experimental data on EBRAINS ** ===
250 +=== **The benefits of sharing data ** ===
185 185  
252 +(% style="color:#000000" %)Sharing your data, models or code (research products) via EBRAINS makes it discoverable amongst other research products available in the (%%)[[(% style="color:#000000" %)EBRAINS Knowledge Graph>>https://kg.ebrains.eu/]](%%). This is made possible by the highly flexible metadata framework describing neuroscience data in detail.
186 186  
187 -|(% style="width:593px" %)(((
188 -[[[[image:image-20230324171114-2.png]]>>https://drive.ebrains.eu/f/dfd374b9b43a458192e9/]]
189 -)))|(% style="width:1240px" %)(((
190 -[[[[image:image-20230324171109-1.png]]>>https://drive.ebrains.eu/f/c1ccb78be52e4bdba7cf/]]
191 -)))
192 -|(% style="width:593px" %)//Collection of useful information for researchers looking to share experimental data on EBRAINS.//|(% style="width:1240px" %)//The EBRAINS data descriptor//
254 +(% style="color:#000000" %)EBRAINS is gradually implementing interconnected tools and analysis workflows developed in the Human Brain Project (HBP) to further enhance the output from adding your dataset to the database.
193 193  
194 -----
195 195  
196 -=== **Introduction to data organisation ** ===
257 +By sharing your data via EBRAINS, you gain access to the following benefits:
197 197  
198 -Have you ever experienced not being able to find a file that you were sure you had somewhere? We have prepared a [[collection of guidelines>>https://drive.ebrains.eu/smart-link/25299f04-c4e5-4028-8f5f-3b8208f9a532/]] and [[advice>>https://drive.ebrains.eu/lib/f5cf4964-f095-49bd-8c34-e4ffda05a497/file/DataOrganisation.zip]] on how to organise files and folders to ensure consistency and reproducibility in the future.
259 +[[image:image-20230324170841-3.png]]
199 199  
200 -* Why is data organisation important?
201 -* How to organise my data repository?
202 -* What is a Data Descriptor and why do I need one?
203 203  
262 +
263 +We support you to better follow the FAIR^^ ^^guiding principles for data management and stewardship{{footnote}}Wilkinson, M., Dumontier, M., Aalbersberg, I. et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data 3, 160018 (2016). https://doi.org/10.1038/sdata.2016.18 {{/footnote}}.  Publishing data, models or code via EBRAINS will provide you with a citeable [[DataCite DOI>>https://www.doi.org/the-identifier/resources/handbook/]] for your research product.
264 +
265 +
204 204  ----
205 205  
206 -=== **Integrate your data in the EBRAINS atlas services** ===
268 +=== **At a glance: "Sharing experimental data on EBRAINS" ** ===
207 207  
208 -EBRAINS supports viewers for a variety of data, and is continuously looking to improve the services for visualising data. For 2D histology image data that is registered to an EBRAINS supported brain atlas, the data and the overlying atlas plates can be uploaded to the LocaliZoom viewer. See for example the [[LocaliZoom links available for this dataset>>https://doi.org/10.25493/T686-7BX]] as an example.
209 209  
210 -To learn more about how to integrate your data to atlases, check out the [[Atlas services>>https://ebrains.eu/services/atlases#Integratedatatoanatlas]] on ebrains.eu.
271 +|(% style="width:439px" %)(((
272 +[[[[image:image-20230324171114-2.png||height="354" width="250"]]>>https://drive.ebrains.eu/f/dfd374b9b43a458192e9/]]
273 +)))|(% style="width:461px" %)(((
274 +[[[[image:image-20230324171109-1.png||height="352" width="250"]]>>https://drive.ebrains.eu/f/c1ccb78be52e4bdba7cf/]]
275 +)))|(% style="width:416px" %)[[[[image:image-20230330120354-1.png||height="352" width="250"]]>>https://drive.ebrains.eu/f/707147a883b94fae8e69/]]
276 +|(% style="width:439px" %)//Collection of useful information for researchers looking to share experimental data on EBRAINS.//|(% style="width:461px" %)//The EBRAINS data descriptor: a general overview //|(% style="width:416px" %)//Introduction to data organization: A [[collection of guidelines>>https://drive.ebrains.eu/smart-link/25299f04-c4e5-4028-8f5f-3b8208f9a532/]] on how to organise files and folders to ensure consistency and reproducibility in the future. //
211 211  
212 212  ----
213 213  
... ... @@ -216,8 +216,6 @@
216 216  (((
217 217  ==== ====
218 218  
219 ->
220 -
221 221  (((
222 222  >The curation process is time consuming and difficult
223 223  )))
69399800_399857187337623_8446982631391756288_n.jpg
Author
... ... @@ -1,0 +1,1 @@
1 +XWiki.spieschnik
Size
... ... @@ -1,0 +1,1 @@
1 +198.8 KB
Content
Image-1.jpg
Author
... ... @@ -1,0 +1,1 @@
1 +XWiki.spieschnik
Size
... ... @@ -1,0 +1,1 @@
1 +154.3 KB
Content
My project (1).jpg
Author
... ... @@ -1,0 +1,1 @@
1 +XWiki.spieschnik
Size
... ... @@ -1,0 +1,1 @@
1 +772.0 KB
Content
My project (2).jpg
Author
... ... @@ -1,0 +1,1 @@
1 +XWiki.spieschnik
Size
... ... @@ -1,0 +1,1 @@
1 +564.9 KB
Content
My project.jpg
Author
... ... @@ -1,0 +1,1 @@
1 +XWiki.spieschnik
Size
... ... @@ -1,0 +1,1 @@
1 +607.5 KB
Content
My project1.jpg
Author
... ... @@ -1,0 +1,1 @@
1 +XWiki.spieschnik
Size
... ... @@ -1,0 +1,1 @@
1 +718.0 KB
Content
My project2.jpg
Author
... ... @@ -1,0 +1,1 @@
1 +XWiki.spieschnik
Size
... ... @@ -1,0 +1,1 @@
1 +701.2 KB
Content
ansattfoto2.jpg
Author
... ... @@ -1,0 +1,1 @@
1 +XWiki.spieschnik
Size
... ... @@ -1,0 +1,1 @@
1 +1.8 MB
Content
csm_Jan_Gruendling_3_bf9567ebb1.jpg
Author
... ... @@ -1,0 +1,1 @@
1 +XWiki.spieschnik
Size
... ... @@ -1,0 +1,1 @@
1 +108.6 KB
Content
fb6d6429-709e-4f4d-be3b-eab0a46695bd.JPG
Author
... ... @@ -1,0 +1,1 @@
1 +XWiki.spieschnik
Size
... ... @@ -1,0 +1,1 @@
1 +302.3 KB
Content
image-20230330120354-1.png
Author
... ... @@ -1,0 +1,1 @@
1 +XWiki.ingrreit
Size
... ... @@ -1,0 +1,1 @@
1 +140.9 KB
Content