Changes for page Technical details
Last modified by lzehl on 2021/07/05 18:57
Summary
-
Page properties (2 modified, 0 added, 0 removed)
Details
- Page properties
-
- Parent
-
@@ -1,1 +1,1 @@
-Collabs.openminds.openMINDS core.WebHome
+Collabs.openminds.Documentation.WebHome
- Content
-
@@ -10,104 +10,137 @@
 Please find below the documentation of the layout and requirements needed to keep the openMINDS modularity, the syntax of the openMINDS schema template, as well as the openMINDS integration pipeline.
 )))

-=== Overview of the openMINDS layout ===
+=== The openMINDS umbrella ===

 (% style="text-align: justify;" %)
 In summary, openMINDS is the overall umbrella for a set of distributed GitHub repositories, each defining a particular metadata model for neuroscience research products.

-=== The openMINDS schema template syntax ===
+(% style="text-align: justify;" %)
+The main (or central) [[openMINDS GitHub repository>>https://github.com/HumanBrainProject/openMINDS||rel="noopener noreferrer" target="_blank"]] ingests all these GitHub repositories as [[git-submodules>>https://git-scm.com/docs/git-submodule||rel="noopener noreferrer" target="_blank"]]. Furthermore, it stores the openMINDS vocabulary (**##vocab##**), providing general definitions and references for **types** and **properties** used in schemas across all openMINDS repositories (cf. below). Finally, it holds the schema representations for all supported metadata formats created by the openMINDS integration pipeline (cf. below).

 (% style="text-align: justify;" %)
-All openMINDS metadata models use a light-weighted schema template syntax for defining the expected metadata. The correspondingly formatted schema files use the extension: **##.schema.tpl.json##**.
+For this to work smoothly for existing as well as new openMINDS metadata models, the corresponding openMINDS submodules (GitHub repositories) have to meet the following requirements:

 (% style="text-align: justify;" %)
-Although, as the file extension suggests, this openMINDS schema template syntax is inspired by JSON-Schema, it facilitates or even excludes technical aspects that are generally expected for the openMINDS schemas making them more human-readable, especially for untrained eyes. Behind the scenes, within the openMINDS integration pipeline (cf. below), this schema template syntax is then interpreted and flexibly translated to various formal metadata formats (e.g., JSON-Schema).
+**(1)** The openMINDS metadata model has to be located on a **public GitHub repository** and published under an **MIT license**.

 (% style="text-align: justify;" %)
-Despite the simplification in comparison to JSON-Schema, the openMINDS schema templates are also, at the core, specially formatted JSON files using a particular syntax, meaning special key-value pairs that define the validation rules of a schema.
+**(2)** The GitHub repository should have at least one **version branch** (e.g., "v1").

 (% style="text-align: justify;" %)
-Please find in the following a full documentation of the openMINDS schema template syntax and how its key-value pairs need to be defined and interpreted.
+**(3)** The version branch should have the following **main directory folders**: **##schemas##** (required), **##tests##** (recommended), **##examples##** (recommended), and **##img##** (optional).

 (% style="text-align: justify;" %)
-===== Target & concept templates =====
+**(4)** The **##schemas##** folder should contain the schemas of that metadata model implemented in the **openMINDS schema template syntax** (cf. below). The directory of the schemas can be further structured or flat.

 (% style="text-align: justify;" %)
-Same as in JSON-Schema, all openMINDS schema templates define the expected name (written in **##lowerCamelCase##**) and value of the metadata, typically called property, under the key **##properties##** as nested dictionaries. Furthermore, the names of obligatory metadata can be listed under the key **##required##**. Here a generalized example:
+**(5)** The **##tests##** folder should contain test instances (JSON-LDs) for the schemas in a flat directory. The file names of these test instances should follow the convention

-{{code language="json"}}
-{
-  "properties": {
-    "propertyNameA": {},
-    "propertyNameB": {},
-    "propertyNameC": {}
-  },
-  "required": [
-    "propertyNameA",
-    "propertyNameC"
-  ]
-}
-{{/code}}
+(% style="text-align: center;" %)
+**##<<XXX>>-<<YYY>>.jsonld##**

 (% style="text-align: justify;" %)
-In addition, an openMINDS schema //has to have// a key **##"_type"##** to be recognized as **target template**. In other words, the **##"_type"##** is used to define the openMINDS namespace of a corresponding schema using a particular naming convention. Here again a generalized example:
+for files that should pass the tests, and

-{{code language="json"}}
-{
-  "_type": "https:~/~/openminds.ebrains.eu/<<schema-model>>/<<schema-name>>",
-  "properties": {}
-}
-{{/code}}
+(% style="text-align: center;" %)
+**##<<XXX>>-<<YYY>>-nok.jsonld##**

 (% style="text-align: justify;" %)
-Note that **##<<schema-model>>##** has to be replaced with the label of the openMINDS metadata model to which the corresponding schema belongs to, and **##<<schema-name>>##** has to be replaced with the corresponding name of the schema (written in **##CamelCase##**).
+for files that should fail the tests. In both cases, **##<<XXX>>##** should be replaced with the label of the schema that is tested, and **##<<YYY>>##** with a user-defined label for the aspect that is tested (e.g., **##person-withoutCI.jsonld##**).

 (% style="text-align: justify;" %)
-If an openMINDS schema template //does not// define a key **##"_type"##** (as first example above), it is interpreted as a **concept template** which //has to be// extended to a target template.
+**(6)** The **##examples##** folder should contain examples of valid instance collections for that metadata model. Each example should receive its own directory (folder) with a **##README.md##** describing the example, and a **##metadataCollection##** subfolder containing the openMINDS instances (JSON-LDs). This subfolder can be further structured or flat.
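
The file-name convention for test instances in requirement (5) can be checked mechanically. The following is a minimal sketch, not part of the openMINDS tooling: the function name `classify_test_instance` is hypothetical, and it assumes that neither the schema label nor the aspect label contains a hyphen, which the requirement does not state explicitly.

```python
import re
from typing import Optional, Tuple

# Assumed shape: <schema-label>-<aspect-label>[-nok].jsonld
# Assumption: neither label contains a hyphen.
_TEST_NAME = re.compile(
    r"^(?P<schema>[^-]+)-(?P<aspect>[^-]+)(?P<nok>-nok)?\.jsonld$"
)


def classify_test_instance(file_name: str) -> Optional[Tuple[str, str, bool]]:
    """Return (schema_label, aspect_label, should_pass).

    Returns None if the file name does not follow the convention.
    """
    match = _TEST_NAME.match(file_name)
    if match is None:
        return None
    # A "-nok" suffix marks an instance that is expected to fail the tests.
    should_pass = match["nok"] is None
    return match["schema"], match["aspect"], should_pass
```

For example, `classify_test_instance("person-withoutCI-nok.jsonld")` yields the schema label `person`, the aspect label `withoutCI`, and a flag saying the instance is expected to fail.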

 (% style="text-align: justify;" %)
-Concept templates are and should be used when multiple target templates have the same subset of properties, because they facilitate the long-term maintenance of those shared properties: Instead of defining the same properties repeatedly within multiple target templates, the common subset can be defined within a single concept template and passed on to all extending target templates.
+**(7)** The **##img##** folder should contain image files used on that GitHub repository (e.g., the logo of the new openMINDS metadata model). The directory of the images can be further structured or flat.

+=== The openMINDS vocabulary ===
+
 (% style="text-align: justify;" %)
-To define that a target template is the extension of a concept template, the target template can state under **##"_extends"##** the relative path to the concept template. For example, the openMINDS core target template **##Dataset##** extends the core concept template **##researchProduct##** as indicated here:
+Through the integration pipeline of the openMINDS generator, the openMINDS vocabulary is automatically gathered and stored in the main openMINDS GitHub repository in order to centrally maintain general definitions and references for **types** and **properties** used in schemas across all openMINDS repositories. How this works is explained in the following.

+(% style="text-align: justify;" %)
+Schema types and properties are stored in dedicated JSON files (**##types.json##** and **##properties.json##**) under the folder **##vocab##** located in the main openMINDS GitHub directory. Each schema type and property occurring in the openMINDS metadata models is automatically represented in those files as nested dictionaries. Here is a cutout of the **##types.json##**:
+
 {{code language="json"}}
 {
-  "_type": "https:~/~/openminds.ebrains.eu/core/Dataset",
-  "_extends": "products/researchProduct.schema.tpl.json"
+  ...,
+  "https://openminds.ebrains.eu/core/Person": {
+    "description": "Structured information on a person (alive or dead).",
+    "name": "Person",
+    "translatableTo": [
+      "https://schema.org/Person"
+    ]
+  },
+  ...
 }
 {{/code}}

+... and a cutout of the **##properties.json##**:
+
+{{code language="json"}}
+{
+  ...,
+  "givenName": {
+    "description": "Name given to a person, including all potential middle names, but excluding the family name.",
+    "name": "Given name",
+    "nameForReverseLink": "Is given name of",
+    "sameAs": [
+      "https://schema.org/givenName"
+    ],
+    "schemas": [
+      "core/v3/actors/person.schema.tpl.json"
+    ]
+  },
+  ...
+}
+{{/code}}
+
 (% style="text-align: justify;" %)
-Note that this convention requires the concept and corresponding target templates to be located in the same openMINDS metadata model repository. Note also that for properties, the following rules apply for target and concept template:
+The keywords of those nested dictionaries are pre-defined to consistently capture for all schema types and properties their namespace, their occurrence (cf. **##"schemas"##** in **##properties.json##**), their general description (cf. **##"description"##** in **##types.json##** and **##properties.json##**), and possible references to related or matching schema types (cf. **##"translatableTo"##** in **##types.json##**) and properties (cf. **##"sameAs"##** in **##properties.json##**) of other metadata initiatives (e.g., schema.org).

-1. A concept template has to define some properties which will be inherited by all extending target templates.
-1. If a concept template additionally states that some of these properties are required, all extending target templates will require the same properties.
-1. A target template can require properties of the concept template, that are not explicitly required within the concept template. In such a case, the other target templates extending the same concept template will not require those properties.
-1. A target template can (but does not have to) define and require additional properties that were not defined and required in the concept template. These additionally defined and required properties will not be shared with the other target templates extending the same concept template.
+(% style="text-align: justify;" %)
+This setup also allows us to define some values/entries to be automatically filled in by the openMINDS integration pipeline with each commit to one of the openMINDS repositories (**##"name"##**, **##"schemas"##**) and others to be manually editable later on (**##"description"##**, **##"translatableTo"##**, **##"sameAs"##**, **##"nameForReverseLink"##**).

 (% style="text-align: justify;" %)
-How to define the expected value of a property will be explained for the different property types in the following sections.
+For security, outdated entries in those openMINDS vocabulary files (e.g., because the namespace of the schema type or property changed or the schema type or property was deleted) are not automatically deleted, but kept and marked as being deprecated. After evaluation, deprecated schema types or properties can be deleted manually from the openMINDS vocabulary.

 (% style="text-align: justify;" %)
-===== Defining expected values =====
+With that, the openMINDS vocab always reflects an up-to-date status of the schema types and properties in use across all openMINDS metadata models, while providing the opportunity to centrally review and maintain their consistency and references.
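
The vocabulary update described above (automatic fields refreshed, manual fields preserved, outdated entries kept and flagged) can be illustrated with a small sketch. This is not the code of the openMINDS integration pipeline: the function names, the `"deprecated"` marker key, and the rule for deriving `"name"` from the property key are all assumptions made for illustration.

```python
import re
from typing import Dict, List


def label_from_property(name: str) -> str:
    """Turn 'givenName' into 'Given name' (assumed labelling rule)."""
    # Insert a space at every lowerCamelCase word boundary, then normalise case.
    words = re.sub(r"(?<=[a-z0-9])(?=[A-Z])", " ", name).lower()
    return words[:1].upper() + words[1:]


def update_properties_vocab(
    vocab: Dict[str, dict], occurrences: Dict[str, List[str]]
) -> Dict[str, dict]:
    """Merge freshly gathered property occurrences into properties.json content.

    vocab:       current content of properties.json
    occurrences: property name -> schema files that currently use it
    """
    updated = {}
    for prop in sorted(set(vocab) | set(occurrences)):
        entry = dict(vocab.get(prop, {}))
        if prop in occurrences:
            # Automatically maintained fields are always overwritten.
            entry["name"] = label_from_property(prop)
            entry["schemas"] = sorted(occurrences[prop])
            entry.pop("deprecated", None)
            # Manually editable fields are created empty, never overwritten.
            entry.setdefault("description", None)
            entry.setdefault("nameForReverseLink", None)
            entry.setdefault("sameAs", [])
        else:
            # No longer used by any schema: keep the entry, flag it
            # ("deprecated" is a hypothetical key name).
            entry["deprecated"] = True
        updated[prop] = entry
    return updated
```

The design point is that nothing is ever deleted by the merge itself; removal of flagged entries stays a manual step, as the text describes.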

+=== The openMINDS schema template syntax ===
+
 (% style="text-align: justify;" %)
-The expected value of a property can be defined in large parts in the same way with some openMINDS syntax specific simplifications and modifications.
+All openMINDS metadata models are defined using a lightweight schema template syntax. Although this schema template syntax is inspired by JSON-Schema, it outsources most schema technicalities to be handled in the openMINDS integration pipeline, making the openMINDS schemas more human-readable, especially for untrained eyes. The few remaining customized technical properties which need additional interpretation are prefixed in the syntax with **##"_"##**. Within the openMINDS integration pipeline (cf. below), the schema template syntax is interpreted, extended and flexibly translated to various formal metadata formats (e.g., JSON-Schema). The specifications of the openMINDS schema template syntax are described in the following.

 (% style="text-align: justify;" %)
-On the first level, the **##"type"##** of the expected property value needs to be defined. In principle, the openMINDS template syntax supports the same value types as JSON-Schema Draft 7.0, meaning:
-+ **##"string"##**
-+ **##"number"##**
-+ **##"integer"##**
-+ **##"array"##**
-+ **##"boolean"##**
-+ **##"null"##**
-+ **##"object"##**
+All openMINDS schemas need to have the extension **##.schema.tpl.json##** and each schema is defined as a nested associative array (dictionary) with the following conceptual structure:

-Also very similar to JSON-Schema, additional type-specific keys can be used to set further requirements for the expected value. H
+{{code language="json"}}
+{
+  "_type": "https://openminds.ebrains.eu/LABEL_OF_METADATA_MODEL/SCHEMA_NAME",
+  "properties": {
+    "PROPERTY_NAME": {
+      "type": "DATA_TYPE",
+      "_instruction": "METADATA_ENTRY_INSTRUCTION"
+    }
+  },
+  "required": [
+    "PROPERTY_NAME"
+  ]
+}
+{{/code}}

+(% style="text-align: justify;" %)
+**##"_type"##** defines the schema type (or namespace) with the depicted naming convention, where the label of the respective openMINDS metadata model (e.g., **##"core"##**) and the schema name (format: UpperCamelCase; e.g., **##"Person"##**) have to be specified. The schema name should be meaningful and provide some insight into what metadata content the schema covers.

+(% style="text-align: justify;" %)
+Under **##"properties"##** a nested associative array is defined, where each key defines the property name (format: lowerCamelCase; e.g., **##"givenName"##**) and the associated value is again a nested associative array defining the expected data **##"type"##** and the **##"_instruction"##** for entering the correct metadata for the respective property. Currently the openMINDS schema template syntax supports the following data types: **##"string"##**, **##"integer"##**, **##"float"##**, **##"boolean"##**, **##"array"##** and **##"object"##**. Depending on the expected data type, additional constraints can be defined for the metadata entry of a respective property (cf. below).
+
+(% style="text-align: justify;" %)
+Under **##"required"##** a list of property names can be provided that are obligatory in a correctly instantiated metadata instance of the respective schema. If none of the properties is required, this key-value pair does not have to be specified.
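
The structural rules above (the `"_type"` naming convention, lowerCamelCase property names, the listed data types, and `"required"` referring only to defined properties) can be sketched as a small checker. This is an illustrative sketch, not the validation performed by the openMINDS integration pipeline; the function name `check_template` and the exact regular expressions are assumptions, and only the rules stated in this section are covered.

```python
import re
from typing import List

# Data types listed in the section above.
SUPPORTED_TYPES = {"string", "integer", "float", "boolean", "array", "object"}

# https://openminds.ebrains.eu/<model label>/<UpperCamelCase schema name>
TYPE_PATTERN = re.compile(
    r"^https://openminds\.ebrains\.eu/[^/]+/[A-Z][A-Za-z0-9]*$"
)
# lowerCamelCase property names, approximated as "starts with a lowercase letter".
PROPERTY_PATTERN = re.compile(r"^[a-z][A-Za-z0-9]*$")


def check_template(template: dict) -> List[str]:
    """Return a list of human-readable problems (empty if none were found)."""
    problems = []
    if not TYPE_PATTERN.match(template.get("_type", "")):
        problems.append("'_type' does not follow the naming convention")
    properties = template.get("properties", {})
    for name, spec in properties.items():
        if not PROPERTY_PATTERN.match(name):
            problems.append(f"property '{name}' is not lowerCamelCase")
        if spec.get("type") not in SUPPORTED_TYPES:
            problems.append(
                f"property '{name}' has unsupported type {spec.get('type')!r}"
            )
    # "required" is optional, but may only name defined properties.
    for name in template.get("required", []):
        if name not in properties:
            problems.append(f"required property '{name}' is not defined")
    return problems
```

A template loaded with `json.load` from a `.schema.tpl.json` file can be passed in directly; an empty result means none of the rules sketched here was violated.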
+
 === The openMINDS integration pipeline ===

-(//**coming soon**//)
+(//**coming soon**//) If you'd like to learn more about the openMINDS integration pipeline, especially if you'd like to contribute to it, please get in touch with us (the openMINDS development team) via the issues on the openMINDS or openMINDS_generator GitHub or the support email: openminds@ebrains.eu
+
+{{putFootnotes/}}