Changes for page Technical details
Last modified by lzehl on 2021/07/05 18:57
Summary
-
Page properties (2 modified, 0 added, 0 removed)
Details
- Page properties
-
- Parent
-
... ... @@ -1,1 +1,1 @@ 1 -Collabs.openminds. Documentation.WebHome1 +Collabs.openminds.openMINDS core.WebHome - Content
-
... ... @@ -10,132 +10,109 @@ 10 10 Please find below a documentation of the layout and requirements needed to keep the openMINDS modularity, the syntax of the openMINDS schema template, as well as the openMINDS integration pipeline. 11 11 ))) 12 12 13 -=== The openMINDSumbrella ===13 +=== Overview of the openMINDS layout === 14 14 15 15 (% style="text-align: justify;" %) 16 16 In summary, openMINDS is the overall umbrella for a set of distributed GitHub repositories, each defining a particular metadata model for neuroscience research products. 17 17 18 -(% style="text-align: justify;" %) 19 -The main (or central) [[openMINDS GitHub repository>>https://github.com/HumanBrainProject/openMINDS||rel="noopener noreferrer" target="_blank"]] ingests all these GitHub repositories as [[git-submodules>>https://git-scm.com/docs/git-submodule||rel="noopener noreferrer" target="_blank"]]. Furthermore it stores the openMINDS vocabulary (**##vocab##**), providing general definitions and references for **types** and **properties** used in schemas across all openMINDS repositories (cf. below). And last but not least, it holds the schema representations for all supported metadata formats created by the openMINDS integration pipeline (cf. below). 18 +=== The openMINDS schema template syntax === 20 20 21 21 (% style="text-align: justify;" %) 22 - Forthistowork smoothlyfor theexisting, but alsoforall newopenMINDSmetadatamodels, the corresponding openMINDSsubmodules(GitHub repositories)haveto meetthefollowing requirements:21 +All openMINDS metadata models use a light-weighted schema template syntax for defining the metadata. The correspondingly formatted schema files use the extension: **##.schema.tpl.json##**. 23 23 24 24 (% style="text-align: justify;" %) 25 - **(1)**The openMINDS metadatamodel has tobe located ona**publicGitHubrepository**andpublishedunder an**MITlicense**.24 +Although, as the file extension suggests, this openMINDS schema template syntax is inspired by JSON-Schema, it facilitates or even excludes technical aspects that are generally expected for the openMINDS schemas making them more human-readable, especially for untrained eyes. Behind the scenes, within the openMINDS integration pipeline (cf. below), this schema template syntax is then interpreted and flexibly translated to various formal metadata formats (e.g., JSON-Schema). 26 26 27 27 (% style="text-align: justify;" %) 28 - **(2)**TheGitHubrepositoryshouldhave at leastone**versionbranch** (e.g., "v1").27 +Despite the simplification in comparison to JSON-Schema, the openMINDS schema templates are also, at the core, specially formatted JSON files using a particular syntax, meaning special key-value pairs that define the validation rules of a schema. 29 29 30 30 (% style="text-align: justify;" %) 31 - **(3)** Theversionbranchshould havethe following**maindirectoryfolders**:**##schemas##**(required), **##tests##**(recommended),**##examples##**(recommended),and**##img##** (optional).30 +Please find in the following a full documentation of the openMINDS schema template syntax and how it's key-value pairs need to be defined and interpreted. 32 32 33 33 (% style="text-align: justify;" %) 34 - **(4)**The **##schemas##** foldershould containtheschemasof that metadata model implemented inthe**openMINDS schematemplateyntax**(cf. below). The directory of the schemas can be further structured or flat.33 +===== Target & context templates ===== 35 35 36 36 (% style="text-align: justify;" %) 37 - **(5)**The**##tests##**foldershouldcontain test-instances(JSON-LDs)for theschemasinaflat directory. The filenamesforthese test-instancesshouldfollowthe convention of36 +Same as in JSON-Schema, all openMINDS schemas define the name and value of the metadata they expect under the key **##properties##** as nested dictionaries, and which of those properties is obligatory under the key **##required##** listing the corresponding property names, as depicted here: 38 38 39 -(% style="text-align: center;" %) 40 -**##<<XXX>>-<<YYY>>.jsonld##** 38 +{{code language="json"}} 39 +{ 40 + "properties": { 41 + "propertyNameA": {}, 42 + "propertyNameB": {}, 43 + "propertyNameC": {} 44 + }, 45 + "required": [ 46 + "propertyNameA", 47 + "propertyNameC" 48 + ] 49 +} 50 +{{/code}} 41 41 42 42 (% style="text-align: justify;" %) 43 - forfiles that should passthe tests, and53 +In addition, an openMINDS schema //has to have// a key **##"_type"##** to be recognized as **target template**. In other words, the **##"_type"##** is used to define the openMINDS namespace of a corresponding schema using a particular naming convention, as depicted here: 44 44 45 -(% style="text-align: center;" %) 46 -**##<<XXX>>-<<YYY>>-nok.jsonld##** 55 +{{code language="json"}} 56 +{ 57 + "_type": "https:~/~/openminds.ebrains.eu/<<schema-model>>/<<schema-name>>", 58 + "properties": {}, 59 + "required": [] 60 +} 61 +{{/code}} 47 47 48 48 (% style="text-align: justify;" %) 49 - for files that should fail thetest.In both cases,**##<<XXX>>##**shouldbe replaced with the label of theschema that istested, and **##<<YYY>>##** witha userdefinedlabelforwhataspectistested(e.g.,**##person-withoutCI.jsonld##**).64 +where **##<<schema-model>>##** has to be replaced with the label of the openMINDS metadata model to which the corresponding schema belongs to, and **##<<schema-name>>##** has to be replaced with the corresponding name of that schema (written in **##CamelCase##**). 50 50 51 51 (% style="text-align: justify;" %) 52 - **(6)** The **##examples##**foldershould containexamples for valid instancecollectionsfor that metadatamodel. Eachexampleshouldreceive itsown directory (folder)witha **##README.md##**describingthe example,andan**##metadataCollection##**subfoldercontainingtheopenMINDSinstances(JSON-LDs). Thissubfoldercanbefurtherstructuredor flat.67 +If an openMINDS schema template //does not// define a key **##"_type"##** (as in the first example above), it is interpreted as a **context template** which //has to be// extended to a target template. 53 53 54 54 (% style="text-align: justify;" %) 55 - **(7)** The**##img##** folder shouldcontainimagefilesusedonthatGitHubrepository(e.g.,thelogo of the newopenMINDSmetadatamodel).Thedirectoryoftheimagescanbefurtherstructuredorflat.70 +Context templates are and should be used when multiple openMINDS schemas (target templates) have the same subset of properties. Such a common subset of properties can then be defined within a single context schema instead of each target template which facilitates the long-term maintenance of these properties. 56 56 57 -=== The openMINDS vocabulary === 58 - 59 59 (% style="text-align: justify;" %) 60 -T hroughthetegrationpipeline oftheopenMINDSgenerator,theopenMINDS vocabularyisautomatically gatheredand storedinthemainopenMINDS GitHub in ordertocentrallymaintaingeneral definitionsandreferencesfor**types**and**properties**usedinschemasacrossallopenMINDS repositories.Howthisworksisexplained inthefollowing.73 +To define that a target template is the extension of a context template, the target template can state under **##"_extends"##** the relative path to the context template. For example, the openMINDS core target template **##Dataset##** extends the core concept template **##researchProduct##**: 61 61 62 -(% style="text-align: justify;" %) 63 -Schema types and properties are stored in dedicated JSON files (**##types.json##** and **##properties.json##**) under the folder **##vocab##** located in the main openMINDS GitHub directory. Each schema type and property occurring in the openMINDS metadata models is automatically represented in those files as nested dictionaries. Here a cutout of the **##types.json##**: 64 - 65 65 {{code language="json"}} 66 66 { 67 - ..., 68 - "https://openminds.ebrains.eu/core/Person": { 69 - "description": "Structured information on a person (alive or dead).", 70 - "name": "Person", 71 - "translatableTo": [ 72 - "https://schema.org/Person" 73 - ] 74 - }, 75 - ... 77 + "_type": "https:~/~/openminds.ebrains.eu/core/Dataset", 78 + "_extends": "products/researchProduct.schema.tpl.json", 79 + "properties": [] 76 76 } 77 77 {{/code}} 78 78 79 - ...and a cutout ofthe**##properties.json##**:83 +Note that this convention requires the context and corresponding target templates to be located in the same openMINDS metadata model repository. Note also that both, target and concept template, can or cannot define required properties. In this context, the following rules apply: 80 80 81 -{{code language="json"}} 82 -{ 83 - ..., 84 - "givenName": { 85 - "description": "Name given to a person, including all potential middle names, but excluding the family name.", 86 - "name": "Given name", 87 - "nameForReverseLink": "Is given name of", 88 - "sameAs": [ 89 - "https://schema.org/givenName" 90 - ], 91 - "schemas": [ 92 - "core/v3/actors/person.schema.tpl.json" 93 - ] 94 - }, 95 - ... 96 -} 97 -{{/code}} 85 +1. If a concept template requires properties, the target templates extending this concept template require the same properties. 86 +1. A target template can require properties of the concept template, that are not explicitly required within the concept template. Other target templates extending the same concept template will not require those properties. 87 +1. A target template can define and require additional properties that were not defined and required in the concept template. These properties are not shared with the other target templates that extend the same concept template. 98 98 89 +How to define the expected value of a property will be explained for the different property types in the following sections. 90 + 99 99 (% style="text-align: justify;" %) 100 - Thekeywords ofthose nested dictionaries are pre-defined to consistently capture for all schema types and properties their namespace, their occurrence (cf. **##"schemas"##** in **##properties.json##**), theirgeneraldescription (cf. **##"description"##** in **##types.json##** and **##properties.json##**),and possible references to related or matching schema types (cf. **##"translatableTo"##** in **##types.json##**) and properties (cf. **##""sameAs""##** in **##properties.json##**) of other metadata initiatives (e.g., schema.org).92 +===== String properties ===== 101 101 102 102 (% style="text-align: justify;" %) 103 - This setup also allows us to define some values/entries to be automatically filled in by the openMINDS integration pipeline with each commit to one of the openMINDS repositories(**##"name"##**, **##"schemas"##**) andothers to bemanually editable later on(**##"description"##**, **##"translatableTo"##**, **##"sameAs"##**, **##"nameForReverseLink"##**).95 +(//**coming soon**//) 104 104 105 105 (% style="text-align: justify;" %) 106 - Forsecurity, outdated entries in those openMINDS vocabulary files (e.g., because the namespace of the schema type orproperty changed or the schema type or property was deleted) are not automaticallydeleted, but kept and marked as being deprecated. After evaluation, deprecated schema types orpropertiescan be deleted manually from openMINDS vocabulary.98 +===== Numerical properties ===== 107 107 108 108 (% style="text-align: justify;" %) 109 - With that, the openMINDS vocab reflects always an up-to-date status of the schema types and properties inuse across allopenMINDS metadata models, while providing the opportunity to centrally review and maintain their consistency and references.101 +(//**coming soon**//) 110 110 111 -=== The openMINDS schema template syntax === 112 - 113 113 (% style="text-align: justify;" %) 114 - AllopenMINDS metadata models use a light-weighted schematemplatesyntax fordefining the expected metadata. The correspondingly formatted schema filesuse the extension: **##.schema.tpl.json##**.104 +===== Object properties ===== 115 115 116 116 (% style="text-align: justify;" %) 117 - Although, as the file extension suggests, this openMINDS schema template syntax is inspired by JSON-Schema, it facilitatesor even excludes technical aspectsmakingthe openMINDSschemas more human-readable, especially for untrained eyes. Behind the scenes, within the openMINDS integration pipeline (cf. below), this schema template syntax is then interpreted and flexibly translated to various formal metadata formats (e.g., JSON-Schema).107 +(//**coming soon**//) 118 118 119 119 (% style="text-align: justify;" %) 120 - Despitethe simplification in comparison to JSON-Schema, the openMINDS schema templates are also, atthe core, speciallyformatted JSON files using a particularsyntax, meaning special key-value pairsthat define the validation rules of a schema. Please find in the following a full documentation of the openMINDS schema template syntax and how it's key-value pairs need to be defined and interpreted.110 +===== Property arrays ===== 121 121 122 122 (% style="text-align: justify;" %) 123 - For the more inexperienced programmers, let's start by explainingfirst some general terms that will be later used in theopenMINDS schema template syntax specification. More experienced programmers can of course skip these explanations and jump directly further down to Specifications.113 +(//**coming soon**//) 124 124 125 -**What are strings, integers, floats or booleans?** Generally speaking strings, integers, floats or booleans are derived **data types**. A **string** is defined as a sequence of characters between quotes (e.g., ##"Is this a string? YES!"## or ##'thisIsAlsoAString'##). For a string, openMINDS accepts Unicode characters. An **integer** is a whole number, positive or negative, without decimals, of unlimited length (e.g., ##5##, ##-5## or ##1238921234##). A **float** represents a real number, written with a decimal point dividing the integer and fractional part, both of unlimited length (e.g., ##5.15##, ##-5.15## or ##1238921234.1345##). A **boolean** represents a logical proposition by means of the binary digits ##0## (##false##) and ##1## (##true##), especially in computing and electronics. How a boolean is written depends highly on the format or computational language. 126 126 127 -**What is a list or array?** A **list** is a data structure that is a mutable ordered sequence of values (also called items). The values of a list are typically defined between square brackets (e.g., ##[value1, value2, value3]##). Note that the values within a list do not have to have the same data type. In contrast, an **array** is a data structure that is a mutable unordered sequence of values of the same data type. What data types are accepted for values in a list or in an array is highly depending on the format or computational language. 128 - 129 -**What is a key-value pair or an associative array?** A **key-value pair** (sometimes also called name-value pair, attribute-value pair, property-value pair, or field-value pair) is a basic data representation and standard language feature in computing languages, systems and applications. In most cases this concept is used to build an **associative array** (also called **dictionary**), meaning an unordered list of unique keys with associated values typically defined within curly brackets (e.g., ##{key1: value1, key3: value3, key2:value2}##). What data types are accepted for keys and values highly depends on the format or computational language. Note that a value could also be a data structure, such as a list, an array or an associative array. 130 - 131 -**What is JSON?** JSON is short for **J**ava**S**cript **O**bject **N**otation, a lightweight data-interchange format which is built on associative arrays with key-value pairs and lists. Each JSON document/file begins as associative array. The keys are separated from the values via a colon and key-value pairs are separated by a comma. While a key always has to be a string in double quotes, a value can be a string in double quotes, an integer, a float, a boolean (written as true or false), null, a list or an associative array. Nesting of these structures is unlimited. For more information please go to the official webpage: [[https:~~/~~/www.json.org/>>https://www.json.org/]]. Several serialisation formats have been built on the JSON specification, such as JSON-LD (cf.[[ Application details: JSON-LD - the openMINDS serialization format>>doc:Collabs.openminds.Documentation.Application details.WebHome||target="_blank"]]). In addition, several schema languages have been developed to annotate and validate JSON documents, such as JSON-Schema and SHACL (cf. The openMINDS integration pipeline). 132 - 133 -==== Specifications ==== 134 - 135 -(coming soon) 136 - 137 137 === The openMINDS integration pipeline === 138 138 139 -(//**coming soon**//) If you'd like to learn more about the openMINDS integration pipeline, especially if you'd like to contribute to it, please get in touch with us (the openMINDS development team) via the issues on the openMINDS or openMINDS_generator GitHub or the support email: openminds@ebrains.eu 140 - 141 -{{putFootnotes/}} 118 +(//**coming soon**//)