Changes for page Technical details

Last modified by lzehl on 2021/07/05 18:57

From version 50.1
edited by lzehl
on 2021/03/21 15:21
Change comment: There is no comment for this version
To version 65.1
edited by lzehl
on 2021/06/24 15:55
Change comment: There is no comment for this version

Summary

Details

Page properties
Parent
... ... @@ -1,1 +1,1 @@
1 -Collabs.openminds.openMINDS core.WebHome
1 +Collabs.openminds.Documentation.WebHome
Content
... ... @@ -21,125 +21,121 @@
21 21  (% style="text-align: justify;" %)
22 22  For this to work smoothly for the existing, but also for all new openMINDS metadata models, the corresponding openMINDS submodules (GitHub repositories) have to meet the following requirements:
23 23  
24 -(% style="text-align:justify" %)
25 -* The openMINDS metadata model has to be located on a **public GitHub repository** and published under an **MIT license**.
26 -* The GitHub repository should have at least one **version branch** (e.g. "v1").
27 -* The version branch should have the following **main directory folders**: **##schemas##** (required), **##tests##** (recommended),  **##examples##** (recommended), and **##img##** (optional).
28 -* The **##schemas##** folder should contain the schemas of that metadata model implemented in the **openMINDS schema template syntax** (cf. below). The directory of the schemas can be further structured or flat.
29 -* The **##tests##** folder should contain test-instances (JSON-LDs) for the schemas in a flat directory. The file names for these test-instances should follow the convention of **##<<XXX>>-<<YYY>>.jsonld##** for files that should pass the tests, and **##<<XXX>>-<<YYY>>-nok.jsonld##** for files that should fail the test. In both cases, **##<<XXX>>##** should be replaced with the label of the schema that is tested, and **##<<YYY>>##** with a user defined label for what aspect is tested (e.g., **##person-withoutCI.jsonld##**).
30 -* The **##examples##** folder should contain examples for valid instance collections for that metadata model. Each example should receive its own directory (folder) with a **##README.md##** describing the example, and an **##metadataCollection##** subfolder containing the openMINDS instances (JSON-LDs). This subfolder can be further structured to group related instances.
31 -* The **##img##** folder should contain image files used on that GitHub repository (e.g., the logo of the new openMINDS metadata model). The directory of the images can be further structured or flat.
32 -
33 -=== The openMINDS vocabulary ===
34 -
35 35  (% style="text-align: justify;" %)
36 -Through the integration pipeline of the openMINDS generator, the openMINDS vocabulary is automatically gathered and stored in the main openMINDS GitHub in order to centrally maintain general definitions and references for **types** and **properties** used in schemas across all openMINDS repositories. How this works is explained in the following.
25 +**(1)** The openMINDS metadata model has to be located on a **public GitHub repository** and published under an **MIT license**.
37 37  
38 38  (% style="text-align: justify;" %)
39 -Schema types and properties are stored in dedicated JSON files (**##types.json##** and **##properties.json##**) under the folder **##vocab##** located in the main openMINDS GitHub directory. Each schema type and property occurring in the openMINDS metadata models is automatically represented in those files as nested dictionaries.
28 +**(2)** The GitHub repository should have at least one **version branch** (e.g., "v1").
40 40  
41 41  (% style="text-align: justify;" %)
42 -The keywords of those nested dictionaries are pre-defined to consistently capture for all schema types and properties their namespace, their occurrence, their general description, and possible references to related or matching schema types and properties of other metadata initiatives (e.g. schema.org). This setup also allows us to define some values/entries to be automatically filled in by the openMINDS integration pipeline with each commit to one of the openMINDS repositories (e.g., the namespace and occurrence) and others to be manually editable later on (e.g., the general description and references).
31 +**(3)** The version branch should have the following **main directory folders**: **##schemas##** (required), **##tests##** (recommended),  **##examples##** (recommended), and **##img##** (optional).
43 43  
44 44  (% style="text-align: justify;" %)
45 -For security, outdated entries in those openMINDS vocabulary files (e.g. because the namespace of the schema type or property changed or the schema type or property was deleted) are not automatically deleted, but kept and marked as being deprecated. After evaluation, deprecated schema types or properties can be deleted manually from openMINDS vocabulary.
34 +**(4)** The **##schemas##** folder should contain the schemas of that metadata model implemented in the **openMINDS schema template syntax** (cf. below). The directory of the schemas can be further structured or flat.
46 46  
47 47  (% style="text-align: justify;" %)
48 -With that, the openMINDS vocab reflects always an up-to-date status of the schema types and properties in use across all openMINDS metadata models, while providing the opportunity to centrally review and maintain their consistency and references.
37 +**(5)** The **##tests##** folder should contain test-instances (JSON-LDs) for the schemas in a flat directory. The file names for these test-instances should follow the convention of
49 49  
50 -=== The openMINDS schema template syntax ===
39 +(% style="text-align: center;" %)
40 +**##<<XXX>>-<<YYY>>.jsonld##**
51 51  
52 52  (% style="text-align: justify;" %)
53 -All openMINDS metadata models use a light-weighted schema template syntax for defining the expected metadata. The correspondingly formatted schema files use the extension: **##.schema.tpl.json##**.
43 +for files that should pass the tests, and
54 54  
45 +(% style="text-align: center;" %)
46 +**##<<XXX>>-<<YYY>>-nok.jsonld##**
47 +
55 55  (% style="text-align: justify;" %)
56 -Although, as the file extension suggests, this openMINDS schema template syntax is inspired by JSON-Schema, it facilitates or even excludes technical aspects that are generally expected for the openMINDS schemas making them more human-readable, especially for untrained eyes. Behind the scenes, within the openMINDS integration pipeline (cf. below), this schema template syntax is then interpreted and flexibly translated to various formal metadata formats (e.g., JSON-Schema).
49 +for files that should fail the test. In both cases, **##<<XXX>>##** should be replaced with the label of the schema that is tested, and **##<<YYY>>##** with a user defined label for what aspect is tested (e.g., **##person-withoutCI.jsonld##**).
57 57  
58 58  (% style="text-align: justify;" %)
59 -Despite the simplification in comparison to JSON-Schema, the openMINDS schema templates are also, at the core, specially formatted JSON files using a particular syntax, meaning special key-value pairs that define the validation rules of a schema.
52 +**(6)** The **##examples##** folder should contain examples for valid instance collections for that metadata model. Each example should receive its own directory (folder) with a **##README.md##** describing the example, and an **##metadataCollection##** subfolder containing the openMINDS instances (JSON-LDs). This subfolder can be further structured or flat.
60 60  
61 61  (% style="text-align: justify;" %)
62 -Please find in the following a full documentation of the openMINDS schema template syntax and how it's key-value pairs need to be defined and interpreted.
55 +**(7)** The **##img##** folder should contain image files used on that GitHub repository (e.g., the logo of the new openMINDS metadata model). The directory of the images can be further structured or flat.
63 63  
57 +=== The openMINDS vocabulary ===
58 +
64 64  (% style="text-align: justify;" %)
65 -===== Target & concept templates =====
60 +Through the integration pipeline of the openMINDS generator, the openMINDS vocabulary is automatically gathered and stored in the main openMINDS GitHub in order to centrally maintain general definitions and references for **types** and **properties** used in schemas across all openMINDS repositories. How this works is explained in the following.
66 66  
67 67  (% style="text-align: justify;" %)
68 -Same as in JSON-Schema, all openMINDS schema templates define the expected name (written in **##lowerCamelCase##**) and value of the metadata, typically called property, under the key **##properties##** as nested dictionaries. Furthermore, the names of obligatory metadata can be listed under the key **##required##**. Here a generalized example:
63 +Schema types and properties are stored in dedicated JSON files (**##types.json##** and **##properties.json##**) under the folder **##vocab##** located in the main openMINDS GitHub directory. Each schema type and property occurring in the openMINDS metadata models is automatically represented in those files as nested dictionaries. Here a cutout of the **##types.json##**:
69 69  
70 70  {{code language="json"}}
71 71  {
72 - "properties": {
73 - "propertyNameA": {},
74 - "propertyNameB": {},
75 - "propertyNameC": {}
76 - },
77 - "required": [
78 - "propertyNameA",
79 - "propertyNameC"
80 - ]
67 + ...,
68 + "https://openminds.ebrains.eu/core/Person": {
69 + "description": "Structured information on a person (alive or dead).",
70 + "name": "Person",
71 + "translatableTo": [
72 + "https://schema.org/Person"
73 + ]
74 + },
75 + ...
81 81  }
82 82  {{/code}}
83 83  
84 -(% style="text-align: justify;" %)
85 -In addition, an openMINDS schema //has to have// a key **##"_type"##** to be recognized as **target template**. In other words, the **##"_type"##** is used to define the openMINDS namespace of a corresponding schema using a particular naming convention. Here again a generalized example:
79 +... and a cutout of the **##properties.json##**:
86 86  
87 87  {{code language="json"}}
88 88  {
89 - "_type": "https:~/~/openminds.ebrains.eu/<<schema-model>>/<<schema-name>>",
90 - "properties": {}
83 + ...,
84 + "givenName": {
85 + "description": "Name given to a person, including all potential middle names, but excluding the family name.",
86 + "name": "Given name",
87 + "nameForReverseLink": "Is given name of",
88 + "sameAs": [
89 + "https://schema.org/givenName"
90 + ],
91 + "schemas": [
92 + "core/v3/actors/person.schema.tpl.json"
93 + ]
94 + },
95 + ...
91 91  }
92 92  {{/code}}
93 93  
94 94  (% style="text-align: justify;" %)
95 -Note that **##<<schema-model>>##** has to be replaced with the label of the openMINDS metadata model to which the corresponding schema belongs to, and **##<<schema-name>>##** has to be replaced with the corresponding name of the schema (written in **##CamelCase##**).
100 +The keywords of those nested dictionaries are pre-defined to consistently capture for all schema types and properties their namespace, their occurrence (cf. **##"schemas"##** in **##properties.json##**), their general description (cf. **##"description"##** in **##types.json##** and **##properties.json##**), and possible references to related or matching schema types (cf. **##"translatableTo"##** in **##types.json##**) and properties (cf. **##""sameAs""##** in **##properties.json##**) of other metadata initiatives (e.g., schema.org).
96 96  
97 97  (% style="text-align: justify;" %)
98 -If an openMINDS schema template //does not// define a key **##"_type"##** (as in the first example above), it is interpreted as a **concept template** which //has to be// extended to a target template.
103 +This setup also allows us to define some values/entries to be automatically filled in by the openMINDS integration pipeline with each commit to one of the openMINDS repositories (**##"name"##**, **##"schemas"##**) and others to be manually editable later on (**##"description"##**, **##"translatableTo"##**, **##"sameAs"##**, **##"nameForReverseLink"##**).
99 99  
100 100  (% style="text-align: justify;" %)
101 -Concept templates are and should be used when multiple target templates have the same subset of properties, because they facilitate the long-term maintenance of those shared properties: Instead of defining the same properties repeatedly within multiple target templates, the common subset can be defined within a single concept template and passed on to all extending target templates.
106 +For security, outdated entries in those openMINDS vocabulary files (e.g., because the namespace of the schema type or property changed or the schema type or property was deleted) are not automatically deleted, but kept and marked as being deprecated. After evaluation, deprecated schema types or properties can be deleted manually from openMINDS vocabulary.
102 102  
103 103  (% style="text-align: justify;" %)
104 -To define that a target template is the extension of a concept template, the target template can state under **##"_extends"##** the relative path to the concept template. For example, the openMINDS core target template **##Dataset##** extends the core concept template **##researchProduct##** as indicated here:
109 +With that, the openMINDS vocab reflects always an up-to-date status of the schema types and properties in use across all openMINDS metadata models, while providing the opportunity to centrally review and maintain their consistency and references.
105 105  
106 -{{code language="json"}}
107 -{
108 - "_type": "https:~/~/openminds.ebrains.eu/core/Dataset",
109 - "_extends": "products/researchProduct.schema.tpl.json"
110 -}
111 -{{/code}}
111 +=== The openMINDS schema template syntax ===
112 112  
113 113  (% style="text-align: justify;" %)
114 -Note that this convention requires the concept and corresponding target templates to be located in the same openMINDS metadata model repository. Note also that for properties, the following rules apply for target and concept template: 
114 +All openMINDS metadata models use a light-weighted schema template syntax for defining the expected metadata. The correspondingly formatted schema files use the extension: **##.schema.tpl.json##**.
115 115  
116 -1. A concept template has to define some properties which will be inherited by all extending target templates.
117 -1. If a concept template additionally states that some of these properties are required, all extending target templates will require the same properties. 
118 -1. A target template can require properties of the concept template, that are not explicitly required within the concept template. In such a case, the other target templates extending the same concept template will not require those properties.
119 -1. A target template can (but does not have to) define and require additional properties that were not defined and required in the concept template. These additionally defined and required properties will not be shared with the other target templates extending the same concept template.
120 -
121 121  (% style="text-align: justify;" %)
122 -How to define the expected value of a property will be explained for the different property types in the following sections.
117 +Although, as the file extension suggests, this openMINDS schema template syntax is inspired by JSON-Schema, it facilitates or even excludes technical aspects making the openMINDS schemas more human-readable, especially for untrained eyes. Behind the scenes, within the openMINDS integration pipeline (cf. below), this schema template syntax is then interpreted and flexibly translated to various formal metadata formats (e.g., JSON-Schema).
123 123  
124 124  (% style="text-align: justify;" %)
125 -===== Defining expected values =====
120 +Despite the simplification in comparison to JSON-Schema, the openMINDS schema templates are also, at the core, specially formatted JSON files using a particular syntax, meaning special key-value pairs that define the validation rules of a schema. Please find in the following a full documentation of the openMINDS schema template syntax and how it's key-value pairs need to be defined and interpreted.
126 126  
127 127  (% style="text-align: justify;" %)
128 -The expected value of a property can be defined in large parts in the same way as in JSON-Schema, with some openMINDS syntax specific simplifications and modifications.
123 +For the more inexperienced programmers, let's start by explaining first some general terms that will be later used in the openMINDS schema template syntax specification. More experienced programmers can of course skip these explanations and jump directly further down to Specifications.
129 129  
130 -(% style="text-align: justify;" %)
131 -On the first level, the **##"type"##** of the expected property value needs to be defined. In principle, the openMINDS template syntax supports the same value types as JSON-Schema Draft 7.0, meaning:
132 -+ **##"string"##**
133 -+ **##"number"##**
134 -+ **##"integer"##**
135 -+ **##"array"##**
136 -+ **##"boolean"##**
137 -+ **##"null"##**
138 -+ **##"object"##** 
125 +**What are strings, integers, floats or booleans?** Generally speaking strings, integers, floats or booleans are derived **data types**. A **string** is defined as a sequence of characters between quotes (e.g., ##"Is this a string? YES!"## or ##'thisIsAlsoAString'##). For a string, openMINDS accepts Unicode characters. An **integer** is a whole number, positive or negative, without decimals, of unlimited length (e.g., ##5##, ##-5## or ##1238921234##). A **float** represents a real number, written with a decimal point dividing the integer and fractional part, both of unlimited length (e.g., ##5.15##, ##-5.15## or ##1238921234.1345##). A **boolean** represents a logical proposition by means of the binary digits ##0## (##false##) and ##1## (##true##), especially in computing and electronics. How a boolean is written depends highly on the format or computational language.
139 139  
140 -Also very similar to JSON-Schema, additional type-specific keys can be used to set further requirements for the expected value. H
127 +**What is a list or array?** A **list** is a data structure that is a mutable ordered sequence of values (also called items). The values of a list are typically defined between square brackets (e.g., ##[value1, value2, value3]##). Note that the values within a list do not have to have the same data type. In contrast, an **array** is a data structure that is a mutable unordered sequence of values of the same data type. What data types are accepted for values in a list or in an array is highly depending on the format or computational language.
141 141  
129 +**What is a key-value pair or an associative array?** A **key-value pair** (sometimes also called name-value pair, attribute-value pair, property-value pair, or field-value pair) is a basic data representation and standard language feature in computing languages, systems and applications. In most cases this concept is used to build an **associative array** (also called **dictionary**), meaning an unordered list of unique keys with associated values typically defined within curly brackets (e.g., ##{key1: value1, key3: value3, key2:value2}##). What data types are accepted for keys and values highly depends on the format or computational language. Note that a value could also be a data structure, such as a list, an array or an associative array.
142 142  
131 +**What is JSON?** JSON is short for **J**ava**S**cript **O**bject **N**otation, a lightweight data-interchange format which is built on associative arrays with key-value pairs and lists. Each JSON file begins as associative array. The keys are separated from the values via a colon and key-value pairs are separated by a comma. While a key always has to be a string in double quotes, a value can be a string in double quotes, an integer, a float, a boolean (written as true or false), null, a list or an associative array. Nesting of these structures is unlimited. For more information please go to the official webpage: [[https:~~/~~/www.json.org/>>https://www.json.org/]].
132 +
133 +==== Specifications ====
134 +
135 +(coming soon)
136 +
143 143  === The openMINDS integration pipeline ===
144 144  
145 -(//**coming soon**//)
139 +(//**coming soon**//) If you'd like to learn more about the openMINDS integration pipeline, especially if you'd like to contribute to it, please get in touch with us (the openMINDS development team) via the issues on the openMINDS or openMINDS_generator GitHub or the support email: openminds@ebrains.eu
140 +
141 +{{putFootnotes/}}
Public

openMINDS