Changes for page Technical details
Last modified by lzehl on 2021/07/05 18:57
Summary
-
Page properties (2 modified, 0 added, 0 removed)
Details
- Page properties
-
- Parent
-
... ... @@ -1,1 +1,1 @@ 1 -Collabs.openminds. Documentation.WebHome1 +Collabs.openminds.openMINDS core.WebHome - Content
-
... ... @@ -10,155 +10,150 @@ 10 10 Please find below a documentation of the layout and requirements needed to keep the openMINDS modularity, the syntax of the openMINDS schema template, as well as the openMINDS integration pipeline. 11 11 ))) 12 12 13 -=== The openMINDSumbrella ===13 +=== Overview of the openMINDS layout === 14 14 15 15 (% style="text-align: justify;" %) 16 -openMINDS is the overall umbrella for a set of integratedmetadata models for describing neuroscience research products in graph databases. The correctintegration of these metadata models as well as the provision of a central access point is handledby an integration pipeline. All metadatamodels are developed on separateGitHub repositories,in order to facilitateextensions andmaintenance ofexisting, as well as development andintegrationof new openMINDS metadatamodels and schemas. The integrationpipeline ensures that the central openMINDS GitHub repository ingests all these GitHub repositories as [[git-submodules>>https://git-scm.com/docs/git-submodule||rel="noopener noreferrer" target="_blank"]],integrates the respectivemetadata modelsand builds the openMINDS GitHub pages, as well as ZIPfiles containing therespectiveopenMINDS schemas in the supported formats, such as theopenMINDS syntax (cf. below),JSON-Schema, orHTML. In thefollowing we willsummarizethe contentsandrequirements for thecentral openMINDS GitHub repository as well as all metadata model git-submodules. TheopenMINDS integration pipeline will be handled in a separatechapter (cf.below).16 +In summary, openMINDS is the overall umbrella for a set of distributed GitHub repositories, each defining a particular metadata model for neuroscience research products. 17 17 18 -(% style="text-align: justify;" %) 19 -Let us start with the [[central openMINDS GitHub repository>>https://github.com/HumanBrainProject/openMINDS||rel="noopener noreferrer" target="_blank"]] which has a **main branch**, a **documentation branch**, and **version branches** (naming convention: vX; e.g., v1). Official releases (naming convention: vX.Y; e.g., v1.0) are tagged and provided as release packages. 18 +=== The openMINDS schema template syntax === 20 20 21 21 (% style="text-align: justify;" %) 22 - The**main branch** hosts thegeneral README, the LICENSEdocument, the CONTRIBUTING document,andhe generalopenMINDSlogo.Inaddition, it maintainstheopenMINDS vocabulary(vocab;cf. below) which providesgeneraldefinitionsand references for schematypes andproperties usedacross all openMINDSmetadatamodels and theirversions, as wellasthebash script thatbuildsthecontent of thedocumentation and versionbranches.21 +All openMINDS metadata models use a light-weighted schema template syntax for defining the expected metadata. The correspondingly formatted schema files use the extension: **##.schema.tpl.json##**. 23 23 24 24 (% style="text-align: justify;" %) 25 - The **documentation** branchhoststheHTMLfilesthat build the[[openMINDSGitHub pages>>https://humanbrainproject.github.io/openMINDS/||rel="noopenernoreferrer"target="_blank"]],aswellasaZIPfilefor eachversionbranch andofficialreleaseontainingthe respective openMINDS schemas in thecurrentlysupportedformats,suchas the openMINDSsyntax (`.schema.tpl.json`;cf. below),JSON-Schema(`.schema.json`),orHTML(`.html`).24 +Although, as the file extension suggests, this openMINDS schema template syntax is inspired by JSON-Schema, it facilitates or even excludes technical aspects that are generally expected for the openMINDS schemas making them more human-readable, especially for untrained eyes. Behind the scenes, within the openMINDS integration pipeline (cf. below), this schema template syntax is then interpreted and flexibly translated to various formal metadata formats (e.g., JSON-Schema). 26 26 27 27 (% style="text-align: justify;" %) 28 - The**version** brancheshosttherespective openMINDS schemas of a major versionbyingestingthecorrespondingmetadata models as git-submodules.Note that these versionbranches canhave official release tags.Ifa version branchhas an official releasetag, only backwards compatiblechangescan be mergedon this branch. This can include correctionsf typos in instructions,introduction ofadditionalproperties to schemas,looseningconstraints onexpected valuenumbers orformats, granting additional relations between schemas, andaddingnew schemas (ifthey do not require relational changesin existingschemas).Except for typo corrections, these changes aretypically tagged as sub-releases forthe respective major version(e.g., v1.1). If a version branch does nothave an official release tag,yet, also non-backwards compatiblechanges can be merged on this branch. This can include renamingof existingproperties, increasing constraintsonexpectedvaluenumbers or formats, removingrelationsbetween schemasandadding new schemas, ifthey cause relational changesin existing schemas.In case allversion branches have officialrelease tags, a new non-backwards compatible changewould leadtothe creation of anew version branch(with a respectively increasedmajor version number).27 +Despite the simplification in comparison to JSON-Schema, the openMINDS schema templates are also, at the core, specially formatted JSON files using a particular syntax, meaning special key-value pairs that define the validation rules of a schema. 29 29 30 30 (% style="text-align: justify;" %) 31 - As mentioned already above, thesetupofthe centralopenMINDS GitHub repository is maintainedbythe openMINDS integration pipeline (cf. below).Note thatthe pipelineis configured insuch a way,thateach commiton one of the openMINDS submoduleswilltriggera new build ofthecentralopenMINDSrepositoryensuringthatits content is always up-to-date.30 +Please find in the following a full documentation of the openMINDS schema template syntax and how it's key-value pairs need to be defined and interpreted. 32 32 33 33 (% style="text-align: justify;" %) 34 - Forthis to work smoothly for theexisting,butalsofor allnew openMINDS metadatamodels,thecorresponding openMINDS submodules (GitHub repositories) haveto meet the following requirements:33 +===== Target & concept templates ===== 35 35 36 36 (% style="text-align: justify;" %) 37 - **(1)**The openMINDSmetadatamodelhastobe located ona**publicGitHubrepository**andpublished underan**MIT license**.36 +Same as in JSON-Schema, all openMINDS schema templates define the expected name (written in **##lowerCamelCase##**) and value of the metadata, typically called property, under the key **##properties##** as nested dictionaries. Furthermore, the names of obligatory metadata can be listed under the key **##required##**. Here a generalized example: 38 38 39 -(% style="text-align: justify;" %) 40 -**(2)** The GitHub repository of such a metadata model should have at least one **version branch** (naming convention: vX, where X is a major version number). 38 +{{code language="json"}} 39 +{ 40 + "properties": { 41 + "propertyNameA": {}, 42 + "propertyNameB": {}, 43 + "propertyNameC": {} 44 + }, 45 + "required": [ 46 + "propertyNameA", 47 + "propertyNameC" 48 + ] 49 +} 50 +{{/code}} 41 41 42 42 (% style="text-align: justify;" %) 43 -** (3)**Theversionbranch shouldhave the following**folders**&**files**:53 +In addition, an openMINDS schema //has to have// a key **##"_type"##** to be recognized as **target template**. In other words, the **##"_type"##** is used to define the openMINDS namespace of a corresponding schema using a particular naming convention. Here again a generalized example: 44 44 45 -* **##schemas/##** (required) - contains the schemas of the respective metadata model implemented in the reduced openMINDS syntax (cf. below). The sub-directory of the "schemas" folder can be further structured or flat. 46 -* **##tests/##** (recommended) - contains test-instances (JSON-LDs) for checking the constraints defined in the schemas of the respective metadata model. The sub-directory of the "tests" folder should follow the same structure as the "schemas" folder with an additional sub-directory for each schema. The file names of the test-instances should be written in lowerCamelCase and state first the name of the respectively tested schema and second, separated with an underscore, what schema constrain is tested (e.g., **##contactInformation_validEmail.jsonld##**). If a test-instance is expected to fail the schema validation the file name should receive the postfix **##"_nok"##** (e.g., **##contactInformation_invalidEmail_nok.jsonld##**). 47 -* **##examples/##** (recommended) - contains examples for valid instance collections for the respective metadata model. Each example should receive its own directory (folder) with a **##README.md##** describing the example, and an **##metadataCollection##** subfolder containing the openMINDS instances (JSON-LDs). This subfolder can be further structured or flat. 48 -* **##img/##** (optional) - contains typically the logo of the openMINDS submodule. 49 -* **##instances/##** (optional) - contains the controlled metadata instances (JSON-LDs) for selected schemas of the respective metadata model. The sub-directory of the "instances" folder should follow the same structure as the "schemas" folder with an additional sub-directory for each schema. The file names of the controlled instances should be written in lowerCamelCase and state the simple, human-readable identifier of the instance (e.g., **##homoSapiens.jsonld##** for the controlledTerms schema **##Species##**). 50 -* **##version.txt##** (required) - states the identifier of the version branch (e.g., **##v1##**) of the respective metadata model. 51 -* **##README.md##** (required) - contains a short content description of the respective metadata model. 52 -* **##LICENSE.txt##** (required) - states the license for the respective metadata model (has to be MIT). 53 - 54 -=== The openMINDS vocabulary === 55 - 56 -(% style="text-align: justify;" %) 57 -Located under the folder **##vocab##** in the main openMINDS GitHub directory, the openMINDS vocabulary is semi-automatically gathered and stored in dedicated JSON files ([[**##types.json##**>>https://raw.githubusercontent.com/HumanBrainProject/openMINDS/v2/vocab/types.json||rel="noopener noreferrer" target="_blank"]] and [[**##properties.json##**>>https://raw.githubusercontent.com/HumanBrainProject/openMINDS/v2/vocab/properties.json||rel="noopener noreferrer" target="_blank"]]). The openMINDS integration pipeline makes sure that both files are updated with each commit to any of the GitHub repositories for the openMINDS metadata models. With that, the openMINDS vocab reflects always an up-to-date status of the general attributes of existing **schemas** and **properties** across all openMINDS metadata models, while providing the opportunity to centrally review and maintain their consistency. In addition, this design allows us to centrally define and maintain multiple references to related schemas and matching schema properties of other metadata initiatives. How this works in detail is explained in the following. 58 - 59 -(% style="text-align: justify;" %) 60 -The **##types.json##** file is an associative array listing all existing openMINDS schemas (via their type). For each openMINDS schema, a small list of general attributes are provided in a nested associative array. Currently, the following attributes are captured: 61 - 62 62 {{code language="json"}} 63 63 { 64 - "OPENMINDS_SCHEMA_TYPE": { 65 - "description": "GENERAL_DESCRIPTION", 66 - "name": "DISPLAY_LABEL", 67 - "translatableTo": [ 68 - "REFERENCE_TO_RELATED_SCHEMA_OF_OTHER_INITIATIVE" 69 - ] 70 - } 57 + "_type": "https:~/~/openminds.ebrains.eu/<<schema-model>>/<<schema-name>>", 58 + "properties": {} 71 71 } 72 72 {{/code}} 73 73 74 74 (% style="text-align: justify;" %) 75 - Witheachnew schemacommittedto one of theopenMINDSmetadatamodels,anew entry isappendedtothe**##types.json##** file,with thedisplay labelautomatically derivedfromtherespectiveschema typendhe remainingattributes predefined with a nullvalue. Once an entry fora schemais made inthe**##types.json##** file,thevaluesof all attributes (**##"name"##**, **##"description"##**, and **##"translatableTo"##**)canbe manually edited. All manual editionswillbepreserved and not overwritten when the file is updatedagainwitha new commit. In case a schemais deleted from the openMINDS metadata models, thecorrespondingentry in the**##types.json##**fileis markedasbeing deprecated (additionalattribute-valuepair;**##"deprecated": true##**). It only can bepermanently removed from the **##types.json##**file, if the entry is manually deleted.63 +Note that **##<<schema-model>>##** has to be replaced with the label of the openMINDS metadata model to which the corresponding schema belongs to, and **##<<schema-name>>##** has to be replaced with the corresponding name of the schema (written in **##CamelCase##**). 76 76 77 77 (% style="text-align: justify;" %) 78 - Similartothe**##types.json##**file,the **##properties.json##**file is anassociativearraylisting all propertiesacrossallexistingopenMINDS schemas(viatheroperty name).For eachopenMINDS property,amalllistfgeneral attributesareprovidedinanestedassociative array. Currently,thefollowingattributes are captured:66 +If an openMINDS schema template //does not// define a key **##"_type"##** (as in the first example above), it is interpreted as a **concept template** which //has to be// extended to a target template. 79 79 68 +(% style="text-align: justify;" %) 69 +Concept templates are and should be used when multiple target templates have the same subset of properties, because they facilitate the long-term maintenance of those shared properties: Instead of defining the same properties repeatedly within multiple target templates, the common subset can be defined within a single concept template and passed on to all extending target templates. 70 + 71 +(% style="text-align: justify;" %) 72 +To define that a target template is the extension of a concept template, the target template can state under **##"_extends"##** the relative path to the concept template. For example, the openMINDS core target template **##Dataset##** extends the core concept template **##researchProduct##** as indicated here: 73 + 80 80 {{code language="json"}} 81 81 { 82 - "PROPERTY_NAME": { 83 - "description": "GENERAL_DESCRIPTION", 84 - "name": "DISPLAY_LABEL", 85 - "nameForReverseLink": "DISPLAY_LABEL_OF_REVERSED_LINK", 86 - "sameAs": [ 87 - "REFERENCE_TO_MATCHING_SCHEMA-PROPERTY_OF_OTHER_INITIATIVE" 88 - ], 89 - "schemas": [ 90 - "RELATIVE_PATH_TO_OPENMINDS-SCHEMA_USING_THIS_PROPERTY" 91 - ] 92 - } 76 + "_type": "https:~/~/openminds.ebrains.eu/core/Dataset", 77 + "_extends": "products/researchProduct.schema.tpl.json" 93 93 } 94 94 {{/code}} 95 95 96 96 (% style="text-align: justify;" %) 97 - With each new property committedto a schemaof one oftheopenMINDS metadata models, a new entryisappended tothe **##properties.json##** file, with the display label and listof schemasinwhich this property occurs automatically derived. The remaining attributesare initially provided witha null value.Oncean entry for apropertyis madein the **##properties.json##** file, the values of all attributes(**##"name"##**, **##"description"##**, **##"nameForReversedLink"##**,and **##"sameAs"##**) can be manually edited,exceptfor **##"schemas"##** which will bealways automatically updated. All thosemanual editionswillbepreserved and not overwritten when the file is updated again with a new commit. Incase a property is not usedanymoreinany ofthe schemas fromthe openMINDS metadata models,the corresponding entryin the **##properties.json##**file ismarked as being deprecated (additionalattribute-valuepair; **##"deprecated":true##**).Itonly canbepermanentlyremovedfromthe **##properties.json##**file, iftheentry ismanually deleted.82 +Note that this convention requires the concept and corresponding target templates to be located in the same openMINDS metadata model repository. Note also that for properties, the following rules apply for target and concept template: 98 98 99 -=== The openMINDS syntax === 84 +1. A concept template has to define some properties which will be inherited by all extending target templates. 85 +1. If a concept template additionally states that some of these properties are required, all extending target templates will require the same properties. 86 +1. A target template can require properties of the concept template, that are not explicitly required within the concept template. In such a case, the other target templates extending the same concept template will not require those properties. 87 +1. A target template can (but does not have to) define and require additional properties that were not defined and required in the concept template. These additionally defined and required properties will not be shared with the other target templates extending the same concept template. 100 100 101 101 (% style="text-align: justify;" %) 102 - AllopenMINDSmetadata modelsaredefinedusing a light-weightedschemasyntax. Althoughthisschemasyntaxisinspiredby JSON-Schema,itoutsourcesmostschematechnicalities to behandled bytheopenMINDS integrationpipeline,makingthe openMINDS schemas more human-readable,especially for untrained eyes.90 +How to define the expected value of a property will be explained for the different property types in the following sections. 103 103 104 104 (% style="text-align: justify;" %) 105 - Thefewremaining customized technicalpropertieswhich need additional interpretation or translation to a formal schema languages (e.g. JSON-Schema) have an underscore as prefix (e.g., **##"_type"##**). Withinthe openMINDS integrationpipeline (cf. below), the schematemplatesyntax is interpreted, extended and flexibly translatedtovarious formalschema languages.All further specifications of the openMINDS schema template syntax are described below.93 +===== Properties expecting a string value ===== 106 106 107 107 (% style="text-align: justify;" %) 108 - ====BasicopenMINDSschemastructure====96 +Properties that expect values of type string can be defined in large parts in the same way as in JSON-Schema. Without any further formatting requirements the definition of such a property looks like this: 109 109 110 -(% style="text-align: justify;" %) 111 -All openMINDS schemas need to have the extension **##.schema.tpl.json##** and each schema is defined as a nested associative array (dictionary) with the following conceptual structure: 112 - 113 113 {{code language="json"}} 114 114 { 115 - "_type": "https://openminds.ebrains.eu/LABEL_OF_METADATA_MODEL/SCHEMA_NAME", 116 116 "properties": { 117 - "PROPERTY_NAME": { 118 - "type": "DATA_TYPE", 119 - "_instruction": "METADATA_ENTRY_INSTRUCTION" 120 - }, 121 - "required": [ 122 - "PROPERTY_NAME" 123 - ] 101 + "freeStringProperty": { 102 + "type": "string" 103 + } 104 + } 124 124 } 125 125 {{/code}} 126 126 127 127 (% style="text-align: justify;" %) 128 - **##"_type"##**definestheschema type(or namespace)with the depictednamingconvention,wherethelabelof therespectiveopenMINDSmetadatamodel(e.g., **##"core"##**)andtheschemaname (format:UpperCamelCase; e.g. **##"ContactInformation"##**)haveto bespecified.Obviously,the schemaameshould bemeaningfulandprovidesome insidesintowhatmetadatacontenttheschemacovers.109 +In accordance with JSON-Schema, it is possible, though, to further define a selected list of built-in formats that are accepted for the expected string. Supported are all [[built-in formats of JSON-Schema Draft 7.0>>https://json-schema.org/understanding-json-schema/reference/string.html#built-in-formats||rel="noopener noreferrer" target="_blank"]]. Assuming that my string property only accepts a value of format "email", the openMINDS syntax is the following: 129 129 130 -(% style="text-align: justify;" %) 131 -Under **##"properties"##** a nested associative array is defined, where each key defines the property name (format: lowerCamelCase; e.g. **##"givenName"##**). The corresponding value is again a nested associative array defining the expected data **##"type"##** (cf. below) and the **##"_instructions"##** for entering the correct metadata for the respective property. 111 +{{code language="json"}} 112 +{ 113 + "properties": { 114 + "emailProperty": { 115 + "type": "string", 116 + "_formats": [ 117 + "email" 118 + ] 119 + } 120 + } 121 +} 122 +{{/code}} 132 132 133 133 (% style="text-align: justify;" %) 134 - Under**##"required"##**alistofpropertynamescanbe providedthat areobligatoryto beentin acorrectlyinstantiatedmetadatainstance oftherespectiveschema.Ifnoneoftheproperties arerequired, this key-valuepairdoesnothavetobespecified.125 +Also like JSON-Schema, it is possible to define a customized regular expression pattern for the expected string, supporting the [[ECMA-262 regex-dialect>>https://www.ecma-international.org/publications-and-standards/standards/ecma-262/]]. Assuming that my string property only accepts a Gregorian calendar year, the corresponding openMINDS syntax look like this: 135 135 136 -(% style="text-align: justify;" %) 137 -==== Schemas extending a context-schema ==== 127 +{{code language="json"}} 128 +{ 129 + "properties": { 130 + "yearProperty": { 131 + "type": "string", 132 + "pattern": "([0-9]{4})" 133 + } 134 + } 135 +} 136 +{{/code}} 138 138 139 139 (% style="text-align: justify;" %) 140 - Inthe case that several schemas are highly related and contain a common set ofproperties,it ispossibleo definea non-type context-schema with these common properties thatcanbe extendedand modified by the group of relatedschemas.139 +===== Properties expecting a numerical value ===== 141 141 142 142 (% style="text-align: justify;" %) 143 - All properties and constraints(e.g. required properties, expected data types) defined in the context-schema are passed on to the schemas extendingthiscontext-schema. Eachof these schemas candefine additional properties, or (if necessary) can overwrite the constraints of the context-schema (incl.**##"_instructions"##**). In order to state that a schema is extending a context-schema, the following additional key-value pair has to be added to the schema template above:142 +(//**coming soon**//) 144 144 145 -(% style="text-align: center;" %) 146 -**##"_extends": "RELATIVE_PATH_TO_OPENMINDS-CONTEXT-SCHEMA"##** 147 - 148 148 (% style="text-align: justify;" %) 149 - Thisdesign not only makes it easier to identify highly related schemas, but also facilitates the maintenance of the commonly used properties. A good hands-on example, is the context-schema [[ResearchProduct>>https://raw.githubusercontent.com/HumanBrainProject/openMINDS_core/v3/schemas/products/researchProduct.schema.tpl.json||rel="noopenernoreferrer" target="_blank"]] which is extended by the following schema set: [[Dataset>>https://raw.githubusercontent.com/HumanBrainProject/openMINDS_core/v3/schemas/products/dataset.schema.tpl.json||rel="noopener noreferrer" target="_blank"]],[[MetaDataModel>>https://raw.githubusercontent.com/HumanBrainProject/openMINDS_core/v3/schemas/products/metaDataModel.schema.tpl.json]],[[Model>>https://raw.githubusercontent.com/HumanBrainProject/openMINDS_core/v3/schemas/products/model.schema.tpl.json||rel="noopenernoreferrer" target="_blank"]], and [[Software>>https://raw.githubusercontent.com/HumanBrainProject/openMINDS_core/v3/schemas/products/software.schema.tpl.json||rel="noopener noreferrer" target="_blank"]].145 +===== Properties expecting another object ===== 150 150 151 151 (% style="text-align: justify;" %) 152 - ==== Data type dependingconstraints ====148 +(//**coming soon**//) 153 153 154 154 (% style="text-align: justify;" %) 155 - Dependingonthe expected data **##"type"##** additionalconstraintscan be made forthe metadata entry ofa respective property. Currently, the openMINDS schematemplate syntax supports thefollowing data types:**##"string"##**, ##**"integer"**##, **##"float"##**, **##"boolean"##**, **##"array"##** and **##"object"##**.151 +===== Properties expecting an array of values ===== 156 156 157 157 (% style="text-align: justify;" %) 158 - If the expected data**##"type"##** is a **##"string"##** the expected numberof characters, the format or a regularexpression patternof the string can be further defined.154 +(//**coming soon**//) 159 159 156 + 160 160 === The openMINDS integration pipeline === 161 161 162 -(//**coming soon**//) If you'd like to learn more about the openMINDS integration pipeline, especially if you'd like to contribute to it, please get in touch with us (the openMINDS development team) via the issues on the openMINDS or openMINDS_generator GitHub or the support email: openminds@ebrains.eu 163 - 164 -{{putFootnotes/}} 159 +(//**coming soon**//)