the k8s cluster at JSC and all prod k8s-based ( including Jupyter Lab at JSC ) services will be down tonight, the 31st of March at 20:30 CEST for 30 minutes for a critical update. For a list of affected services see https://gitlab.ebrains.eu/ri/tech-hub/apps/apps-base/-/issues/18


Last modified by melissargos on 2024/10/11 18:22

Show last authors
1 (% class="jumbotron" %)
2 (((
3 (% class="container" %)
4 (((
5 = Onboarding to the Medical Informatics Platform MIP =
6
7 Step-by-step guidance
8 )))
9 )))
10
11 (% class="row" %)
12 (((
13 (% class="col-xs-12 col-sm-8" %)
14 (((
15 = What can I find here? =
16
17 * Creation of a MIP User Account
18 * MIP Data Governance
19 * MIP Data Flow
20 * MIP GDPR compliance assessment
21
22 = =
23
24
25
26
27
28 (((
29
30
31 ----
32
33
34
35 [[image:https://mip.ebrains.eu/img/section1.2b1f04df.png||alt="MIP user interface"]]
36
37 //**Figure 1:** User Interface of the Medical Informatics Platform MIP//
38
39 == ​Creation of a MIP User Account ==
40
41 **Prerequisite – Step 1**: Access to the MIP requires an EBRAINS user account, which needs to be permitted and authenticated. EBRAINS user accounts are available to users with a legitimate interest (mainly research and development) from Europe and beyond.
42
43 Request an EBRAINS user account: [[https:~~/~~/www.ebrains.eu/page/sign-up>>url:https://www.ebrains.eu/page/sign-up]]
44 The EBRAINS user account allows users to directly access the** Public MIP ([[https:~~/~~/mip.ebrains.eu/>>url:https://mip.ebrains.eu/]]**) with no further accreditation being required.
45
46 **Access to a specific MIP Federation – Step 2:** EBRAINS authorised Users with an active EBRAINS account can request access to a specific MIP Federation by contacting [[support@ebrains.eu>>path:mailto:support@ebrains.eu]], who will forward the specific request to the MIP Management team. Users can also get in direct contact with the MIP team via the online form on the EBRAINS website: [[https:~~/~~/www.ebrains.eu/tools/medical-informatics-platform>>url:https://www.ebrains.eu/tools/medical-informatics-platform]]
47
48 The Data Science Steering Committee (DSSC) of the specific federation will be involved in the accreditation process to receive access approvals. The creation of a new MIP Federation projects can be initiated at any time.
49
50 Users are required to accept the EBRAINS Terms and Policies [[https:~~/~~/www.ebrains.eu/page/terms-and-policies>>url:https://www.ebrains.eu/page/terms-and-policies]], to indicate acceptance and compliance with all applicable laws, regulations, rules, and approvals in the use and sharing of the data, including, but not limited to, the General Data Protection Regulation (GDPR).
51
52 Upon login to the MIP, users are mandated to accept the Terms of Use of the MIP. Accredited users access the MIP through a web-based interface, which will provide them with direct access to the respective federation on the MIP.
53
54 == MIP Data Governance ==
55
56 [[image:1728560441625-436.png]]
57
58 **Figure 2:** MIP Data Governance Flow
59
60 //T(% class="small" %)his illustration depicts how data governance and data flow in the MIP are organised and how the legal framework and data management are interlinked. Decision points are indicated.//
61
62 (% style="color:#c0392b" %)//~*~*The **MIP Data Protection Impact Assessment (DPIA) **is currently under full revision and will become functional upon final approval by the CHUV DPO. Per Article 35(3)(b) of GDPR a Data Protection Impact Assessment is required for processing of sensitive data.//
63
64 === //MIP and data anonymisation// ===
65
66 **Note**: (% style="color:#27ae60" %)**The MIP is handling anonymised data.**(%%) The definition for anonymisation (//ISO standard (ISO 29100:2011)//) of personal data is the process of encrypting or removing personally identifiable data from data so that a person can no longer be directly or indirectly identified (see also **Recital 26 of the GDPR)**. As soon a person cannot be re-identified the data is no longer considered personal data and the GDPR does not apply for further use.
67
68 However, processing personal data **for the purpose of anonymisation** is still processing that must have a **legal basis under Article 6 of GDPR**. The anonymisation process is defined as “**further processing**” and this processing must be compliant with the principle of purpose limitation. The process of data anonymisation can be used to improve data protection compliance, e.g., as part of the “**privacy by design**” strategy, with the goal to improve the protection of the processed data; or as part of the “**data minimisation**” strategy, where data can be anonymised and used without the risk of harming the data subjects.
69
70 (% style="color:#27ae60" %)**Both strategies are followed by the MIP.**
71
72 === MIP concepts and definitions ===
73
74 * **Common Data Elements (CDEs)**
75
76 A set of standard variables defined by clinical experts and data scientists, which would be used by researchers to perform analysis on specific medical conditions at the federation level. In the MIP context, we use the term CDEs to refer to the standardised federated datamodels only.
77
78 * **Data Element**
79
80 In metadata, the term data element is an atomic unit of data that has precise meaning or precise semantics.
81
82 * **Datamodel (Metadata)**
83
84 A Datamodel (Metadata) describes the structure of database variables found in specific extracts of a hospital database, including descriptive metadata, structural metadata, administrative metadata, reference metadata and statistical metadata.
85
86 * **Database Variables**
87
88 A variable or scalar is a storage address (identified by an index or address) paired with an associated symbolic name, which contains some known or unknown quantity of information referred to as a value.
89
90 * **Electronic Health Records (EHR)**
91
92 Health information and clinical records registered per each patient per visit in the hospital's database (Oracle, SQL, or any other database system) and usually transferred in db or CSV format. EHRs usually contain different levels of data; we might define them in this context as spaces, domain, and sub-domain. For example, a space might include demographics, social status, or patient's medical history as different data domains. On the other hand, EHR contain other data spaces related to the specific medical condition such as Dementia or Epilepsy where each space includes specific domain and sub-domain, such as medical assessments and tests, diagnoses, treatment, and operations, etc.
93
94 * **Medical Conditions**
95
96 Diseases are often known to be medical conditions that are associated with specific symptoms and signs.
97
98 == MIP Data Flow ==
99
100 ​[[image:1728560774193-188.png]]
101
102
103 |(((
104 **Figure 3**// MIP Data Flow//
105
106 //T(% class="small" %)his diagram illustrates the MIP Data Flow, indicating processing steps prior to data upload and steps after data upload to the MIP.  EHR – electronic health record, MRI - magnetic resonance imaging, ETL - data integration (extract, transform, load), CDE – common data elements, ML – machine learning, GUI – graphical user interface, VM – virtual machine. Data pre-processing: extract data from EHR records and produce pseudonymised data in .csv format; optional Step1: extract brain volumes from MRI images and merge with data extracted from EHR records; Data Quality and Harmonisation: Prepare CDE: if CDE exists – Steps 2B, 4 and 5 are followed; if CDE needs to be prepared, first Steps 2A and 3A need to be performed, followed by Steps 2B, 4 and 5. Data Analysis and ML: anonymised dataset is uploaded either to the federated node in the institution or the dedicated VM on EBRAINS CSCS. Data Analysis can be performed via the Federation Service Layer and User Interface: use of predefined federated algorithms, aggregated results will be retrieved via the GUI.//
107 )))
108
109 == MIP GDPR compliance assessment ==
110
111 Several aspects are crucial for demonstrating GDPR compliance. Hereunder is a compliance assessment based on the GDPR core principles:
112
113 (% style="color:#27ae60" %)**Lawfulness, Fairness, and Transparency (Article 5 GDPR)**
114
115 **Lawfulness and Fairness:** In alignment with GDPR requirements for lawful processing (Article 6(1)(a)), the MIP legal contracts with Data Providers require that data processing is based on informed consent obtained from data subjects. It requires users to accept the EBRAINS General Terms of Use, adhering to all applicable laws and regulations, including GDPR. Data Transfer Agreements (DTAs) and Data Sharing Agreements (DSAs) provide a legal framework and are mandated before any data transfer or data sharing, ensuring compliance with Article 28(3) regarding processor agreements (GDPR Articles 5(1)(a), 6, and 7). Strict authentication and authorisation procedures are in place, to only provide access to accredited users. Data anonymisation is required before integration in the MIP, which minimises the risk of reidentification, protecting data subjects from potential harm (GDPR Article 6(1)(a)). An additional built in privacy threshold restricts data analysis to receiving aggregate results of at least 10 participant records.
116
117 **Transparency:** The open-source nature of the MIP promotes transparency by providing accessible source code, fostering community involvement, and offering clear information about data governance, federated queries, and data usage without moving original data from its location. Detailed technical and user documentation is available at [[https:~~/~~/github.com/HBPMedical/mip-docs>>url:https://github.com/HBPMedical/mip-docs]], an interactive user guide is accessible directly on the platform.
118
119 (% style="color:#27ae60" %)**Purpose Limitation**
120
121 The MIP processes data for specified explicit, and legitimate purposes related to clinical research of each of the MIP Federations (dementia, traumatic brain injury, epilepsy, mental health, and stroke). Data is not moved or downloaded from the platform, maintaining the integrity of the purpose limitation principle (GDPR Article 5(1)(b)).
122
123 (% style="color:#27ae60" %)**Data Minimisation**
124
125 The MIP adheres to the principle of data minimisation by only processing data necessary for the research purposes stated. This includes the use of Common Data Elements (CDEs) to standardise and limit the scope of data collected or re-used. All data is anonymised, minimising the exposure of personal data (GDPR Article 5(1)(c)).
126
127 (% style="color:#27ae60" %)**Accuracy**
128
129 MIP includes tools like the MIP Data Catalogue and the MIP-DQC Tool to help data managers/curators to ensure data accuracy and quality before data is integrated. Data validation and cleaning are integral parts of the data preparation process (GDPR Article 5(1)(d)).
130
131 (% style="color:#27ae60" %)**Storage Limitation**
132
133 Data within the MIP is kept only as long as necessary for the scientific research purposes. The platform’s architecture, which involves retaining data control at the level of the data provider, mitigates the risks associated with long-term storage, supporting compliance with GDPR’s storage limitation principles (GDPR Article 5(1)(e)). Data Providers can at any time decide that a federation is to be discontinued, either based on the time limits set in the legal contracts or at any time this seems to be appropriate.
134
135 (% style="color:#27ae60" %)**Integrity and Confidentiality**
136
137 MIP employs strong authentication, encryption, and a secure VPN for data protection. The federated analysis framework ensures that data remains confidential and is only accessed by accredited users (GDPR Articles 5(1)(f), 25, and 32).
138
139 (% style="color:#27ae60" %)**Accountability**
140
141 Data owners are responsible for ensuring ethical compliance and the integrity of research data. MIP’s governance framework enforces accountability among data controllers and processors by maintaining records of processing activities including legal agreements and ensuring that data controllers and processors adhere to GDPR requirements. (GDPR Article 5(2)).
142
143 (% style="color:#27ae60" %)**Data Protection by Design and by Default**
144
145 The terms of use of the platform ensures that data is anonymised and remains within the original hospital’s control, reflecting a privacy by design approach. Default privacy settings (e.g., aggregation of results) restrict data analysis, strong authentication and accreditation processes enhance the security of MIP's federations, providing a secure environment for data analysis without exposing individual data (GDPR Article 25).
146
147 (% style="color:#27ae60" %)**Data Subject Rights**
148
149 As MIP processes anonymised data, GDPR data subject rights (e.g., access, rectification, erasure) do not directly apply. However, ethical considerations and informed consent ensure that patients’ rights are respected (GDPR Articles 15, 16, 17, and 18, as applicable to the non-anonymised data collection phase). The system's design respects data ownership and control by data controllers, ensuring they can determine accessibility and availability of their data.
150
151 (% style="color:#27ae60" %)**Data Transfers (Articles 44-50)**
152
153 The MIP ensures that any data transfers comply with GDPR’s requirements for international data transfers. This is achieved using DTAs and DSAs, ensuring that data transferred across borders is protected under equivalent data protection standards. If data is transferred, secure file transfer solutions are used.
154
155 **Summary of legal steps to be followed, depending on the purpose of the processing or project:**
156
157 * Patient consent for usage of data for research purposes (specific, general, re-use, anonymisation)
158 * Ethical clearance for research projects and planned processing
159 * (% style="color:#c0392b" %)//DPIA * under preparation//
160 * Data Transfer agreement or Data Sharing Agreement
161 * Collaboration Agreement
162 * MIP User Charter
163 * MIP Installation Agreement
164 )))
165 )))
166
167
168 (% class="col-xs-12 col-sm-4" %)
169 (((
170 {{box title="**Contents**"}}
171 {{toc/}}
172 {{/box}}
173
174
175 )))
176 )))