The Data Anonymization and Data Pseudonymization Library has been implemented in Java programming language and can be utilized by any Android application. The aim of the library is to either anonymize or pseudonymize the personal data – and more specifically the health data – of the data subjects whenever they want to participate in a research study.
In order to integrate the Data Anonymization and Data Pseudonymization Library there are some steps which should be followed.
- In case of a Gradle project, the following line should be appended in the dependencies section of the build.gradle file:
implementation(group:'eu.interopehrate', name:'rdsanoni', version: '0.1.1')
- In case of a Maven project, the same dependency should be expressed with the following Maven syntax:
<dependency>
<groupId>eu.interopehrate</groupId>
<artifactId>rdsanoni</artifactId>
<version>0.1.1</version>
</dependency>
The Data Anonymization and Data Pseudonymization Library is deployed at the citizens mobile phone – by the S-EHR application – and has five methods.
It stores the below variables locally on the citizen’s phone.
Parameters:
- the pseudo, which is either a pseudo-identity or a pseudonym,
- the pseudoType, which indicates whether the pseudo is a pseudo-identity or a pseudonym, and
- the studyID, which is the ID of the current research study.
Response: void
It retrieves the previously stored pseudo (pseudo-identity or pseudonym).
Parameters:
- the studyID, which is the ID of the current research study.
Response: The pseudo-identity or the pseudonym.
It retrieves a pseudonym from the Pseudonym Provider.
Parameters:
- the anAssertion, which is the anonymous assertion token (*), and
- the publicKey, which is the public key of the user's certificate.
(*) This is the transient anonymous identity of the user retrieved by eIDAS at an earlier stage.
Response: A pseudonym.
It pseudonymizes the citizen’s dataset.
Parameters:
- the data, which will get pseudonymized,
- the fileType, which is the type of the data file, and
- the studyID, which is the ID of the current research study (*).
(*) The studyID variable is utilized in order for the library to call getPseudo and retrieve the pseudo-identity/pseudonym from the citizen’s mobile phone.
Response: The pseudonymized dataset.
It anonymizes the citizen’s dataset.
Parameters:
- the data, which will get anonymized, and
- the fileType, which is the type of the data file.
Response: The anonymized dataset.