GDPR Compliance in the WSO2 Micro Integrator¶
WSO2 Micro Integrator can persist a user's personally identifiable information (PII) in various sources, namely log files and RDBMSs. However, organizations that use the Micro Integrator have a legal obligation to remove all instances of a user's PII from the system if the relevant user requests. For example, consider a situation where an employee resigns from the organization and, thereby, requests the organization to remove all instances of one's PII from the organization's system. You can fulfill this requirement by anonymizing the user's PII in the system, or (in some cases) by completely removing such PII from the system.
See the topics given below for instructions on how to remove PII from the WSO2 Micro Integrator.
What is GDPR?¶
The General Data Protection Regulation (GDPR) is a new legal framework that was formalized by the European Union (EU) in 2016. It comes into effect from 28, May 2018. GDPR requires any organization that processes Personally Identifiable Information (PII) of individuals who live in Europe, to be compliant with the regulations. Organizations that fail to demonstrate GDPR compliance are subjected to financial penalties.
Info
Do you want to learn more about GDPR?
If you are new to GDPR, we recommend that you take a look at our tutorial series on Creating a Winning GDPR Strategy.
-
Part 1 - Introduction to GDPR
-
Part 2 - 7 Steps for GDPR Compliance
-
Part 4 - GDPR Compliant Consent Design
For more resources on GDPR, see the white papers, case studies, solution briefs, webinars, and talks published on our WSO2 GDPR homepage . You can also find the original GDPR legal text here.
How the WSO2 Micro Integrator persists PII¶
The Micro Integrator can persist PII in various log files (carbon logs, audit logs, API logs, and service-specific logs) depending on the mediation logic defined. The Micro Integrator does not persist a user's PII in any RDBMS by default.
Tools for removing PII in WSO2 Micro Integrator¶
-
The Forget-Me Tool , which can anonymize a user's PII in log files and RDBMSs by replacing all occurrences of the deleted user with either a randomly generated UUID value or a specified pseudonym. You can download the tool from this link. . Find out about all the capabilities of the Forget-Me tool from here.
Important
In the case of log files, note that the Forget-Me Tool does not replace PII values in the actual log files. Instead, the tool will create a new set of log files with anonymized PII values. The organization can then remove the original log files.
Note
If you want to use the Forget-Me tool to remove PII in multiple WSO2 products at the same time, you can use the standalone version of the tool. Find more information on how to build and run the Forget-Me tool in standalone mode.
Prerequisites for removing PII¶
As explained in How the WSO2 Micro Integrator persists a user's PII , the Micro Integrator will store user information in log files. Note that we can only remove a deleted user's PII from archived log files, and not the live log files that are connected to the system.
Therefore, before you start removing PII stored by the Micro Integrator, be sure that the relevant user has been inactive in the system for a sufficient amount of time. This will ensure that all of the user's PII contained in log files are successfully archived. You can then follow the instructions given below to remove the user's PII references from the archived log files.
Removing PII from the Micro Integrator¶
Before you begin:
- Find out about how the Micro Integrator stores a user's PII.
- See the prerequisites for removing PII from the MI.
- In the instructions given below, we will use a proxy service that logs PII to demonstrate how PII can be removed from the Micro Integrator. However, please note that it is not recommended to log PII from a proxy service.
Anonymizing PII references¶
You can use the Forget-Me Tool to remove references to personally identifiable information (PII) from logs in the Micro Integrator. For example, consider a proxy service that logs the username that is sent through a payload. A Log mediator can be used for this as shown below.
<log level="custom">
<property expression="//Authentication/username" name="USER_NAME"/>
</log>
The user name that is used when you invoke this query will be logged in the following log files: wso2carbon.log file, audit.log file, warn.log, and the service-specific log file that is enabled for the proxy service.
INFO - LogMediator USER_NAME = Sam
Let's look at how to anonymize the username value in log files.
-
Download the Forget-Me tool and extract the contents. The location of the extracted folder will be referred to as
TOOL_HOME
from this point onwards. -
Every log statement follows the same pattern where the
USER\_NAME
keyword is followed by an actual username (in this example it isSam
). The regex pattern of this log statement will be as shown below. The Forget-Me Tool will use the below regex pattern to anonymize the username. This pattern should be added to theei-patterns.xml
file (stored in theTOOL_HOME/conf/log-config/
directory).<pattern key="pattern3"> <detectPattern>(.)*(USER_NAME)(.)*${username}(.)*</detectPattern> <replacePattern>${username}</replacePattern> </pattern>
-
Update the
config.json
file (stored in theTOOL_HOME/conf/
directory) as shown below. This file contains references to all the log files (except any service-specific log file in the system that store the above user information. If you have enabled a service-specific log file, you need to add that file name (see the element descriptions given below).{ "processors" : [ "log-file" ], "directories": [ { "dir": "log-config", "type": "log-file", "processor" : "log-file", "log-file-path" : "<MI_HOME>/repository/logs", "log-file-name-regex" : "(audit.log|warn.log|wso2carbon.log)(.)*" } ] }
The elements in the above configuration are explained below.
- "processors" : The processors listed for this element specifies whether the tool will on log files, RDBMSs, or analytics streams. In the case of the Micro Integrator, we only need to remove PII from log files, and therefore, the processor is set to "log-file".
- "directories" : This element lists the directories that correspond to the processors. In the case of the Micro Integrator, we need to specify the directories that store log files.
-
"log-file-path" : This specifies the directory path to the log files. Note that all the relevant log files are stored in the
MI_HOME/repository/logs/
directory.Note
Be sure to replace the
log-file-path
value with the correct absolute path to the location where the log files are stored. If you are on Windows , be sure to use the forward slash (/
) instead of the back slash (\
). For example:C:/Users/Administrator/Desktop/wso2mi-1.1.0/repository/logs
. -
"log-file-name-regex" : This gives the list of log files (stored in the log-file-path) that will persist the user's PII. Note that the above log-file-name-regex includes the audit.log, warn.log, and wso2carbon.log files, as well as the archived files of the same logs. If you have enabled a service-specific log file, be sure to add the file name to this list.
-
Open a command prompt and navigate to the
TOOL_HOME/bin
directory. -
Execute the following command to anonymize the user information that was added to the ei-patterns.xml file.
-
On Linux:
./forgetme.sh -U Sam
-
On Windows:
forgetme.bat -U
This will result in the following:
-
Copies will be created of all the log files specified in the config.json file. The following is the format of the log copy :
anon-<time_stamp>-<original_log_name>.log
. For example ,anon-1520946791793-warn.log
. -
The PII will be anonymized in the copies. The log files will display the user information as a pseudonym.
INFO - LogMediator USER_NAME = 86c3bfd9-f97c-4b08-9f15-772dcb0c1c
Info
For the list of commands you can run using the Forget-Me tool, see this link.
-
Deleting original (archived) log files¶
Note that the PII is not removed from the original log files. It is the responsibility of the organization to remove the original log files that contain the user's PII.