PII Pseudonymization Knowledge Model
Overview
The PII Pseudonymization Knowledge Model (display name: PII Pseudonymization EN v#) aims at pseudonymizing Personal Identifiable Information (PII) contained in a text.
Automatic pseudonymization of PII is one of the ways to protect a person's identity when dealing with private or sensitive documents, another being anonymization, which can be attained with the PII Anonymization Knowledge Model.
The PII Pseudonymization Knowledge Model pseudonymises the same PII detected by the PII Knowledge Model.
Every PII is associated to a pseudonym of the same semantic nature. For example, a given name is replaced by a pseudo given name, an address is replaced by a pseudo address and so on.
In the scope of document, a specific PII is consistently replaced with the same pseudonym, so, for example, Paul is always replaced by John.
Output structure
The model output has the same structure as any other model and is affected by the functional options of the workflow block.
The peculiar part of the output if the modifiedText
property of the extraData
object, containing the pseudonymised text, so it's necessary to turn on the Output rules extra data functional option of the workflow block.
Example
Considering the following text:
CREDIT CARD REQUEST
NAME: John Smith
ADDRESS: 339 Rardin Dr., Broad Run, VA, 20137
BIRTHPLACE: 02/06/1992, Charlottetown, Canada
PHONE: 985-281-4501
EMAIL: [email protected]
Dear sir or madam,
I lost my credit card. Please find below the credit card's details:
Card Type: American Express 1234 4567 8901 1234 , Expiration date 09/2024, CVV 123.
This email is to request that you issue me a replacement card as soon as possible. The canceled card should be not authorized under any circumstances.
Thank you for your attention.
Sincerely
the pseudonymised text is the value of the modifiedText
property of the extraData
output object:
"extraData": {
"modifiedText": "CREDIT CARD REQUEST \nNAME: Yusuf Sanders \nADDRESS: 801 Alena Fields, Kirkland, KS, 04099 \nBIRTHPLACE: 1990-02-25, Khartoum \nPHONE: +1-200-555-0101 \nEMAIL: [email protected] \nDear sir or madam,\nI lost my credit card. Please find below the credit card's details:\nCard Type: American Express 4716941181146190, Expiration date 08/2023, CVV 999.\nThis email is to request that you issue me a replacement card as soon as possible. The canceled card should be not authorized under any circumstances.\nThank you for your attention.\nSincerely \n"
}