Sensitive information types in Exchange Server

Data loss prevention (DLP) includes 80 sensitive information types that are ready for you to use in your DLP policies. This topic lists all of these sensitive information types and shows what a DLP policy looks for when it detects each type. A sensitive information type is defined by a pattern that can be identified by a regular expression or a function. In addition, corroborative evidence such as keywords and checksums can be used to identify a sensitive information type. Confidence level and proximity are also used in the evaluation process.

ABA Routing Number

Format: Nine digits that may be in a formatted or unformatted pattern.

Pattern:

Formatted:

  • Four digits beginning with 0, 1, 2, 3, 6, 7, or 8

  • A hyphen

  • Four digits

  • A hyphen

  • A digit

Unformatted: Nine consecutive digits beginning with 0, 1, 2, 3, 6, 7, or 8

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_aba_routing finds content that matches the pattern.

  • A keyword from Keyword_ABA_Routing is found.

<!-- ABA Routing Number -->
<Entity id="cb353f78-2b72-4c3c-8827-92ebe4f69fdf" patternsProximity="300" recommendedConfidence="75">
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_aba_routing" />
        <Match idRef="Keyword_ABA_Routing" />
      </Pattern>
 </Entity>
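As a rough illustration of how this definition combines a pattern with nearby keyword evidence, the following Python sketch approximates the check. The regular expression is a hypothetical stand-in for Func_aba_routing (its implementation is not shown in this topic), the keyword tuple is a small subset of Keyword_ABA_Routing, and the 300-character window on either side of the match is an assumption about how proximity is measured.

```python
import re

# Hypothetical stand-in for Func_aba_routing: formatted (dddd-dddd-d) or
# unformatted (nine digits), first digit one of 0, 1, 2, 3, 6, 7, 8.
ABA_PATTERN = re.compile(r"\b[0123678]\d{3}(?:-\d{4}-\d|\d{5})\b")

# A small subset of Keyword_ABA_Routing, for illustration only.
ABA_KEYWORDS = ("aba routing number", "bank routing number", "routing transit number")

PROXIMITY = 300  # mirrors patternsProximity="300"


def aba_confidence(text: str) -> int:
    """Return 75 if a candidate number has a keyword nearby, otherwise 0."""
    lowered = text.lower()
    for match in ABA_PATTERN.finditer(text):
        # Assumed window: 300 characters on either side of the match.
        start = max(0, match.start() - PROXIMITY)
        end = min(len(text), match.end() + PROXIMITY)
        if any(keyword in lowered[start:end] for keyword in ABA_KEYWORDS):
            return 75
    return 0
```

For example, a message containing "ABA routing number: 1234-5678-9" would score 75 in this sketch, while a bare nine-digit order number with no keyword nearby would score 0.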

Keywords:

Keyword_ABA_Routing
aba
aba #
aba routing #
aba routing number
aba#
abarouting#
aba number
abaroutingnumber
american bank association routing #
american bank association routing number
americanbankassociationrouting#
americanbankassociationroutingnumber
bank routing number
bank routing#
routing transit number
RTN

Argentina National Identity (DNI) Number

Format: Eight digits separated by periods

Pattern: Eight digits:

  • Two digits

  • A period

  • Three digits

  • A period

  • Three digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_argentina_national_id finds content that matches the pattern.

  • A keyword from Keyword_argentina_national_id is found.

<!-- Argentina National Identity (DNI) Number -->
<Entity id="eefbb00e-8282-433c-8620-8f1da3bffdb2" recommendedConfidence="75" patternsProximity="300">
   <Pattern confidenceLevel="75">
      <IdMatch idRef="Regex_argentina_national_id"/>
      <Match idRef="Keyword_argentina_national_id"/>
  </Pattern>
</Entity>
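The format above can be approximated with a short regular expression. This is a sketch only: the expression below is an assumption written from the format description, not the actual Regex_argentina_national_id, and find_dni_candidates is a hypothetical helper name.

```python
import re

# Two digits, a period, three digits, a period, three digits.
# The lookarounds keep the match from starting or ending inside a longer number.
DNI_PATTERN = re.compile(r"(?<![\d.])\d{2}\.\d{3}\.\d{3}(?!\.?\d)")


def find_dni_candidates(text: str) -> list:
    """Return every substring that looks like a formatted DNI number."""
    return DNI_PATTERN.findall(text)
```

A candidate found this way would still need a keyword from Keyword_argentina_national_id within 300 characters before the 75% pattern applies.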

Keywords:

Keyword_argentina_national_id
Argentina National Identity number
Identity
Identification National Identity Card
DNI
NIC National Registry of Persons
Documento Nacional de Identidad
Registro Nacional de las Personas
Identidad
Identificación

Australia Bank Account Number

Format: 6-10 digits with or without a bank state branch number

Pattern: Account number is 6-10 digits. Australia bank state branch number:

  • Three digits

  • A hyphen

  • Three digits

Checksum: No

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_australia_bank_account_number finds content that matches the pattern.

  • A keyword from Keyword_australia_bank_account_number is found.

  • The regular expression Regex_australia_bank_account_number_bsb finds content that matches the pattern.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_australia_bank_account_number finds content that matches the pattern.

  • A keyword from Keyword_australia_bank_account_number is found.

<!-- Australia Bank Account Number -->
<Entity id="74a54de9-2a30-4aa0-a8aa-3d9327fc07c7" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Regex_australia_bank_account_number" />
        <Match idRef="Keyword_australia_bank_account_number" />
        <Match idRef="Regex_australia_bank_account_number_bsb" />
  </Pattern>
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_australia_bank_account_number" />
        <Match idRef="Keyword_australia_bank_account_number" />
  </Pattern>
 </Entity>
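An entity with two Pattern elements can be read as a tiered check. The sketch below models that reading in Python. The Evidence class and bank_account_confidence function are hypothetical names, and reporting the highest satisfied confidence level is an assumption about how the engine resolves multiple matching patterns.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Evidence:
    """Which pieces of evidence were found within the proximity window."""
    account_number: bool  # Regex_australia_bank_account_number matched
    keyword: bool         # a keyword from Keyword_australia_bank_account_number
    bsb: bool             # Regex_australia_bank_account_number_bsb matched


def bank_account_confidence(found: Evidence) -> int:
    """Return the highest confidence level whose pattern is fully satisfied."""
    if found.account_number and found.keyword and found.bsb:
        return 85
    if found.account_number and found.keyword:
        return 75
    return 0
```

In this model the bank state branch number only raises confidence; an account number with a keyword but no branch number still matches at 75%.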

Keywords:

Keyword_australia_bank_account_number
swift bank code
correspondent bank
base currency
usa account
holder address
bank address
information account
fund transfers
bank charges
bank details
banking information
full names
idea

Australia Driver's License Number

Format: Nine letters and digits

Pattern: Nine letters and digits:

  • Two digits or letters (not case sensitive)

  • Two digits

  • Five digits or letters (not case sensitive)

    OR

  • 1-2 optional letters (not case sensitive)

  • 4-9 digits

    OR

  • Nine digits or letters (not case sensitive)

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_australia_drivers_license_number finds content that matches the pattern.

  • A keyword from Keyword_australia_drivers_license_number is found.

  • No keyword from Keyword_australia_drivers_license_number_exclusions is found.

<!-- Australia Drivers License Number -->
<Entity id="1cbbc8f5-9216-4392-9eb5-5ac2298d1356" patternsProximity="300" recommendedConfidence="75">
   <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_australia_drivers_license_number" />
        <Match idRef="Keyword_australia_drivers_license_number" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Keyword_australia_drivers_license_number_exclusions" />
        </Any>
  </Pattern>
</Entity>
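The Any element with minMatches="0" and maxMatches="0" expresses an exclusion: the pattern holds only when none of the listed keywords appear. A minimal Python sketch of that logic follows; the function name and its count-based parameters are hypothetical, chosen only to mirror the three conditions above.

```python
def drivers_license_confidence(number_found: bool,
                               keyword_hits: int,
                               exclusion_hits: int) -> int:
    """Model the single 75% pattern for the Australia driver's license type."""
    # The IdMatch and Match elements must both be satisfied.
    if not number_found or keyword_hits < 1:
        return 0
    # <Any minMatches="0" maxMatches="0"> allows zero exclusion keywords.
    if exclusion_hits != 0:
        return 0
    return 75
```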

Keywords:

Keyword_australia_drivers_license_number
international driving permits
australian automobile association
Sydney nsw
international driving permit
DriverLicence
DriverLicences
Driver Lic
Driver Licence
Driver Licences
DriversLic
DriversLicence
DriversLicences
Drivers Lic
Drivers Lics
Drivers Licence
Drivers Licences
Driver'Lic
Driver'Lics
Driver'Licence
Driver'Licences
Driver' Lic
Driver' Lics
Driver' Licence
Driver' Licences
Driver'sLic
Driver'sLics
Driver'sLicence
Driver'sLicences
Driver's Lic
Driver's Lics
Driver's Licence
Driver's Licences
DriverLic#
DriverLics#
DriverLicence#
DriverLicences#
Driver Lic#
Driver Lics#
Driver Licence#
Driver Licences#
DriversLic#
DriversLics#
DriversLicence#
DriversLicences#
Drivers Lic#
Drivers Lics#
Drivers Licence#
Drivers Licences#
Driver'Lic#
Driver'Lics#
Driver'Licence#
Driver'Licences#
Driver' Lic#
Driver' Lics#
Driver' Licence#
Driver' Licences#
Driver'sLic#
Driver'sLics#
Driver'sLicence#
Driver'sLicences#
Driver's Lic#
Driver's Lics#
Driver's Licence#
Driver's Licences#

Keyword_australia_drivers_license_number_exclusions
aaa
DriverLicense
DriverLicenses
Driver License
Driver Licenses
DriversLicense
DriversLicenses
Drivers License
Drivers Licenses
Driver'License
Driver'Licenses
Driver' License
Driver' Licenses
Driver'sLicense
Driver'sLicenses
Driver's License
Driver's Licenses
DriverLicense#
DriverLicenses#
Driver License#
Driver Licenses#
DriversLicense#
DriversLicenses#
Drivers License#
Drivers Licenses#
Driver'License#
Driver'Licenses#
Driver' License#
Driver' Licenses#
Driver'sLicense#
Driver'sLicenses#
Driver's License#
Driver's Licenses#

Australia Medical Account Number

Format: 10-11 digits

Pattern: 10-11 digits:

  • First digit is in the range 2-6

  • Ninth digit is a check digit

  • Tenth digit is the issue digit

  • Eleventh digit (optional) is the individual number

Checksum: Yes

Definition:

A DLP policy is 95% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_australian_medical_account_number finds content that matches the pattern.

  • A keyword from Keyword_Australia_Medical_Account_Number is found.

  • The checksum passes.

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_australian_medical_account_number finds content that matches the pattern.

  • No keyword from Keyword_Australia_Medical_Account_Number is found.

  • The checksum passes.

<!-- Australia Medical Account Number -->
<Entity id="104a99a0-3d3b-4542-a40d-ab0b9e1efe63" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="95">
    <IdMatch idRef="Func_australian_medical_account_number"/>
    <Any minMatches="1">
      <Match idRef="Keyword_Australia_Medical_Account_Number"/>
    </Any>
  </Pattern>
  <Pattern confidenceLevel="85">
    <IdMatch idRef="Func_australian_medical_account_number"/>
    <Any minMatches="0" maxMatches="0">
      <Match idRef="Keyword_Australia_Medical_Account_Number"/>
    </Any>
  </Pattern>
</Entity>
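The check digit in the ninth position can be illustrated with the publicly documented Medicare card rule: multiply the first eight digits by the weights 1, 3, 7, 9, 1, 3, 7, 9 and take the sum modulo 10. The sketch below assumes that Func_australian_medical_account_number applies this rule; the topic does not show the function's implementation, and the Python names are hypothetical.

```python
MEDICARE_WEIGHTS = (1, 3, 7, 9, 1, 3, 7, 9)


def medicare_checksum_passes(number: str) -> bool:
    """Validate the ninth-digit check on a 10- or 11-digit number."""
    digits = [int(ch) for ch in number if ch in "0123456789"]
    # 10-11 digits, first digit in the range 2-6.
    if len(digits) not in (10, 11) or not 2 <= digits[0] <= 6:
        return False
    # zip stops after the eight weighted digits.
    total = sum(w * d for w, d in zip(MEDICARE_WEIGHTS, digits))
    return total % 10 == digits[8]
```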

Keywords:

Keyword_Australia_Medical_Account_Number
bank account details
medicare payments
mortgage account
bank payments
information branch
credit card loan
department of human services
local service
medicare

Australia Passport Number

Format: A letter followed by seven digits

Pattern: A letter (not case sensitive) followed by seven digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_australia_passport_number finds content that matches the pattern.

  • A keyword from Keyword_passport or Keyword_australia_passport_number is found.

<!-- Australia Passport Number -->
<Entity id="29869db6-602d-4853-ab93-3484f905df50" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_australia_passport_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_passport" />
          <Match idRef="Keyword_australia_passport_number" />
        </Any>
   </Pattern>
</Entity>

Keywords:

Keyword_passport
Passport Number
Passport No
Passport #
Passport#
PassportID
Passportno
passportnumber
パスポート
パスポート番号
パスポートのNum
パスポート #
Numéro de passeport
Passeport n °
Passeport Non
Passeport #
Passeport#
PasseportNon
Passeportn °

Keyword_australia_passport_number
passport
passport details
immigration and citizenship
commonwealth of australia
department of immigration
residential address
department of immigration and citizenship
visa
national identity card
passport number
travel document
issuing authority

Australia Tax File Number

Format: 8-9 digits

Pattern: 8-9 digits typically presented with spaces as follows:

  • Three digits

  • An optional space

  • Three digits

  • An optional space

  • 2-3 digits where the last digit is a check digit

Checksum: Yes

Definition:

A DLP policy is 95% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_australian_tax_file_number finds content that matches the pattern.

  • A keyword from Keyword_Australia_Tax_File_Number is found.

  • No keyword from Keyword_number_exclusions is found.

  • The checksum passes.

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_australian_tax_file_number finds content that matches the pattern.

  • No keyword from Keyword_Australia_Tax_File_Number or Keyword_number_exclusions is found.

  • The checksum passes.

<!-- Australia Tax File Number -->
<Entity id="e29bc95f-ff70-4a37-aa01-04d17360a4c5" patternsProximity="300" recommendedConfidence="85">
    <Pattern confidenceLevel="95">
        <IdMatch idRef="Func_australian_tax_file_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_Australia_Tax_File_Number" />
        </Any>
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Keyword_number_exclusions" />
        </Any>
  </Pattern>
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_australian_tax_file_number" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Keyword_Australia_Tax_File_Number" />
          <Match idRef="Keyword_number_exclusions" />
        </Any>
  </Pattern>
</Entity>
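For nine-digit numbers, the publicly known tax file number check multiplies the digits by the weights 1, 4, 3, 7, 5, 8, 6, 9, 10 and requires the sum to be divisible by 11. The following Python sketch assumes Func_australian_tax_file_number uses that rule; this topic does not confirm it, and eight-digit numbers, which use a different weighting, are deliberately left out.

```python
TFN_WEIGHTS = (1, 4, 3, 7, 5, 8, 6, 9, 10)


def tfn_checksum_passes(number: str) -> bool:
    """Check a nine-digit tax file number; spaces are ignored."""
    digits = [int(ch) for ch in number if ch in "0123456789"]
    if len(digits) != 9:
        return False  # eight-digit numbers are not covered by this sketch
    total = sum(w * d for w, d in zip(TFN_WEIGHTS, digits))
    return total % 11 == 0
```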

Keywords:

Keyword_Australia_Tax_File_Number
australian business number
marginal tax rate
medicare levy
portfolio number
service veterans
withholding tax
individual tax return
tax file number

Keyword_number_exclusions
00000000
11111111
22222222
33333333
44444444
55555555
66666666
77777777
88888888
99999999
000000000
111111111
222222222
333333333
444444444
555555555
666666666
777777777
888888888
999999999
0000000000
1111111111
2222222222
3333333333
4444444444
5555555555
6666666666
7777777777
8888888888
9999999999

Belgium National Number

Format: 11 digits plus delimiters

Pattern: 11 digits plus delimiters:

  • Six digits and two periods in the format YY.MM.DD for date of birth

  • A hyphen

  • Three sequential digits (odd for males, even for females)

  • A period

  • Two digits that are check digits

Checksum: Yes

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_belgium_national_number finds content that matches the pattern.

  • A keyword from Keyword_belgium_national_number is found.

  • The checksum passes.

<!-- Belgium National Number -->
  <Entity id="fb969c9e-0fd1-4b18-8091-a2123c5e6a54" recommendedConfidence="75" patternsProximity="300">
   <Pattern confidenceLevel="75">
     <IdMatch idRef="Func_belgium_national_number"/>
     <Match idRef="Keyword_belgium_national_number"/>
  </Pattern>
</Entity>
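The two check digits follow a publicly documented rule: they equal 97 minus the remainder of the first nine digits divided by 97, with a leading 2 prefixed to those nine digits for people born in 2000 or later. The sketch below assumes Func_belgium_national_number applies this rule; the Python function name is hypothetical.

```python
def belgium_check_digits_pass(number: str) -> bool:
    """Validate the final two digits of an 11-digit national number."""
    digits = "".join(ch for ch in number if ch in "0123456789")
    if len(digits) != 11:
        return False
    base, check = int(digits[:9]), int(digits[9:])
    # Try the pre-2000 form first, then the form with a leading "2".
    candidates = (base, int("2" + digits[:9]))
    return any(97 - (value % 97) == check for value in candidates)
```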

Keywords:

Keyword_belgium_national_number
Identity
Registration
Identification
ID
Identiteitskaart
Registratie nummer
Identificatie nummer
Identiteit
Registratie
Identificatie
Carte d'identité
numéro d'immatriculation
numéro d'identification
identité
inscription
Identifikation
Identifizierung
Identifikationsnummer
Personalausweis
Registrierung
Registrationsnummer

Brazil Legal Entity Number (CNPJ)

Format: 14 digits that include a registration number, branch number, and check digits, plus delimiters

Pattern: 14 digits, plus delimiters:

  • Two digits

  • A period

  • Three digits

  • A period

  • Three digits (these first eight digits are the registration number)

  • A forward slash

  • Four-digit branch number

  • A hyphen

  • Two digits that are check digits

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_brazil_cnpj finds content that matches the pattern.

  • A keyword from Keyword_brazil_cnpj is found.

  • The checksum passes.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_brazil_cnpj finds content that matches the pattern.

  • The checksum passes.

<!-- Brazil Legal Entity Number (CNPJ) -->
<Entity id="9b58b5cd-5e90-4df6-b34f-1ebcc88ceae4" recommendedConfidence="85" patternsProximity="300">
   <Pattern confidenceLevel="85">
     <IdMatch idRef="Func_brazil_cnpj"/>
     <Match idRef="Keyword_brazil_cnpj"/>
  </Pattern>
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Func_brazil_cnpj"/>
  </Pattern>
</Entity>
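The two CNPJ check digits are commonly computed with a modulo 11 scheme in which the weights run from 2 to 9 starting at the rightmost digit and then repeat. The Python sketch below assumes Func_brazil_cnpj follows that public scheme; the helper names are hypothetical.

```python
def _cnpj_check_digit(digits: list) -> int:
    # Weights run 2..9 from the rightmost digit and then repeat.
    total = sum(d * (2 + i % 8) for i, d in enumerate(reversed(digits)))
    remainder = total % 11
    return 0 if remainder < 2 else 11 - remainder


def cnpj_checksum_passes(number: str) -> bool:
    """Validate both check digits of a formatted or unformatted CNPJ."""
    digits = [int(ch) for ch in number if ch in "0123456789"]
    if len(digits) != 14:
        return False
    first = _cnpj_check_digit(digits[:12])
    # The second check digit covers the first twelve digits plus the first check digit.
    second = _cnpj_check_digit(digits[:12] + [first])
    return digits[12:] == [first, second]
```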

Keywords:

Keyword_brazil_cnpj
CNPJ
CNPJ/MF
CNPJ-MF
National Registry of Legal Entities
Taxpayers Registry
Legal entity
Legal entities
Registration Status
Business
Company
CNPJ
Cadastro Nacional da Pessoa Jurídica
Cadastro Geral de Contribuintes
CGC
Pessoa jurídica
Pessoas jurídicas
Situação cadastral
Inscrição
Empresa

Brazil CPF Number

Format: 11 digits that include a check digit and can be formatted or unformatted

Pattern:

Formatted:

  • Three digits

  • A period

  • Three digits

  • A period

  • Three digits

  • A hyphen

  • Two digits which are check digits

Unformatted: 11 digits where the last two digits are check digits

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_brazil_cpf finds content that matches the pattern.

  • A keyword from Keyword_brazil_cpf is found.

  • The checksum passes.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_brazil_cpf finds content that matches the pattern.

  • The checksum passes.

<!-- Brazil CPF Number -->
<Entity id="78e09124-f2c3-4656-b32a-c1a132cd2711" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Func_brazil_cpf"/>
     <Match idRef="Keyword_brazil_cpf"/>
  </Pattern>
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Func_brazil_cpf"/>
  </Pattern>
</Entity>
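The CPF check digits are commonly described as two successive modulo 11 calculations: the first over the nine base digits with weights 10 down to 2, the second over those digits plus the first check digit with weights 11 down to 2. The following Python sketch assumes Func_brazil_cpf uses that public algorithm; the function names are hypothetical.

```python
def _cpf_check_digit(digits: list) -> int:
    # Weights count down so that the rightmost digit is multiplied by 2.
    weights = range(len(digits) + 1, 1, -1)
    total = sum(w * d for w, d in zip(weights, digits))
    remainder = (total * 10) % 11
    return 0 if remainder == 10 else remainder


def cpf_checksum_passes(number: str) -> bool:
    """Validate both check digits of a formatted or unformatted CPF."""
    digits = [int(ch) for ch in number if ch in "0123456789"]
    if len(digits) != 11:
        return False
    first = _cpf_check_digit(digits[:9])
    second = _cpf_check_digit(digits[:9] + [first])
    return digits[9:] == [first, second]
```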

Keywords:

Keyword_brazil_cpf
CPF
Identification
Registration
Revenue
Cadastro de Pessoas Físicas
Imposto
Identificação
Inscrição
Receita

Brazil National ID Card (RG)

Format:

  • Registro Geral (old format): Nine digits plus delimiters

  • Registro de Identidade (RIC) (new format): 11 digits plus a hyphen

Pattern:

Registro Geral (old format):

  • Two digits

  • A period

  • Three digits

  • A period

  • Three digits

  • A hyphen

  • One digit which is a check digit

Registro de Identidade (RIC) (new format)

  • 10 digits

  • A hyphen

  • One digit which is a check digit

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_brazil_rg finds content that matches the pattern.

  • A keyword from Keyword_brazil_rg is found.

  • The checksum passes.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_brazil_rg finds content that matches the pattern.

  • The checksum passes.

<!-- Brazil National ID Card (RG) -->
<Entity id="486de900-db70-41b3-a886-abdf25af119c" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Func_brazil_rg"/>
     <Match idRef="Keyword_brazil_rg"/>
  </Pattern>
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Func_brazil_rg"/>
  </Pattern>
</Entity>

Keywords:

Keyword_brazil_rg
National ID
Registration
Cédula de identidade
Registro Geral
RG
Registro de Identidade
RIC
Número de registo
Registro

Canada Bank Account Number

Format: Seven or twelve digits

Pattern: A Canada Bank Account Number is seven or twelve digits. A Canada bank account transit number is:

  • Five digits

  • A hyphen

  • Three digits

    OR

  • A zero "0"

  • Eight digits

Checksum: No

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_canada_bank_account_number finds content that matches the pattern.

  • A keyword from Keyword_canada_bank_account_number is found.

  • The regular expression Regex_canada_bank_account_transit_number finds content that matches the pattern.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_canada_bank_account_number finds content that matches the pattern.

  • A keyword from Keyword_canada_bank_account_number is found.

<!-- Canada Bank Account Number -->
<Entity id="552e814c-cb50-4d94-bbaa-bb1d1ffb34de" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Regex_canada_bank_account_number" />
        <Match idRef="Keyword_canada_bank_account_number" />
        <Match idRef="Regex_canada_bank_account_transit_number" />
   </Pattern>
   <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_canada_bank_account_number" />
        <Match idRef="Keyword_canada_bank_account_number" />
   </Pattern>
</Entity>

Keywords:

Keyword_canada_bank_account_number
canada savings bonds
canada revenue agency
canadian financial institution
direct deposit form
canadian citizen
legal representative
notary public
commissioner for oaths
child care benefit
universal child care
canada child tax benefit
income tax benefit
harmonized sales tax
social insurance number
income tax refund
child tax benefit
territorial payments
institution number
deposit request
banking information
direct deposit

Canada Driver's License Number

Format: Varies by province

Pattern: Various patterns covering Alberta, British Columbia, Manitoba, New Brunswick, Newfoundland/Labrador, Nova Scotia, Ontario, Prince Edward Island, Quebec, and Saskatchewan

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_[province_name]_drivers_license_number finds content that matches the pattern.

  • A keyword from Keyword_[province_name]_drivers_license_name is found.

  • A keyword from Keyword_canada_drivers_license is found.

<!-- Canada Driver's License Number -->
    <Entity id="37186abb-8e48-4800-ad3c-e3d1610b3db0" patternsProximity="300" recommendedConfidence="75">
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_alberta_drivers_license_number" />
        <Match idRef="Keyword_alberta_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_british_columbia_drivers_license_number" />
        <Match idRef="Keyword_british_columbia_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_manitoba_drivers_license_number" />
        <Match idRef="Keyword_manitoba_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_new_brunswick_drivers_license_number" />
        <Match idRef="Keyword_new_brunswick_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_newfoundland_labrador_drivers_license_number" />
        <Match idRef="Keyword_newfoundland_labrador_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_nova_scotia_drivers_license_number" />
        <Match idRef="Keyword_nova_scotia_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_ontario_drivers_license_number" />
        <Match idRef="Keyword_ontario_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_prince_edward_island_drivers_license_number" />
        <Match idRef="Keyword_prince_edward_island_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_quebec_drivers_license_number" />
        <Match idRef="Keyword_quebec_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
      <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_saskatchewan_drivers_license_number" />
        <Match idRef="Keyword_saskatchewan_drivers_license_name" />
        <Match idRef="Keyword_canada_drivers_license" />
      </Pattern>
    </Entity>

Keywords:

Keyword_[province_name]_drivers_license_name
The province abbreviation, for example AB
The province name, for example Alberta

Keyword_canada_drivers_license
DL
DLS
CDL
CDLS
DriverLic
DriverLics
DriverLicense
DriverLicenses
DriverLicence
DriverLicences
Driver Lic
Driver Lics
Driver License
Driver Licenses
Driver Licence
Driver Licences
DriversLic
DriversLics
DriversLicence
DriversLicences
DriversLicense
DriversLicenses
Drivers Lic
Drivers Lics
Drivers License
Drivers Licenses
Drivers Licence
Drivers Licences
Driver'Lic
Driver'Lics
Driver'License
Driver'Licenses
Driver'Licence
Driver'Licences
Driver' Lic
Driver' Lics
Driver' License
Driver' Licenses
Driver' Licence
Driver' Licences
Driver'sLic
Driver'sLics
Driver'sLicense
Driver'sLicenses
Driver'sLicence
Driver'sLicences
Driver's Lic
Driver's Lics
Driver's License
Driver's Licenses
Driver's Licence
Driver's Licences
Permis de Conduire
id
ids
idcard number
idcard numbers
idcard #
idcard #s
idcard card
idcard cards
idcard
identification number
identification numbers
identification #
identification #s
identification card
identification cards
identification
DL#
DLS#
CDL#
CDLS#
DriverLic#
DriverLics#
DriverLicense#
DriverLicenses#
DriverLicence#
DriverLicences#
Driver Lic#
Driver Lics#
Driver License#
Driver Licenses#
Driver Licence#
Driver Licences#
DriversLic#
DriversLics#
DriversLicense#
DriversLicenses#
DriversLicence#
DriversLicences#
Drivers Lic#
Drivers Lics#
Drivers License#
Drivers Licenses#
Drivers Licence#
Drivers Licences#
Driver'Lic#
Driver'Lics#
Driver'License#
Driver'Licenses#
Driver'Licence#
Driver'Licences#
Driver' Lic#
Driver' Lics#
Driver' License#
Driver' Licenses#
Driver' Licence#
Driver' Licences#
Driver'sLic#
Driver'sLics#
Driver'sLicense#
Driver'sLicenses#
Driver'sLicence#
Driver'sLicences#
Driver's Lic#
Driver's Lics#
Driver's License#
Driver's Licenses#
Driver's Licence#
Driver's Licences#
Permis de Conduire#
ID#
IDs#
idcard card#
idcard cards#
idcard#
identification card#
identification cards#
identification#

Canada Health Service Number

Format: 10 digits

Pattern: 10 digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_canada_health_service_number finds content that matches the pattern.

  • A keyword from Keyword_canada_health_service_number is found.

<!-- Canada Health Service Number -->
<Entity id="59c0bf39-7fab-482c-af25-00faa4384c94" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_canada_health_service_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_canada_health_service_number" />
        </Any>
  </Pattern>
</Entity>

Keywords:

Keyword_canada_health_service_number
personal health number
patient information
health services
speciality services
automobile accident
patient hospital
psychiatrist
workers compensation
disability

Canada Passport Number

Format: Two uppercase letters followed by six digits

Pattern: Two uppercase letters followed by six digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_canada_passport_number finds content that matches the pattern.

  • A keyword from Keyword_canada_passport_number or Keyword_passport is found.

<!-- Canada Passport Number -->
<Entity id="14d0db8b-498a-43ed-9fca-f6097ae687eb" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_canada_passport_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_canada_passport_number" />
          <Match idRef="Keyword_passport" />
        </Any>
  </Pattern>
</Entity>

Keywords:

Keyword_canada_passport_number
canadian citizenship
canadian passport
passport application
passport photos
certified translator
canadian citizens
processing times
renewal application

Keyword_passport
Passport Number
Passport No
Passport #
Passport#
PassportID
Passportno
passportnumber
パスポート
パスポート番号
パスポートのNum
パスポート#
Numéro de passeport
Passeport n °
Passeport Non
Passeport #
Passeport#
PasseportNon
Passeportn °

Canada Personal Health Identification Number (PHIN)

Format: Nine digits

Pattern: Nine digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_canada_phin finds content that matches the pattern.

  • At least two keywords from Keyword_canada_phin or Keyword_canada_provinces are found.

<!-- Canada PHIN -->
<Entity id="722e12ac-c89a-4ec8-a1b7-fea3469f89db" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_canada_phin" />
        <Any minMatches="2">
          <Match idRef="Keyword_canada_phin" />
          <Match idRef="Keyword_canada_provinces" />
        </Any>
  </Pattern>
</Entity>

Keywords:

Keyword_canada_phin
social insurance number
health information act
income tax information
manitoba health
health registration
prescription purchases
benefit eligibility
personal health
power of attorney
registration number
personal health number
practitioner referral
wellness professional
patient referral
health and wellness

Keyword_canada_provinces
Nunavut
Quebec
Northwest Territories
Ontario
British Columbia
Alberta
Saskatchewan
Manitoba
Yukon
Newfoundland and Labrador
New Brunswick
Nova Scotia
Prince Edward Island
Canada

Canada Social Insurance Number

Format: Nine digits with optional hyphens or spaces

Pattern:

Formatted:

  • Three digits

  • A hyphen or space

  • Three digits

  • A hyphen or space

  • Three digits

Unformatted: Nine digits

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_canadian_sin finds content that matches the pattern.

  • At least two of the following, in any combination:

    • A keyword from Keyword_sin is found.

    • A keyword from Keyword_sin_collaborative is found.

    • The function Func_eu_date finds a date in the right date format.

  • The checksum passes.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_unformatted_canadian_sin finds content that matches the pattern.

  • A keyword from Keyword_sin is found.

  • The checksum passes.

<!-- Canada Social Insurance Number -->
<Entity id="a2f29c85-ecb8-4514-a610-364790c0773e" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_canadian_sin" />
        <Any minMatches="2">
          <Match idRef="Keyword_sin" />
          <Match idRef="Keyword_sin_collaborative" />
          <Match idRef="Func_eu_date" />
        </Any>
  </Pattern>
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_unformatted_canadian_sin" />
        <Match idRef="Keyword_sin" />
  </Pattern>
</Entity>
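Social insurance numbers are publicly documented as using the Luhn algorithm, and the Any element with minMatches="2" requires two of the three supporting items. The Python sketch below illustrates both ideas. It assumes the functions named in the rule apply the Luhn check, and it models the formatted and unformatted matches as separate flags; the Python names are hypothetical.

```python
def luhn_passes(number: str) -> bool:
    """Luhn check on a nine-digit number; hyphens and spaces are ignored."""
    digits = [int(ch) for ch in number if ch in "0123456789"]
    if len(digits) != 9:
        return False
    total = 0
    for index, digit in enumerate(reversed(digits)):
        if index % 2 == 1:  # every second digit, counting from the right
            digit *= 2
            if digit > 9:
                digit -= 9
        total += digit
    return total % 10 == 0


def sin_confidence(formatted: bool, unformatted: bool, keyword_sin: bool,
                   keyword_collaborative: bool, date_found: bool) -> int:
    """Return the highest confidence level whose pattern is satisfied."""
    supporting = sum((keyword_sin, keyword_collaborative, date_found))
    if formatted and supporting >= 2:  # <Any minMatches="2">
        return 85
    if unformatted and keyword_sin:
        return 75
    return 0
```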

Keywords:

Keyword_sin
sin
social insurance
numero d'assurance sociale
sins
ssn
ssns
social security
numero d'assurance social
national identification number
national id
sin#
soc ins
social ins

Keyword_sin_collaborative
driver's license
drivers license
driver's licence
drivers licence
DOB
Birthdate
Birthday
Date of Birth

Chile Identity Card Number

Format: 7-8 digits plus delimiters and a check digit or letter

Pattern: 7-8 digits plus delimiters:

  • 1-2 digits

  • A period

  • Three digits

  • A period

  • Three digits

  • A dash

  • One digit or letter (not case sensitive) which is a check digit

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_chile_id_card finds content that matches the pattern.

  • A keyword from Keyword_chile_id_card is found.

  • The checksum passes.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_chile_id_card finds content that matches the pattern.

  • The checksum passes.

<!-- Chile Identity Card Number -->
<Entity id="4e979794-49a0-407e-a0b9-2c536937b925" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Func_chile_id_card"/>
     <Match idRef="Keyword_chile_id_card"/>
  </Pattern>
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Func_chile_id_card"/>
  </Pattern>
</Entity>
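This page does not say how Func_chile_id_card computes its checksum. As an illustration only, the sketch below assumes the modulo-11 scheme commonly described for Chilean RUN/RUT numbers, in which the check character is a digit or the letter K. The function names are hypothetical.

```python
import re


def rut_check_char(body: str) -> str:
    """Compute the check character for the 7-8 digit body (assumed modulo-11 scheme)."""
    total = 0
    # Walk the digits right to left, cycling through the weights 2, 3, 4, 5, 6, 7.
    for i, ch in enumerate(reversed(body)):
        total += int(ch) * (2 + i % 6)
    result = 11 - total % 11
    if result == 11:
        return "0"
    if result == 10:
        return "K"
    return str(result)


def is_valid_rut(text: str) -> bool:
    """Check the formatted pattern (periods and a dash) and then the check character."""
    m = re.fullmatch(r"([0-9]{1,2})\.([0-9]{3})\.([0-9]{3})-([0-9Kk])", text)
    if not m:
        return False
    body = m.group(1) + m.group(2) + m.group(3)
    return rut_check_char(body) == m.group(4).upper()
```

Under these assumptions, `is_valid_rut("12.345.678-5")` returns True and changing the final character makes it return False.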

Keywords:

Keyword_chile_id_card
National Identification Number
Identity card
ID
Identification
Rol Único Nacional
RUN
Rol Único Tributario
RUT
Cédula de Identidad
Número De Identificación Nacional
Tarjeta de identificación
Identificación

China Resident Identity Card (PRC) Number

Format: 18 digits

Pattern: 18 digits:

  • Six digits which are an address code

  • Eight digits in the form YYYYMMDD, which are the date of birth

  • Three digits that are an order code

  • One digit that is a check digit

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_china_resident_id finds content that matches the pattern.

  • A keyword from Keyword_china_resident_id is found.

  • The checksum passes.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_china_resident_id finds content that matches the pattern.

  • The checksum passes.

<!-- China Resident Identity Card (PRC) Number -->
<Entity id="c92daa86-2d16-4871-901f-816b3f554fc1" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Func_china_resident_id"/>
     <Match idRef="Keyword_china_resident_id"/>
  </Pattern>
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Func_china_resident_id"/>
  </Pattern>
</Entity>
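The checksum used by Func_china_resident_id is not documented here. The following sketch assumes the ISO 7064 MOD 11-2 scheme that is commonly described for 18-character resident identity numbers. Note that under that scheme the final character can be the letter X as well as a digit, which goes beyond the pattern described above. The names are hypothetical.

```python
# Positional weights for the first 17 digits (assumed ISO 7064 MOD 11-2 scheme).
WEIGHTS = (7, 9, 10, 5, 8, 4, 2, 1, 6, 3, 7, 9, 10, 5, 8, 4, 2)
# Check character for each possible value of (weighted sum mod 11).
CHECK_CHARS = "10X98765432"


def resident_id_check_char(first17: str) -> str:
    """Return the expected check character for the first 17 digits."""
    total = sum(int(d) * w for d, w in zip(first17, WEIGHTS))
    return CHECK_CHARS[total % 11]


def is_valid_resident_id(value: str) -> bool:
    """Check length, digit content, and the final check character."""
    if len(value) != 18 or not value[:17].isdigit():
        return False
    return resident_id_check_char(value[:17]) == value[17].upper()
```

The number 11010519491231002X, a widely circulated sample value, passes this check.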

Keywords:

Keyword_china_resident_id
Resident Identity Card
PRC
National Identification Card
身份证
居民 身份证
居民身份证
鉴定
身分證
居民 身份證
鑑定

Credit Card Number

Format: 14 digits that can be formatted or unformatted (dddddddddddddd) and must pass the Luhn test.

Pattern: A complex pattern that detects cards from all major brands worldwide, including Visa, MasterCard, Discover Card, JCB, American Express, gift cards, and Diners Club cards.

Checksum: Yes, the Luhn checksum

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_credit_card finds content that matches the pattern.

  • One of the following is true:

    • A keyword from Keyword_cc_verification is found.

    • A keyword from Keyword_cc_name is found.

    • The function Func_expiration_date finds a date in the right date format.

  • The checksum passes.

A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_credit_card finds content that matches the pattern.

  • The checksum passes.

<!-- Credit Card Number -->
<Entity id="50842eb7-edc8-4019-85dd-5a5c1f2bb085" patternsProximity="300" recommendedConfidence="85">
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_credit_card" />
        <Any minMatches="1">
          <Match idRef="Keyword_cc_verification" />
          <Match idRef="Keyword_cc_name" />
          <Match idRef="Func_expiration_date" />
        </Any>
  </Pattern>
  <Pattern confidenceLevel="65">
        <IdMatch idRef="Func_credit_card" />
  </Pattern>
</Entity>
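The Luhn test named above can be sketched in a few lines of Python. This shows only the checksum step; it does not reproduce the brand-specific pattern matching that Func_credit_card performs, and the function name is hypothetical.

```python
def luhn_valid(number: str) -> bool:
    """Return True if the digits in `number` pass the Luhn checksum."""
    # Keep only the digits so that spaces and hyphens in formatted numbers are ignored.
    digits = [int(c) for c in number if c.isdigit()]
    if len(digits) < 2:
        return False
    total = 0
    # Starting from the rightmost digit, double every second digit.
    for i, d in enumerate(reversed(digits)):
        if i % 2 == 1:
            d *= 2
            if d > 9:
                d -= 9  # same as adding the two digits of the doubled value
        total += d
    return total % 10 == 0
```

For example, the common test number 4111 1111 1111 1111 passes, and changing its last digit makes it fail.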

Keywords:

Keyword_cc_verification
card verification
card identification number
cvn
cid
cvc2
cvv2
pin block
security code
security number
security no
issue number
issue no
cryptogramme
numéro de sécurité
numero de securite
kreditkartenprüfnummer
kreditkartenprufnummer
prüfziffer
prufziffer
sicherheits Kode
sicherheitscode
sicherheitsnummer
verfalldatum
codice di verifica
cod. sicurezza
cod sicurezza
n autorizzazione
código
codigo
cod. seg
cod seg
código de segurança
codigo de seguranca
codigo de segurança
código de seguranca
cód. segurança
cod. seguranca
cod. segurança
cód. seguranca
cód segurança
cod seguranca
cod segurança
cód seguranca
número de verificação
numero de verificacao
ablauf
gültig bis
gültigkeitsdatum
gultig bis
gultigkeitsdatum
scadenza
data scad
fecha de expiracion
fecha de venc
vencimiento
válido hasta
valido hasta
vto
data de expiração
data de expiracao
data em que expira
validade
valor
vencimento
Venc

Keyword_cc_name
amex
american express
americanexpress
Visa
mastercard
master card
mc
mastercards
master cards
diner's Club
diners club
dinersclub
discover card
discovercard
discover cards
JCB
japanese card bureau
carte blanche
carteblanche
credit card
cc#
cc#:
expiration date
exp date
expiry date
date d'expiration
date d'exp
date expiration
bank card
bankcard
card number
card num
cardnumber
cardnumbers
card numbers
creditcard
credit cards
creditcards
ccn
card holder
cardholder
card holders
cardholders
check card
checkcard
check cards
checkcards
debit card
debitcard
debit cards
debitcards
atm card
atmcard
atm cards
atmcards
enroute
en route
card type
carte bancaire
carte de crédit
carte de credit
numéro de carte
numero de carte
nº de la carte
nº de carte
kreditkarte
karte
karteninhaber
karteninhabers
kreditkarteninhaber
kreditkarteninstitut
kreditkartentyp
eigentümername
kartennr
kartennummer
kreditkartennummer
kreditkarten-nummer
carta di credito
carta credito
n. carta
n carta
nr. carta
nr carta
numero carta
numero della carta
numero di carta
tarjeta credito
tarjeta de credito
tarjeta crédito
tarjeta de crédito
tarjeta de atm
tarjeta atm
tarjeta debito
tarjeta de debito
tarjeta débito
tarjeta de débito
nº de tarjeta
no. de tarjeta
no de tarjeta
numero de tarjeta
número de tarjeta
tarjeta no
tarjetahabiente
cartão de crédito
cartão de credito
cartao de crédito
cartao de credito
cartão de débito
cartao de débito
cartão de debito
cartao de debito
débito automático
debito automatico
número do cartão
numero do cartão
número do cartao
numero do cartao
número de cartão
numero de cartão
número de cartao
numero de cartao
nº do cartão
nº do cartao
nº. do cartão
no do cartão
no do cartao
no. do cartão
no. do cartao

Croatia Identity Card Number

Format: Nine digits

Pattern: Nine consecutive digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_croatia_id_card finds content that matches the pattern.

  • A keyword from Keyword_croatia_id_card is found.

<!--Croatia Identity Card Number-->
<Entity id="ff12f884-c20a-4189-b185-34c8e7258d47" recommendedConfidence="75" patternsProximity="300">
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Func_croatia_id_card"/>
     <Match idRef="Keyword_croatia_id_card"/>
  </Pattern>
</Entity>

Keywords:

Keyword_croatia_id_card
Croatian identity card
Osobna iskaznica

Croatia Personal Identification (OIB) Number

Format: 10 digits

Pattern: 10 digits:

  • Six digits in the form DDMMYY, which are the date of birth

  • Four digits where the final digit is a check digit

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_croatia_oib_number finds content that matches the pattern.

  • A keyword from Keyword_croatia_oib_number is found.

  • The checksum passes.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_croatia_oib_number finds content that matches the pattern.

  • The checksum passes.

<!-- Croatia Personal Identification (OIB) Number -->
<Entity id="31983b6d-db95-4eb2-a630-b44bd091968d" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Func_croatia_oib_number"/>
     <Match idRef="Keyword_croatia_oib_number"/>
  </Pattern>
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Func_croatia_oib_number"/>
  </Pattern>
</Entity>

Keywords:

Keyword_croatia_oib_number
Personal Identification Number
Osobni identifikacijski broj
OIB

Czech National Identity Card Number

Format: 10 digits containing a forward slash

Pattern: 10 digits:

  • Six digits that are the date of birth

  • A forward slash

  • Four digits where the final digit is a check digit

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_czech_id_card finds content that matches the pattern.

  • A keyword from Keyword_czech_id_card is found.

  • The checksum passes.

<!-- Czech National Identity Card Number -->
<Entity id="60c0725a-4eb6-455b-9dda-05d8a7396497" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Func_czech_id_card"/>
     <Match idRef="Keyword_czech_id_card"/>
  </Pattern>
</Entity>

Keywords:

Keyword_czech_id_card
Czech national identity card
Občanský průkaz

Denmark Personal Identification Number

Format: 10 digits containing a hyphen

Pattern: 10 digits:

  • Six digits in the format DDMMYY, which are the date of birth

  • A hyphen

  • Four digits where the final digit is a check digit

Checksum: Yes

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_denmark_id finds content that matches the pattern.

  • A keyword from Keyword_denmark_id is found.

  • The checksum passes.

<!-- Denmark Personal Identification Number -->
<Entity id="6c4f2fef-56e1-4c00-8093-88d7a01cf460" recommendedConfidence="75" patternsProximity="300">
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Regex_denmark_id"/>
     <Match idRef="Keyword_denmark_id"/>
  </Pattern>
</Entity>
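The checksum step for this type is not spelled out on this page. As a hedged illustration, the sketch below assumes the traditional modulo-11 check for CPR numbers, in which the ten digits are multiplied by fixed weights and the sum must be divisible by 11. Some validly issued numbers are reported not to satisfy this check, so treat it as a sketch rather than a definitive validator. The names are hypothetical.

```python
import re

# Positional weights for the ten digits (assumed traditional modulo-11 scheme).
CPR_WEIGHTS = (4, 3, 2, 7, 6, 5, 4, 3, 2, 1)


def cpr_checksum_ok(value: str) -> bool:
    """Check the DDMMYY-NNNN layout and the assumed modulo-11 checksum."""
    m = re.fullmatch(r"([0-9]{6})-([0-9]{4})", value)
    if not m:
        return False
    digits = m.group(1) + m.group(2)
    total = sum(int(d) * w for d, w in zip(digits, CPR_WEIGHTS))
    return total % 11 == 0
```

With these weights, the sample value 211062-5629 gives a weighted sum of 110, which is divisible by 11.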

Keywords:

Keyword_denmark_id
Personal Identification Number
CPR
Det Centrale Personregister
Personnummer

Drug Enforcement Agency (DEA) Number

Format: Two letters followed by seven digits

Pattern: Pattern must include all of the following:

  • One letter (not case sensitive) from this set of possible letters: abcdefghjklmnprstux, which is a registrant code

  • One letter (not case sensitive), which is the first letter of the registrant's last name

  • Seven digits, the last of which is the check digit

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_dea_number finds content that matches the pattern.

  • The checksum passes.

<!-- DEA Number -->
<Entity id="9a5445ad-406e-43eb-8bd7-cac17ab6d0e4" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Func_dea_number"/>
  </Pattern>
</Entity>
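The pattern and check digit described above can be sketched as follows. The checksum rule used here is the one commonly described for DEA numbers (first, third, and fifth digits, plus twice the second, fourth, and sixth digits, with the last digit of that sum equal to the seventh digit). That Func_dea_number uses exactly this rule is an assumption, and the function name is hypothetical.

```python
import re


def dea_checksum_ok(value: str) -> bool:
    """Check the two-letter prefix, the seven digits, and the assumed check digit rule."""
    # Registrant code from the documented set, then any letter, then seven digits.
    m = re.fullmatch(r"[abcdefghjklmnprstux][a-z]([0-9]{7})", value.lower())
    if not m:
        return False
    d = [int(c) for c in m.group(1)]
    total = d[0] + d[2] + d[4] + 2 * (d[1] + d[3] + d[5])
    # The last digit of the sum must equal the seventh digit.
    return total % 10 == d[6]
```

For AB1234563 the sum is 9 + 2 × 12 = 33, so the expected check digit is 3 and the value passes.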

Keywords: None

EU Debit Card Number

Format: 16 digits

Pattern: Very complex and robust pattern

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_eu_debit_card finds content that matches the pattern.

  • At least one of the following is true:

    • A keyword from Keyword_eu_debit_card is found.

    • A keyword from Keyword_card_terms_dict is found.

    • A keyword from Keyword_card_security_terms_dict is found.

    • A keyword from Keyword_card_expiration_terms_dict is found.

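    • The function Func_expiration_date finds a date in the right date format.

    • The function Func_eu_date finds a date in the right date format.

    • The function Func_eu_date1 finds a date in the right date format.

    • The function Func_eu_date2 finds a date in the right date format.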

  • The checksum passes.

<!-- EU Debit Card Number -->
<Entity id="0e9b3178-9678-47dd-a509-37222ca96b42" patternsProximity="300" recommendedConfidence="85">
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_eu_debit_card" />
        <Any minMatches="1">
          <Match idRef="Keyword_eu_debit_card" />
          <Match idRef="Keyword_card_terms_dict" />
          <Match idRef="Keyword_card_security_terms_dict" />
          <Match idRef="Keyword_card_expiration_terms_dict" />
          <Match idRef="Func_expiration_date" />
          <Match idRef="Func_eu_date" />
          <Match idRef="Func_eu_date1" />
          <Match idRef="Func_eu_date2" />
        </Any>
  </Pattern>
</Entity>

Keywords:

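The four keyword lists follow in this order: Keyword_eu_debit_card (account number through cc#), Keyword_card_terms_dict (acct nbr through vpay), Keyword_card_security_terms_dict (card identification number through verfalldatum), and Keyword_card_expiration_terms_dict (ablauf through válido hasta).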
account number
card number
card no.
security number
cc#
acct nbr
acct num
acct no
american express
americanexpress
americano espresso
amex
atm card
atm cards
atm kaart
atmcard
atmcards
atmkaart
atmkaarten
bancontact
bank card
bankkaart
card holder
card holders
card num
card number
card numbers
card type
cardano numerico
cardholder
cardholders
cardnumber
cardnumbers
carta bianca
carta credito
carta di credito
cartao de credito
cartao de crédito
cartao de debito
cartao de débito
carte bancaire
carte blanche
carte bleue
carte de credit
carte de crédit
carte di credito
carteblanche
cartão de credito
cartão de crédito
cartão de debito
cartão de débito
cb
ccn
check card
check cards
checkcard
checkcards
chequekaart
cirrus
cirrus-edc-maestro
controlekaart
controlekaarten
credit card
credit cards
creditcard
creditcards
debetkaart
debetkaarten
debit card
debit cards
debitcard
debitcards
debito automatico
diners club
dinersclub
discover
discover card
discover cards
discovercard
discovercards
débito automático
edc
eigentümername
european debit card
hoofdkaart
hoofdkaarten
in viaggio
japanese card bureau
japanse kaartdienst
jcb
kaart
kaart num
kaartaantal
kaartaantallen
kaarthouder
kaarthouders
karte
karteninhaber
karteninhabers
kartennr
kartennummer
kreditkarte
kreditkarten-nummer
kreditkarteninhaber
kreditkarteninstitut
kreditkartennummer
kreditkartentyp
maestro
master card
master cards
mastercard
mastercards
mc
mister cash
n carta
n. carta
no de tarjeta
no do cartao
no do cartão
no. de tarjeta
no. do cartao
no. do cartão
nr carta
nr. carta
numeri di scheda
numero carta
numero de cartao
numero de carte
numero de cartão
numero de tarjeta
numero della carta
numero di carta
numero di scheda
numero do cartao
numero do cartão
numéro de carte
nº carta
nº de carte
nº de la carte
nº de tarjeta
nº do cartao
nº do cartão
nº. do cartão
número de cartao
número de cartão
número de tarjeta
número do cartao
scheda dell'assegno
scheda dell'atmosfera
scheda della banca
scheda di controllo
scheda di debito
scheda matrice
schede dell'atmosfera
schede di controllo
schede di debito
schede matrici
scoprono la scheda
scoprono le schede
solo
supporti di scheda
supporto di scheda
switch
tarjeta atm
tarjeta credito
tarjeta de atm
tarjeta de credito
tarjeta de debito
tarjeta debito
tarjeta no
tarjetahabiente
tipo della scheda
ufficio giapponese della scheda
v pay
v-pay
visa
visa plus
visa electron
visto
visum
vpay
card identification number
card verification
cardi la verifica
cid
cod seg
cod seguranca
cod segurança
cod sicurezza
cod. seg
cod. seguranca
cod. segurança
cod. sicurezza
codice di sicurezza
codice di verifica
codigo
codigo de seguranca
codigo de segurança
crittogramma
cryptogram
cryptogramme
cv2
cvc
cvc2
cvn
cvv
cvv2
cód seguranca
cód segurança
cód. seguranca
cód. segurança
código
código de seguranca
código de segurança
de kaart controle
geeft nr uit
issue no
issue number
kaartidentificatienummer
kreditkartenprufnummer
kreditkartenprüfnummer
kwestieaantal
no. dell'edizione
no. di sicurezza
numero de securite
numero de verificacao
numero dell'edizione
numero di identificazione della scheda
numero di sicurezza
numero van veiligheid
numéro de sécurité
nº autorizzazione
número de verificação
perno il blocco
pin block
prufziffer
prüfziffer
security code
security no
security number
sicherheits kode
sicherheitscode
sicherheitsnummer
speldblok
veiligheid nr
veiligheidsaantal
veiligheidscode
veiligheidsnummer
verfalldatum
ablauf
data de expiracao
data de expiração
data del exp
data di exp
data di scadenza
data em que expira
data scad
data scadenza
date de validité
datum afloop
datum van exp
de afloop
espira
exp date
exp datum
expiration
expire
expires
expiry
fecha de expiracion
fecha de venc
gultig bis
gultigkeitsdatum
gültig bis
gültigkeitsdatum
la scadenza
scadenza
valable
validade
valido hasta
valor
venc
vencimento
vencimiento
verloopt
vervaldag
vervaldatum
vto
válido hasta

Finland National ID

Format: Six digits plus a character indicating a century plus three digits plus a check digit

Pattern: Pattern must include all of the following:

  • Six digits in the format DDMMYY, which are a date of birth

  • Century marker (either '-', '+' or 'a')

  • Three-digit personal identification number

  • A digit or letter (case insensitive) which is a check digit

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_finnish_national_id finds content that matches the pattern.

  • A keyword from Keyword_finnish_national_id is found.

  • The checksum passes.

<!-- Finnish National ID-->
<Entity id="338FD995-4CB5-4F87-AD35-79BD1DD926C1" patternsProximity="300" recommendedConfidence="85">
  <Pattern confidenceLevel="85">
          <IdMatch idRef="Func_finnish_national_id" />
          <Match idRef="Keyword_finnish_national_id" />
  </Pattern>
</Entity>

Keywords:

Keyword_finnish_national_id
Sosiaaliturvatunnus
SOTU
Henkilötunnus
HETU
Personbeteckning
Personnummer

Finland Passport Number

Format: Combination of nine letters and digits

Pattern: Combination of nine letters and digits:

  • Two letters (not case sensitive)

  • Seven digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_finland_passport_number finds content that matches the pattern.

  • A keyword from Keyword_finland_passport_number is found.

<!-- Finland Passport Number -->
<Entity id="d1685ac3-1d3a-40f8-8198-32ef5669c7a5" recommendedConfidence="75" patternsProximity="300">
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Regex_finland_passport_number"/>
     <Match idRef="Keyword_finland_passport_number"/>
  </Pattern>
</Entity>

Keywords:

Keyword_finland_passport_number
Passport
Passi

France Driver's License Number

Format: 12 digits

Pattern: 12 digits with validation to discount similar patterns such as French telephone numbers

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_french_drivers_license finds content that matches the pattern.

  • At least one of the following is true:

    • A keyword from Keyword_french_drivers_license is found.

    • The function Func_eu_date finds a date in the right date format.

<!-- France Driver's License Number -->
<Entity id="18e55a36-a01b-4b0f-943d-dc10282a1824" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_french_drivers_license" />
        <Any minMatches="1">
          <Match idRef="Keyword_french_drivers_license" />
          <Match idRef="Func_eu_date" />
        </Any>
  </Pattern>
</Entity>

Keywords:

Keyword_french_drivers_license
drivers licence
drivers license
driving licence
driving license
permis de conduire
licence number
license number
licence numbers
license numbers

France National ID Card (CNI)

Format: 12 digits

Pattern: 12 digits

Checksum: No

Definition:

A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_france_cni finds content that matches the pattern.

<!-- France CNI -->
<Entity id="f741ac74-1bc0-4665-b69b-f0c7f927c0c4" patternsProximity="300" recommendedConfidence="65">
  <Pattern confidenceLevel="65">
        <IdMatch idRef="Regex_france_cni" />
  </Pattern>
</Entity>

Keywords: None

France Passport Number

Format: Nine digits and letters

Pattern: Nine digits and letters:

  • Two digits

  • Two letters (not case sensitive)

  • Five digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_fr_passport finds content that matches the pattern.

  • A keyword from Keyword_passport is found.

<!-- France Passport Number -->
<Entity id="3008b884-8c8c-4cd8-a289-99f34fc7ff5d" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_fr_passport" />
        <Match idRef="Keyword_passport" />
  </Pattern>
</Entity>

Keywords:

Keyword_passport
Passport Number
Passport No
Passport #
Passport#
PassportID
Passportno
passport number
パスポート
パスポート番号
パスポートのNum
パスポート #
Numéro de passeport
Passeport n °
Passeport Non
Passeport #
Passeport#
PasseportNon
Passeportn °

France Social Security Number (INSEE)

Format: 15 digits

Pattern:

Must match one of two patterns:

  • 13 digits followed by a space followed by two digits, or

  • 15 consecutive digits

Checksum: Yes

Definition:

A DLP policy is 95% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_french_insee or Func_fr_insee finds content that matches the pattern.

  • A keyword from Keyword_fr_insee is found.

  • The checksum passes.

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_french_insee or Func_fr_insee finds content that matches the pattern.

  • No keyword from Keyword_fr_insee is found.

  • The checksum passes.

<!-- France INSEE -->
<Entity id="71f62b97-efe0-4aa1-aa49-e14de253619d" patternsProximity="300" recommendedConfidence="85">
  <Pattern confidenceLevel="95">
        <IdMatch idRef="Func_french_insee" />
        <Match idRef="Func_fr_insee" />
        <Any minMatches="1">
          <Match idRef="Keyword_fr_insee" />
        </Any>
  </Pattern>
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_french_insee" />
        <Match idRef="Func_fr_insee" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Keyword_fr_insee" />
        </Any>
  </Pattern>
</Entity>
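How the INSEE checksum might be verified can be sketched in Python. The sketch assumes the commonly described rule that the final two digits are a control key equal to 97 minus the first 13 digits taken modulo 97. Whether Func_french_insee and Func_fr_insee apply exactly this rule is not stated on this page, and the function name is hypothetical.

```python
import re


def insee_key_ok(value: str) -> bool:
    """Accept 15 consecutive digits, or 13 digits, a space, and two digits."""
    m = re.fullmatch(r"([0-9]{13}) ?([0-9]{2})", value)
    if not m:
        return False
    # Assumed rule: control key = 97 - (first 13 digits mod 97).
    return 97 - int(m.group(1)) % 97 == int(m.group(2))
```

For the sample value 2690549588157 80, the first 13 digits leave a remainder of 17 when divided by 97, so the expected key is 80 and the value passes in both the spaced and unspaced forms.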

Keywords:

Keyword_fr_insee
insee
securité sociale
securite sociale
national id
national identification
numéro d'identité
no d'identité
no. d'identité
numero d'identite
no d'identite
no. d'identite
social security number
social security code
social insurance number
le numéro d'identification nationale
d'identité nationale
numéro de sécurité sociale
le code de la sécurité sociale
numéro d'assurance sociale
numéro de sécu
code sécu

German Driver's License Number

Format: Combination of 11 digits and letters

Pattern: 11 digits and letters (not case sensitive):

  • A digit or letter

  • Two digits

  • Six digits or letters

  • A digit

  • A digit or letter

Checksum: Yes

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_german_drivers_license finds content that matches the pattern.

  • At least one of the following is true:

    • A keyword from Keyword_german_drivers_license_number is found.

    • A keyword from Keyword_german_drivers_license_collaborative is found.

    • A keyword from Keyword_german_drivers_license is found.

  • The checksum passes.

<!-- German Driver's License Number -->
<Entity id="91da9335-1edb-45b7-a95f-5fe41a16c63c" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_german_drivers_license" />
        <Any minMatches="1">
          <Match idRef="Keyword_german_drivers_license_number" />
          <Match idRef="Keyword_german_drivers_license_collaborative" />
          <Match idRef="Keyword_german_drivers_license" />
        </Any>
  </Pattern>
</Entity>

Keywords:

Keyword_german_drivers_license_number Keyword_german_drivers_license_collaborative Keyword_german_drivers_license
Führerschein
Fuhrerschein
Fuehrerschein
Führerscheinnummer
Fuhrerscheinnummer
Fuehrerscheinnummer
Führerschein-
Fuhrerschein-
Fuehrerschein-
FührerscheinnummerNr
FuhrerscheinnummerNr
FuehrerscheinnummerNr
FührerscheinnummerKlasse
FuhrerscheinnummerKlasse
FuehrerscheinnummerKlasse
Führerschein- Nr
Fuhrerschein- Nr
Fuehrerschein- Nr
Führerschein- Klasse
Fuhrerschein- Klasse
Fuehrerschein- Klasse
FührerscheinnummerNr
FuhrerscheinnummerNr
FuehrerscheinnummerNr
FührerscheinnummerKlasse
FuhrerscheinnummerKlasse
FuehrerscheinnummerKlasse
Führerschein- Nr
Fuhrerschein- Nr
Fuehrerschein- Nr
Führerschein- Klasse
Fuhrerschein- Klasse
Fuehrerschein- Klasse
DL
DLS
Driv Lic
Driv Licen
Driv License
Driv Licenses
Driv Licence
Driv Licences
Driv Lic
Driver Licen
Driver License
Driver Licenses
Driver Licence
Driver Licences
Drivers Lic
Drivers Licen
Drivers License
Drivers Licenses
Drivers Licence
Drivers Licences
Driver's Lic
Driver's Licen
Driver's License
Driver's Licenses
Driver's Licence
Driver's Licences
Driving Lic
Driving Licen
Driving License
Driving Licenses
Driving Licence
Driving Licences
Nr-Führerschein
Nr-Fuhrerschein
Nr-Fuehrerschein
No-Führerschein
No-Fuhrerschein
No-Fuehrerschein
N-Führerschein
N-Fuhrerschein
N-Fuehrerschein
Nr-Führerschein
Nr-Fuhrerschein
Nr-Fuehrerschein
No-Führerschein
No-Fuhrerschein
No-Fuehrerschein
N-Führerschein
N-Fuhrerschein
N-Fuehrerschein
ausstellungsdatum
ausstellungsort
ausstellende behöde
ausstellende behorde
ausstellende behoerde

German Identity Card Number

Format:

  • Since 1 November 2010: Nine letters and digits

  • From 1 April 1987 until 31 October 2010: 10 digits

Pattern:

Since 1 November 2010:

  • One letter (not case sensitive)

  • Eight digits

From 1 April 1987 until 31 October 2010: 10 digits

Checksum: No

Definition:

A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_germany_id_card finds content that matches the pattern.

  • A keyword from Keyword_germany_id_card is found.

<!-- Germany Identity Card Number -->
<Entity id="e577372f-c42e-47a0-9d85-bebed1c237d4" recommendedConfidence="65" patternsProximity="300">
  <Pattern confidenceLevel="65">
     <IdMatch idRef="Regex_germany_id_card"/>
     <Match idRef="Keyword_germany_id_card"/>
  </Pattern>
</Entity>

Keywords:

Keyword_germany_id_card
Identity Card
ID
Identification
Personalausweis
Identifizierungsnummer
Ausweis
Identifikation

German Passport Number

Format: 10 digits or letters

Pattern: Pattern must include all of the following:

  • First character is a digit or a letter from this set (C, F, G, H, J, K)

  • Three digits

  • Five digits or letters from this set (C, F-H, J-N, P, R, T, V-Z)

  • A digit

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_german_passport finds content that matches the pattern.

  • A keyword from any of the five keyword lists is found.

  • The checksum passes.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_german_passport_data finds content that matches the pattern.

  • A keyword from any of the five keyword lists is found.

  • The checksum passes.

<!-- German Passport Number -->
<Entity id="2e3da144-d42b-47ed-b123-fbf78604e52c" patternsProximity="300" recommendedConfidence="75">
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_german_passport" />
        <Any minMatches="1">
          <Match idRef="Keyword_german_passport" />
          <Match idRef="Keyword_german_passport_collaborative" />
          <Match idRef="Keyword_german_passport_number" />
          <Match idRef="Keyword_german_passport1" />
          <Match idRef="Keyword_german_passport2" />
        </Any>
  </Pattern>
  <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_german_passport_data" />
        <Any minMatches="1">
          <Match idRef="Keyword_german_passport" />
          <Match idRef="Keyword_german_passport_collaborative" />
          <Match idRef="Keyword_german_passport_number" />
          <Match idRef="Keyword_german_passport1" />
          <Match idRef="Keyword_german_passport2" />
        </Any>
  </Pattern>
</Entity>

Keywords:

Keyword_german_passport Keyword_german_passport_collaborative Keyword_german_passport_number Keyword_german_passport1 Keyword_german_passport2
reisepass
reisepasse
reisepassnummer
passport
passports
geburtsdatum
ausstellungsdatum
ausstellungsort
No-Reisepass
Nr-Reisepass
Reisepass-Nr
bnationalit.t

Greece National ID Card

Format: Combination of 7-8 letters and numbers plus a dash

Pattern:

Seven letters and numbers (old format):

  • One letter (any letter of the Greek alphabet)

  • A dash

  • Six digits

Eight letters and numbers (new format):

  • Two letters whose uppercase character occurs in both the Greek and Latin alphabets (ABEZHIKMNOPTYX)

  • A dash

  • Six digits

Checksum: No

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_greece_id_card finds content that matches the pattern.

  • A keyword from Keyword_greece_id_card is found.

<!-- Greece National ID Card -->
<Entity id="82568215-1da1-46d3-874a-d2294d81b5ac" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Regex_greece_id_card"/>
     <Match idRef="Keyword_greece_id_card"/>
  </Pattern>
</Entity>

Keywords:

Keyword_greece_id_card
Greek identity Card
Tautotita
Δελτίο αστυνομικής ταυτότητας
Ταυτότητα

Hong Kong Identity Card (HKID) Number

Format: Combination of 8-9 letters and numbers plus optional parentheses around the final character

Pattern: Combination of 8-9 letters and numbers:

  • 1-2 letters (not case sensitive)

  • Six digits

  • The final character (any digit or the letter A), which is the check digit and is optionally enclosed in parentheses.

Checksum: Yes

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_hong_kong_id_card finds content that matches the pattern.

  • A keyword from Keyword_hong_kong_id_card is found.

  • The checksum passes.

A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_hong_kong_id_card finds content that matches the pattern.

  • The checksum passes.

<!-- Hong Kong Identity Card (HKID) number -->
<Entity id="e63c28a7-ad29-4c17-a41a-3d2a0b70fd9c" recommendedConfidence="75" patternsProximity="300">
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Func_hong_kong_id_card"/>
     <Match idRef="Keyword_hong_kong_id_card"/>
  </Pattern>
  <Pattern confidenceLevel="65">
     <IdMatch idRef="Func_hong_kong_id_card"/>
  </Pattern>
</Entity>

Keywords:

Keyword_hong_kong_id_card
Hong Kong Identity Card
HKID
ID card
香港身份證
香港永久性居民身份證

India Permanent Account Number

Format: 10 letters or digits

Pattern: 10 letters or digits:

  • Five letters (not case sensitive)

  • Four digits

  • A letter, which is an alphabetic check digit

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_india_permanent_account_number finds content that matches the pattern.

  • A keyword from Keyword_india_permanent_account_number is found.

  • The checksum passes.

<!-- India Permanent Account Number -->
<Entity id="2602bfee-9bb0-47a5-a7a6-2bf3053e2804" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Regex_india_permanent_account_number"/>
     <Match idRef="Keyword_india_permanent_account_number"/>
  </Pattern>
</Entity>

Keywords:

Keyword_india_permanent_account_number
Permanent Account Number
PAN

India Unique Identification (Aadhaar) Number

Format: 12 digits containing optional spaces or dashes

Pattern: 12 digits:

  • Four digits

  • An optional space or dash

  • Four digits

  • An optional space or dash

  • Four digits, the last of which is the check digit

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_india_aadhaar finds content that matches the pattern.

  • A keyword from Keyword_india_aadhar is found.

  • The checksum passes.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_india_aadhaar finds content that matches the pattern.

  • The checksum passes.

<!-- India Unique Identification (Aadhaar) number -->
<Entity id="1ca46b29-76f5-4f46-9383-cfa15e91048f" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Func_india_aadhaar"/>
     <Match idRef="Keyword_india_aadhar"/>
  </Pattern>
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Func_india_aadhaar"/>
  </Pattern>
</Entity>
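
Aadhaar numbers carry a Verhoeff check digit as their last digit. The sketch below validates that digit in Python. The names verhoeff_ok and aadhaar_checksum_ok are hypothetical, and whether Func_india_aadhaar performs exactly this check is an assumption, because this topic only states that a checksum exists.

```python
import re

# Multiplication table of the dihedral group D5 (Verhoeff scheme).
_D = [
    [0, 1, 2, 3, 4, 5, 6, 7, 8, 9],
    [1, 2, 3, 4, 0, 6, 7, 8, 9, 5],
    [2, 3, 4, 0, 1, 7, 8, 9, 5, 6],
    [3, 4, 0, 1, 2, 8, 9, 5, 6, 7],
    [4, 0, 1, 2, 3, 9, 5, 6, 7, 8],
    [5, 9, 8, 7, 6, 0, 4, 3, 2, 1],
    [6, 5, 9, 8, 7, 1, 0, 4, 3, 2],
    [7, 6, 5, 9, 8, 2, 1, 0, 4, 3],
    [8, 7, 6, 5, 9, 3, 2, 1, 0, 4],
    [9, 8, 7, 6, 5, 4, 3, 2, 1, 0],
]
# Position-dependent permutation table.
_P = [
    [0, 1, 2, 3, 4, 5, 6, 7, 8, 9],
    [1, 5, 7, 6, 2, 8, 3, 0, 9, 4],
    [5, 8, 0, 3, 7, 9, 6, 1, 4, 2],
    [8, 9, 1, 6, 0, 4, 3, 5, 2, 7],
    [9, 4, 5, 3, 1, 2, 6, 8, 7, 0],
    [4, 2, 8, 6, 5, 7, 3, 9, 0, 1],
    [2, 7, 9, 3, 8, 0, 6, 4, 1, 5],
    [7, 0, 4, 6, 9, 1, 3, 2, 5, 8],
]

def verhoeff_ok(digits: str) -> bool:
    """True if the digit string, check digit included, passes Verhoeff."""
    c = 0
    for i, ch in enumerate(reversed(digits)):
        c = _D[c][_P[i % 8][int(ch)]]
    return c == 0

def aadhaar_checksum_ok(text: str) -> bool:
    """Strip optional spaces or dashes, require 12 digits, then validate."""
    digits = re.sub(r"[ -]", "", text)
    return re.fullmatch(r"[0-9]{12}", digits) is not None and verhoeff_ok(digits)
```

With these tables, the made-up value "1234 5678 9010" passes and "1234-5678-9012" does not.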

Keywords:

Keyword_india_aadhar
Aadhar
Aadhaar
UID
आधार

Indonesia Identity Card (KTP) Number

Format: 16 digits containing optional periods

Pattern: 16 digits:

  • Two-digit province code

  • A period (optional)

  • Two-digit regency or city code

  • Two-digit subdistrict code

  • A period (optional)

  • Six digits in the format DDMMYY, which are the date of birth

  • A period (optional)

  • Four digits

Checksum: No

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_indonesia_id_card finds content that matches the pattern.

  • A keyword from Keyword_indonesia_id_card is found.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_indonesia_id_card finds content that matches the pattern.

<!-- Indonesia Identity Card (KTP) Number -->
<Entity id="da68fdb0-f383-4981-8c86-82689d3b7d55" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Regex_indonesia_id_card"/>
     <Match idRef="Keyword_indonesia_id_card"/>
  </Pattern>
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Regex_indonesia_id_card"/>
  </Pattern>
</Entity>
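
The two patterns in the rule above differ only in whether a keyword appears near the match. The following Python sketch shows one way that evaluation could work. The regular expression is reconstructed from the Pattern list and is not the actual Regex_indonesia_id_card. The keywords are matched as plain substrings, and treating proximity as a character window on either side of the match is an assumption.

```python
import re

# Illustrative stand-in for Regex_indonesia_id_card: 16 digits, optional periods.
KTP = re.compile(r"\b[0-9]{2}\.?[0-9]{4}\.?[0-9]{6}\.?[0-9]{4}\b")
KEYWORDS = ("ktp", "kartu tanda penduduk", "nomor induk kependudukan")

def ktp_confidence(text: str, proximity: int = 300):
    """Return 85, 75, or None for the first KTP-shaped number in text."""
    m = KTP.search(text)
    if m is None:
        return None
    # Assumed reading of patternsProximity: look this many characters
    # before and after the matched number.
    window = text[max(0, m.start() - proximity): m.end() + proximity].lower()
    return 85 if any(k in window for k in KEYWORDS) else 75
```

A call such as ktp_confidence("KTP: 32.0101.170845.0001") returns 85. The same number with no keyword nearby returns 75.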

Keywords:

Keyword_indonesia_id_card
KTP
Kartu Tanda Penduduk
Nomor Induk Kependudukan

International Banking Account Number (IBAN)

Format: Country code (two letters) plus check digits (two digits) plus BBAN (up to 30 characters)

Pattern:

Pattern must include all of the following:

  • Two-letter country code

  • Two check digits (followed by an optional space)

  • 1-7 groups of four letters or digits (can be separated by spaces)

  • 1-3 letters or digits

The format for each country is slightly different. The IBAN sensitive information type covers these 60 countries: ad, ae, al, at, az, ba, be, bg, bh, ch, cr, cy, cz, de, dk, do, ee, es, fi, fo, fr, gb, ge, gi, gl, gr, hr, hu, ie, il, is, it, kw, kz, lb, li, lt, lu, lv, mc, md, me, mk, mr, mt, mu, nl, no, pl, pt, ro, rs, sa, se, si, sk, sm, tn, tr, vg

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_iban finds content that matches the pattern.

  • The checksum passes.

<Entity id="e7dc4711-11b7-4cb0-b88b-2c394a771f0e" patternsProximity="300" recommendedConfidence="85">
  <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_iban" />
  </Pattern>
</Entity>
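
The IBAN check digits are defined by the standard mod-97 test: move the first four characters to the end, replace each letter with a number (A=10 through Z=35), and check that the result leaves a remainder of 1 when divided by 97. The sketch below shows that test in Python. The name iban_checksum_ok is hypothetical, and the sketch doesn't enforce the per-country lengths that Func_iban presumably checks.

```python
import re

def iban_checksum_ok(text: str) -> bool:
    iban = re.sub(r"\s+", "", text).upper()
    # Two-letter country code, two check digits, up to 30 BBAN characters.
    if not re.fullmatch(r"[A-Z]{2}[0-9]{2}[A-Z0-9]{1,30}", iban):
        return False
    rearranged = iban[4:] + iban[:4]
    # int(ch, 36) maps 0-9 to 0-9 and A-Z to 10-35.
    numeric = "".join(str(int(ch, 36)) for ch in rearranged)
    return int(numeric) % 97 == 1
```

The widely published sample "GB82 WEST 1234 5698 7654 32" passes this test. Changing its last digit makes it fail.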

Keywords: None

IP Address

Format: IPv4 or IPv6 address

Pattern:

  • IPv4: Complex pattern that accounts for formatted (periods) and unformatted (no periods) versions of IPv4 addresses.

  • IPv6: Complex pattern that accounts for formatted IPv6 numbers (which include colons).

Checksum: No

Definition:

For IPv4, a DLP policy is 95% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_ipv4_address finds content that matches the pattern.

  • A keyword from Keyword_ipaddress is found.

For IPv6, a DLP policy is 95% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_ipv6_address finds content that matches the pattern.

  • A keyword from Keyword_ipaddress is found.

For IPv4, a DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_ipv4_address finds content that matches the pattern.

  • No keyword from Keyword_ipaddress is found.

For IPv6, a DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_ipv6_address finds content that matches the pattern.

  • No keyword from Keyword_ipaddress is found.

<Entity id="1daa4ad5-e2dd-4ca4-a788-54722c09efb2" patternsProximity="300" recommendedConfidence="85">
    <Pattern confidenceLevel="95">
        <IdMatch idRef="Regex_ipv4_address" />
        <Any minMatches="1">
          <Match idRef="Keyword_ipaddress" />
        </Any>
    </Pattern>
    <Pattern confidenceLevel="95">
        <IdMatch idRef="Regex_ipv6_address" />
        <Any minMatches="1">
          <Match idRef="Keyword_ipaddress" />
        </Any>
    </Pattern>
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Regex_ipv4_address" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Keyword_ipaddress" />
        </Any>
    </Pattern>
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Regex_ipv6_address" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Keyword_ipaddress" />
        </Any>
    </Pattern>
</Entity>
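
In the rule above, an Any element with minMatches="1" requires a keyword, and minMatches="0" maxMatches="0" requires that no keyword is present. The following Python sketch models that logic for the four IP address patterns. The data layout and the function names are hypothetical, and returning the highest matching confidence level is an assumption about how the engine reports results.

```python
# Each entry: (confidenceLevel, IdMatch idRef, (Any child idRefs, minMatches, maxMatches))
IP_PATTERNS = [
    (95, "Regex_ipv4_address", (["Keyword_ipaddress"], 1, None)),
    (95, "Regex_ipv6_address", (["Keyword_ipaddress"], 1, None)),
    (85, "Regex_ipv4_address", (["Keyword_ipaddress"], 0, 0)),
    (85, "Regex_ipv6_address", (["Keyword_ipaddress"], 0, 0)),
]

def any_satisfied(found, refs, min_matches, max_matches):
    """Count how many child idRefs matched and compare against the bounds."""
    hits = sum(1 for ref in refs if ref in found)
    return hits >= min_matches and (max_matches is None or hits <= max_matches)

def best_confidence(found, patterns=IP_PATTERNS):
    """found is the set of idRefs that matched within the proximity window."""
    levels = [conf for conf, id_ref, group in patterns
              if id_ref in found and any_satisfied(found, *group)]
    return max(levels, default=None)
```

Here best_confidence({"Regex_ipv4_address", "Keyword_ipaddress"}) gives 95, and best_confidence({"Regex_ipv6_address"}) gives 85.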

Keywords:

Keyword_ipaddress
ip address
internet protocol
IP-כתובת ה

Ireland Personal Public Service (PPS) Number

Format:

  • New format (1 January 2013 and later): Seven digits followed by two letters

  • Old format (31 December 2012 and earlier): Seven digits followed by 1-2 letters

Pattern:

New format (1 January 2013 and later)

  • Seven digits

  • A letter (not case sensitive) which is an alphabetic check digit

  • The letter "A" or "H" (not case sensitive)

Old format (31 December 2012 and earlier)

  • Seven digits

  • 1-2 letters (not case sensitive)

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_ireland_pps finds content that matches the pattern.

  • One of the following is true:

    • A keyword from Keyword_ireland_pps is found.

    • The function Func_eu_date finds a date in the right date format.

  • The checksum passes.

A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_ireland_pps finds content that matches the pattern.

  • The checksum passes.

<!-- Ireland Personal Public Service (PPS) Number -->
<Entity id="1cdb674d-c19a-4fcf-9f4b-7f56cc87345a" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Func_ireland_pps"/>
     <Any minMatches="1">
        <Match idRef="Keyword_ireland_pps"/>
        <Match idRef="Func_eu_date"/>
     </Any>
  </Pattern>
  <Pattern confidenceLevel="65">
     <IdMatch idRef="Func_ireland_pps"/>
  </Pattern>
</Entity>
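
The PPS check letter follows a published modulo 23 scheme: the seven digits are weighted 8 down to 2, a second letter (if present and not "W") is weighted 9 with A=1, and the remainder selects the check letter. The Python sketch below applies that scheme. The name pps_checksum_ok is hypothetical, and it is an assumption that Func_ireland_pps uses the same calculation.

```python
import re

_PPS = re.compile(r"^([0-9]{7})([A-Za-z])([A-Za-z]?)$")

def pps_checksum_ok(text: str) -> bool:
    m = _PPS.match(text.strip())
    if not m:
        return False
    digits, check, second = m.group(1), m.group(2).upper(), m.group(3).upper()
    # Seven digits weighted 8, 7, 6, 5, 4, 3, 2.
    total = sum(int(d) * w for d, w in zip(digits, range(8, 1, -1)))
    # An optional second letter counts with weight 9 (A=1, B=2, ...);
    # "W" or no second letter contributes nothing.
    if second and second != "W":
        total += (ord(second) - ord("A") + 1) * 9
    # Remainder 0 maps to W, 1 to A, ..., 22 to V.
    return check == "WABCDEFGHIJKLMNOPQRSTUV"[total % 23]
```

Under this scheme "1234567T" (old format) and "1234567FA" (new format) both pass.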

Keywords:

Keyword_ireland_pps
Personal Public Service Number
PPS Number
PPS Num
PPS No.
PPS #
PPS#
PPSN
Public Services Card
Uimhir Phearsanta Seirbhíse Poiblí
Uimh. PSP
PSP

Israel Bank Account Number

Format: 13 digits

Pattern:

Formatted:

  • Two digits

  • A dash

  • Three digits

  • A dash

  • Eight digits

Unformatted: 13 consecutive digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_israel_bank_account_number finds content that matches the pattern.

  • A keyword from Keyword_israel_bank_account_number is found.

<!-- Israel Bank Account Number -->
<Entity id="7d08b2ff-a0b9-437f-957c-aeddbf9b2b25" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_israel_bank_account_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_israel_bank_account_number" />
        </Any>
    </Pattern>
</Entity>

Keywords:

Keyword_israel_bank_account_number
Bank Account Number
Bank Account
Account Number
מספר חשבון בנק

Israel National ID

Format: Nine digits

Pattern: Nine consecutive digits

Checksum: Yes

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_israeli_national_id_number finds content that matches the pattern.

  • A keyword from Keyword_Israel_National_ID is found.

  • The checksum passes.

<!-- Israel National ID Number -->
<Entity id="e05881f5-1db1-418c-89aa-a3ac5c5277ee" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_israeli_national_id_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_Israel_National_ID" />
        </Any>
    </Pattern>
</Entity>
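
The Israeli ID check digit is commonly described as a Luhn-style test: weight the nine digits 1, 2, 1, 2, and so on, reduce two-digit products by summing their digits, and require a total divisible by 10. The sketch below shows that test in Python. The name israel_id_checksum_ok is hypothetical, and it is an assumption that Func_israeli_national_id_number works the same way.

```python
def israel_id_checksum_ok(text: str) -> bool:
    if len(text) != 9 or not all(c in "0123456789" for c in text):
        return False
    total = 0
    for i, ch in enumerate(text):
        n = int(ch) * (1 if i % 2 == 0 else 2)  # weights 1, 2, 1, 2, ...
        total += n - 9 if n > 9 else n          # same as adding the two digits
    return total % 10 == 0
```

For instance, the made-up value "123456782" passes and "123456789" fails.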

Keywords:

Keyword_Israel_National_ID
מספר זהות
National ID Number

Italy Driver's License Number

Format: A combination of 10 letters and digits

Pattern: A combination of 10 letters and digits:

  • One letter (not case sensitive)

  • The letter "A" or "V" (not case sensitive)

  • Seven letters (not case sensitive), digits, or the underscore character

  • One letter (not case sensitive)

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_italy_drivers_license_number finds content that matches the pattern.

  • A keyword from Keyword_italy_drivers_license_number is found.

<!-- Italy Driver's license Number -->
<Entity id="97d6244f-9157-41bd-8e0c-9d669a5c4d71" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_italy_drivers_license_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_italy_drivers_license_number" />
        </Any>
    </Pattern>
</Entity>

Keywords:

Keyword_italy_drivers_license_number
numero di patente di guida
patente di guida

Japan Bank Account Number

Format: Seven or eight digits

Pattern:

Bank account number: Seven or eight digits

Bank account branch code:

  • Four digits

  • A space or dash (optional)

  • Three digits

Checksum: No

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_jp_bank_account finds content that matches the pattern.

  • A keyword from Keyword_jp_bank_account is found.

  • One of the following is true:

    • The function Func_jp_bank_account_branch_code finds content that matches the pattern.

    • A keyword from Keyword_jp_bank_branch_code is found.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_jp_bank_account finds content that matches the pattern.

  • A keyword from Keyword_jp_bank_account is found.

<!-- Japan Bank Account Number -->
<Entity id="d354f95b-96ee-4b80-80bc-4377312b55bc" patternsProximity="300" recommendedConfidence="75">
  <Version minEngineVersion="15.01.0131.000">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_jp_bank_account" />
        <Match idRef="Keyword_jp_bank_account" />
        <Any minMatches="1">
          <Match idRef="Func_jp_bank_account_branch_code" />
          <Match idRef="Keyword_jp_bank_branch_code" />
        </Any>
    </Pattern>
  </Version>
  <Pattern confidenceLevel="75">
      <IdMatch idRef="Func_jp_bank_account" />
      <Match idRef="Keyword_jp_bank_account" />
  </Pattern>
</Entity>

Keywords:

Keyword_jp_bank_account
Checking Account Number
Checking Account
Checking Account #
Checking Acct Number
Checking Acct #
Checking Acct No.
Checking Account No.
Bank Account Number
Bank Account
Bank Account #
Bank Acct Number
Bank Acct #
Bank Acct No.
Bank Account No.
Savings Account Number
Savings Account
Savings Account #
Savings Acct Number
Savings Acct #
Savings Acct No.
Savings Account No.
Debit Account Number
Debit Account
Debit Account #
Debit Acct Number
Debit Acct #
Debit Acct No.
Debit Account No.
口座番号を当座預金口座の確認
#アカウントの確認、勘定番号の確認
#勘定の確認
勘定番号の確認
口座番号の確認
銀行口座番号
銀行口座
銀行口座#
銀行の勘定番号
銀行のacct#
銀行の勘定いいえ
銀行口座番号
普通預金口座番号
預金口座
貯蓄口座#
貯蓄勘定の数
貯蓄勘定#
貯蓄勘定番号
普通預金口座番号
引き落とし口座番号
口座番号
口座番号#
デビットのacct番号
デビット勘定#
デビットACCTの番号
デビット口座番号

Keyword_jp_bank_branch_code
Otemachi

Japan Driver's License Number

Format: 12 digits

Pattern: 12 consecutive digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_jp_drivers_license_number finds content that matches the pattern.

  • A keyword from Keyword_jp_drivers_license_number is found.

<!-- Japan Driver's License Number -->
<Entity id="c6011143-d087-451c-8313-7f6d4aed2270" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_jp_drivers_license_number" />
        <Match idRef="Keyword_jp_drivers_license_number" />
    </Pattern>
</Entity>

Keywords:

Keyword_jp_drivers_license_number
driver license
drivers license
driver's license
drivers licenses
driver's licenses
driver licenses
dl#
dls#
lic#
lics#
運転免許証
運転免許
免許証
免許
運転免許証番号
運転免許番号
免許証番号
免許番号
運転免許証ナンバー
運転免許ナンバー
免許証ナンバー
運転免許証No.
運転免許No.
免許証No.
免許No.
運転免許証#
運転免許#
免許証#
免許#

Japan Passport Number

Format: Two letters followed by seven digits

Pattern: Two letters (not case sensitive) followed by seven digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_jp_passport finds content that matches the pattern.

  • A keyword from Keyword_jp_passport is found.

<!-- Japan Passport Number -->
<Entity id="75177310-1a09-4613-bf6d-833aae3743f8" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_jp_passport" />
        <Match idRef="Keyword_jp_passport" />
    </Pattern>
</Entity>

Keywords:

Keyword_jp_passport
パスポート
パスポート番号
パスポートのNum
パスポート#

Japan Resident Registration Number

Format: 11 digits

Pattern: 11 consecutive digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_jp_resident_registration_number finds content that matches the pattern.

  • A keyword from Keyword_jp_resident_registration_number is found.

<!-- Japan Resident Registration Number -->
<Entity id="01c1209b-6389-4faf-a5f8-3f7e13899652" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_jp_resident_registration_number" />
        <Match idRef="Keyword_jp_resident_registration_number" />
    </Pattern>
</Entity>

Keywords:

Keyword_jp_resident_registration_number
Resident Registration Number
Resident Register Number
Residents Basic Registry Number
Resident Registration No.
Resident Register No.
Residents Basic Registry No.
Basic Resident Register No.
住民登録番号、登録番号をレジデント
住民基本登録番号、登録番号
住民基本レジストリ番号を常駐
登録番号を常駐住民基本台帳登録番号

Japan Social Insurance Number (SIN)

Format: 7-12 digits

Pattern: 7-12 digits:

  • Four digits

  • A hyphen (optional)

  • Six digits

    OR

  • 7-12 consecutive digits

Checksum: No

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_jp_sin finds content that matches the pattern.

  • A keyword from Keyword_jp_sin is found.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_jp_sin_pre_1997 finds content that matches the pattern.

  • A keyword from Keyword_jp_sin is found.

<!-- Japan Social Insurance Number -->
<Entity id="c840e719-0896-45bb-84fd-1ed5c95e45ff" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_jp_sin" />
        <Match idRef="Keyword_jp_sin" />
    </Pattern>
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_jp_sin_pre_1997" />
        <Match idRef="Keyword_jp_sin" />
    </Pattern>
</Entity>

Keywords:

Keyword_jp_sin
Social Insurance No.
Social Insurance Num
Social Insurance Number
社会保険のテンキー
社会保険番号

Malaysia ID Card Number

Format: 12 digits containing optional hyphens

Pattern: 12 digits:

  • Six digits in the format YYMMDD, which are the date of birth

  • A dash (optional)

  • Two-digit place-of-birth code

  • A dash (optional)

  • Three random digits

  • One-digit gender code

Checksum: No

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_malaysia_id_card_number finds content that matches the pattern.

  • A keyword from Keyword_malaysia_id_card_number is found.

<!-- Malaysia ID Card Number -->
<Entity id="7f0e921c-9677-435b-aba2-bb8f1013c749" patternsProximity="300" recommendedConfidence="85">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Regex_malaysia_id_card_number" />
        <Match idRef="Keyword_malaysia_id_card_number" />
    </Pattern>
</Entity>

Keywords:

Keyword_malaysia_id_card_number
MyKad
Identity Card
ID Card
Identification Card
Digital Application Card
Kad Akuan Diri
Kad Aplikasi Digital

Netherlands Citizen's Service (BSN) Number

Format: 8-9 digits containing optional spaces

Pattern: 8-9 digits:

  • Three digits

  • A space (optional)

  • Three digits

  • A space (optional)

  • 2-3 digits

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_netherlands_bsn finds content that matches the pattern.

  • A keyword from Keyword_netherlands_bsn is found.

  • The function Func_eu_date finds a date in the right date format.

  • The checksum passes.

A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_netherlands_bsn finds content that matches the pattern.

  • The checksum passes.

<!-- Netherlands Citizen's Service (BSN) Number -->
<Entity id="c5f54253-ef7e-44f6-a578-440ed67e946d" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Func_netherlands_bsn"/>
     <Match idRef="Keyword_netherlands_bsn"/>
     <Match idRef="Func_eu_date"/>
  </Pattern>
  <Pattern confidenceLevel="65">
     <IdMatch idRef="Func_netherlands_bsn"/>
  </Pattern>
</Entity>
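
The BSN checksum is publicly known as the "11 test": the nine digits are weighted 9, 8, 7, 6, 5, 4, 3, 2, and -1, and the weighted sum must be divisible by 11. The Python sketch below applies it. The name bsn_checksum_ok is hypothetical, padding eight-digit values with a leading zero is an assumption, and so is the idea that Func_netherlands_bsn uses this exact test.

```python
import re

def bsn_checksum_ok(text: str) -> bool:
    digits = text.replace(" ", "")
    if not re.fullmatch(r"[0-9]{8,9}", digits):
        return False
    digits = digits.zfill(9)  # assumed: treat 8-digit numbers as 0-prefixed
    weights = (9, 8, 7, 6, 5, 4, 3, 2, -1)
    return sum(int(d) * w for d, w in zip(digits, weights)) % 11 == 0
```

For example, the made-up value "111222333" passes and "123456789" fails.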

Keywords:

Keyword_netherlands_bsn
Citizen service number
BSN
Burgerservicenummer
Sofinummer
Persoonsgebonden nummer
Persoonsnummer

New Zealand Ministry of Health Number

Format: Three letters, a space (optional), and four digits

Pattern: Three letters (not case sensitive), a space (optional), and four digits

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_new_zealand_ministry_of_health_number finds content that matches the pattern.

  • A keyword from Keyword_nz_terms is found.

  • The checksum passes.

<!-- New Zealand Health Number -->
<Entity id="2b71c1c8-d14e-4430-82dc-fd1ed6bf05c7" patternsProximity="300" recommendedConfidence="85">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_new_zealand_ministry_of_health_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_nz_terms" />
        </Any>
    </Pattern>
</Entity>
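
For the three-letter, four-digit format described here, the published NHI check-digit scheme works modulo 11: letters are numbered 1 to 24 with I and O skipped, the first six characters are weighted 7 down to 2, and the check digit is 11 minus the remainder (with 10 written as 0). The Python sketch below applies that scheme. The name nhi_checksum_ok is hypothetical, and it is an assumption that Func_new_zealand_ministry_of_health_number does the same.

```python
import re

_LETTERS = "ABCDEFGHJKLMNPQRSTUVWXYZ"  # I and O are not used; A=1 ... Z=24

def nhi_checksum_ok(text: str) -> bool:
    m = re.fullmatch(r"([A-Za-z]{3}) ?([0-9]{4})", text)
    if not m:
        return False
    letters, digits = m.group(1).upper(), m.group(2)
    if any(c not in _LETTERS for c in letters):
        return False
    values = [_LETTERS.index(c) + 1 for c in letters]
    values += [int(d) for d in digits[:3]]
    remainder = sum(v * w for v, w in zip(values, (7, 6, 5, 4, 3, 2))) % 11
    if remainder == 0:
        return False  # no valid check digit exists for this prefix
    return (11 - remainder) % 10 == int(digits[3])
```

The reserved test values "ZZZ0016" and "ZZZ0024" pass under this scheme.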

Keywords:

Keyword_nz_terms
NHI
New Zealand
Health
treatment

Norway Identification Number

Format: 11 digits

Pattern: 11 digits:

  • Six digits in the format DDMMYY that are the date of birth

  • Three-digit individual number

  • Two check digits

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_norway_id_number finds content that matches the pattern.

  • A keyword from Keyword_norway_id_number is found.

  • The checksum passes.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_norway_id_number finds content that matches the pattern.

  • The checksum passes.

<!-- Norway Identification Number -->
<Entity id="d4c8a798-e9f2-4bd3-9652-500d24080fc3" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Func_norway_id_number"/>
     <Match idRef="Keyword_norway_id_number"/>
  </Pattern>
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Func_norway_id_number"/>
  </Pattern>
</Entity>
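
The two check digits of a Norwegian identification number are each defined by a modulo 11 weighted sum. As a sketch, the Python below checks both sums with the check digits included, so each sum must be divisible by 11. The name norway_id_checksum_ok is hypothetical, and it is an assumption that Func_norway_id_number applies the same test.

```python
import re

# Weights for the first and second check digit; the trailing 1 is the
# weight of the check digit itself.
_W1 = (3, 7, 6, 1, 8, 9, 4, 5, 2, 1)
_W2 = (5, 4, 3, 2, 7, 6, 5, 4, 3, 2, 1)

def norway_id_checksum_ok(text: str) -> bool:
    if not re.fullmatch(r"[0-9]{11}", text):
        return False
    d = [int(c) for c in text]
    first = sum(x * w for x, w in zip(d, _W1)) % 11 == 0   # digits 1-10
    second = sum(x * w for x, w in zip(d, _W2)) % 11 == 0  # digits 1-11
    return first and second
```

For the made-up value "01018012371" both sums come to 99, so it passes.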

Keywords:

Keyword_norway_id_number
Personal identification number
Norwegian ID Number
ID Number
Identification
Personnummer
Fødselsnummer

Philippines Unified Multi-Purpose ID Number

Format: 12 digits separated by hyphens

Pattern: 12 digits:

  • Four digits

  • A hyphen

  • Seven digits

  • A hyphen

  • One digit

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_philippines_unified_id finds content that matches the pattern.

  • A keyword from Keyword_philippines_id is found.

<!-- Philippines Unified Multi-Purpose ID number -->
<Entity id="019b39dd-8c25-4765-91a3-d9c6baf3c3b3" recommendedConfidence="75" patternsProximity="300">
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Regex_philippines_unified_id"/>
     <Match idRef="Keyword_philippines_id"/>
  </Pattern>
</Entity>

Keywords:

Keyword_philippines_id
Unified Multi-Purpose ID
UMID
Identity Card
Pinag-isang Multi-Layunin ID

Poland Identity Card

Format: Three letters and six digits

Pattern: Three letters (not case sensitive) followed by six digits

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_polish_national_id finds content that matches the pattern.

  • A keyword from Keyword_polish_national_id_passport_number is found.

  • The checksum passes.

<!-- Poland Identity Card-->
<Entity id="25E64989-ED5D-40CA-A939-6C14183BB7BF" patternsProximity="300" recommendedConfidence="85">
      <Pattern confidenceLevel="85">
          <IdMatch idRef="Func_polish_national_id" />
          <Match idRef="Keyword_polish_national_id_passport_number" />
      </Pattern>
</Entity>

Keywords:

Keyword_polish_national_id_passport_number
Nazwa i nr dowodu tożsamości
Dowód Tożsamości
dow. os.

Poland National ID (PESEL)

Format: 11 digits

Pattern: 11 consecutive digits

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_pesel_identification_number finds content that matches the pattern.

  • A keyword from Keyword_pesel_identification_number is found.

  • The checksum passes.

<!-- Poland National ID (PESEL) -->
<Entity id="E3AAF206-4297-412F-9E06-BA8487E22456" patternsProximity="300" recommendedConfidence="85">
      <Pattern confidenceLevel="85">
          <IdMatch idRef="Func_pesel_identification_number" />
          <Match idRef="Keyword_pesel_identification_number" />
      </Pattern>
</Entity>
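
The PESEL check digit is publicly defined: the first ten digits are weighted 1, 3, 7, 9, 1, 3, 7, 9, 1, 3, and the eleventh digit is 10 minus the last digit of the weighted sum (with 10 written as 0). The Python sketch below applies that rule. The name pesel_checksum_ok is hypothetical, and it is an assumption that Func_pesel_identification_number uses it unchanged.

```python
import re

def pesel_checksum_ok(text: str) -> bool:
    if not re.fullmatch(r"[0-9]{11}", text):
        return False
    weights = (1, 3, 7, 9, 1, 3, 7, 9, 1, 3)
    total = sum(int(d) * w for d, w in zip(text, weights))
    return (10 - total % 10) % 10 == int(text[10])
```

The commonly cited sample "44051401359" passes this check.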

Keywords:

Keyword_pesel_identification_number
Nr PESEL
PESEL

Poland Passport

Format: Two letters and seven digits

Pattern: Two letters (not case sensitive) followed by seven digits

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_polish_passport_number finds content that matches the pattern.

  • A keyword from Keyword_polish_national_id_passport_number is found.

  • The checksum passes.

<!-- Poland Passport Number -->
<Entity id="03937FB5-D2B6-4487-B61F-0F8BFF7C3517" patternsProximity="300" recommendedConfidence="85">
      <Pattern confidenceLevel="85">
          <IdMatch idRef="Func_polish_passport_number" />
          <Match idRef="Keyword_polish_national_id_passport_number" />
      </Pattern>
</Entity>

Keywords:

Keyword_polish_national_id_passport_number
Nazwa i nr dowodu tożsamości
Dowód Tożsamości
dow. os.

Portugal Citizen Card Number

Format: Eight digits

Pattern: Eight digits

Checksum: No

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_portugal_citizen_card finds content that matches the pattern.

  • A keyword from Keyword_portugal_citizen_card is found.

<!-- Portugal Citizen Card Number -->
<Entity id="91a7ece2-add4-4986-9a15-c84544d81ecd" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Regex_portugal_citizen_card"/>
     <Match idRef="Keyword_portugal_citizen_card"/>
  </Pattern>
</Entity>

Keywords:

Keyword_portugal_citizen_card
Citizen Card
National ID Card
CC
Cartão de Cidadão
Bilhete de Identidade

Saudi Arabia National ID

Format: 10 digits

Pattern: 10 consecutive digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_saudi_arabia_national_id finds content that matches the pattern.

  • A keyword from Keyword_saudi_arabia_national_id is found.

<!-- Saudi Arabia National ID -->
<Entity id="8c5a0ba8-404a-41a3-8871-746aa21ee6c0" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_saudi_arabia_national_id" />
        <Any minMatches="1">
          <Match idRef="Keyword_saudi_arabia_national_id" />
        </Any>
    </Pattern>
</Entity>

Keywords:

Keyword_saudi_arabia_national_id
Identification Card
I card number
ID number
الوطنية الهوية بطاقة رقم

Singapore National Registration Identity Card (NRIC) Number

Format: Nine letters and digits

Pattern: Nine letters and digits:

  • The letter "F", "G", "S", or "T" (not case sensitive)

  • Seven digits

  • An alphabetic check digit

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_singapore_nric finds content that matches the pattern.

  • A keyword from Keyword_singapore_nric is found.

  • The checksum passes.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_singapore_nric finds content that matches the pattern.

  • The checksum passes.

<!-- Singapore National Registration Identity Card (NRIC) Number -->
<Entity id="cead390a-dd83-4856-9751-fb6dc98c34da" recommendedConfidence="75" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Regex_singapore_nric"/>
     <Match idRef="Keyword_singapore_nric"/>
  </Pattern>
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Regex_singapore_nric"/>
  </Pattern>
</Entity>
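
A regular expression alone can't verify the NRIC check letter, so the checksum step is a separate calculation. As a sketch, the Python below applies the widely published scheme: weights 2, 7, 6, 5, 4, 3, 2 on the seven digits, an offset of 4 for the T and G prefixes, and a modulo 11 lookup table per prefix group. The name nric_checksum_ok is hypothetical, and it is an assumption that the DLP rule applies the same scheme.

```python
import re

_NRIC = re.compile(r"^([FGSTfgst])([0-9]{7})([A-Za-z])$")
_ST = "JZIHGFEDCBA"  # check letters for S and T, indexed by remainder
_FG = "XWUTRQPNMLK"  # check letters for F and G, indexed by remainder
_TABLES = {"S": _ST, "T": _ST, "F": _FG, "G": _FG}

def nric_checksum_ok(text: str) -> bool:
    m = _NRIC.match(text.strip())
    if not m:
        return False
    prefix, digits, check = m.group(1).upper(), m.group(2), m.group(3).upper()
    total = sum(int(d) * w for d, w in zip(digits, (2, 7, 6, 5, 4, 3, 2)))
    if prefix in "TG":
        total += 4  # offset for the T and G series
    return check == _TABLES[prefix][total % 11]
```

Under this scheme the sample values "S1234567D" and "F1234567N" pass.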

Keywords:

Keyword_singapore_nric
National Registration Identity Card
Identity Card Number
NRIC
IC
Foreign Identification Number
FIN
身份证
身份證

South Africa Identification Number

Format: 13 digits that may contain spaces

Pattern: 13 digits:

  • Six digits in the format YYMMDD, which are the date of birth

  • Four digits

  • A single-digit citizenship indicator

  • The digit "8" or "9"

  • One digit that is a checksum digit

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_south_africa_identification_number finds content that matches the pattern.

  • A keyword from Keyword_south_africa_identification_number is found.

  • The checksum passes.

<!-- South Africa Identification Number -->
<Entity id="e2adf7cb-8ea6-4048-a2ed-d89eb65f2780" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Func_south_africa_identification_number"/>
     <Match idRef="Keyword_south_africa_identification_number"/>
  </Pattern>
</Entity>

Keywords:

Keyword_south_africa_identification_number
Identity card
ID
Identification

South Korea Resident Registration Number

Format: 13 digits containing a hyphen

Pattern: 13 digits:

  • Six digits in the format YYMMDD that are the date of birth

  • A hyphen

  • One digit determined by the century and gender

  • Four-digit region-of-birth code

  • One digit used to differentiate people for whom the preceding numbers are identical

  • A check digit.

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_south_korea_resident_number finds content that matches the pattern.

  • A keyword from Keyword_south_korea_resident_number is found.

  • The checksum passes.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_south_korea_resident_number finds content that matches the pattern.

  • The checksum passes.

<!-- South Korea Resident Registration Number -->
<Entity id="5b802e18-ba80-44c4-bc83-bf2ad36ae36a" recommendedConfidence="85" patternsProximity="300">
  <Pattern confidenceLevel="85">
     <IdMatch idRef="Func_south_korea_resident_number"/>
     <Match idRef="Keyword_south_korea_resident_number"/>
  </Pattern>
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Func_south_korea_resident_number"/>
  </Pattern>
</Entity>
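
The check digit of a resident registration number is publicly defined: the first 12 digits are weighted 2 through 9 and then 2 through 5, and the check digit is 11 minus the sum modulo 11, taken modulo 10. The Python sketch below applies that rule to the hyphenated format described here. The name rrn_checksum_ok is hypothetical, and it is an assumption that Func_south_korea_resident_number uses the same rule.

```python
import re

def rrn_checksum_ok(text: str) -> bool:
    m = re.fullmatch(r"([0-9]{6})-([0-9]{7})", text)
    if not m:
        return False
    d = [int(c) for c in m.group(1) + m.group(2)]
    weights = (2, 3, 4, 5, 6, 7, 8, 9, 2, 3, 4, 5)
    total = sum(x * w for x, w in zip(d, weights))  # first 12 digits only
    return (11 - total % 11) % 10 == d[12]
```

For the made-up value "900101-1234568" the weighted sum is 124, which gives a check digit of 8.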

Keywords:

Keyword_south_korea_resident_number
National ID card
Citizen's Registration Number
Jumin deungnok beonho
RRN
주민등록번호

Spain Social Security Number (SSN)

Format: 11-12 digits

Pattern: 11-12 digits:

  • Two digits

  • A forward slash (optional)

  • 7-8 digits

  • A forward slash (optional)

  • Two digits

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_spanish_social_security_number finds content that matches the pattern.

  • The checksum passes.

<!-- Spain SSN -->
<Entity id="5df987c0-8eae-4bce-ace7-b316347f3070" patternsProximity="300" recommendedConfidence="85">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_spanish_social_security_number" />
    </Pattern>
</Entity>

Keywords: None

Sweden National ID

Format: 10 or 12 digits and an optional delimiter

Pattern: 10 or 12 digits and an optional delimiter:

  • 2-4 digits (optional)

  • Six digits in date format YYMMDD

  • Delimiter of "-" or "+" (optional)

  • Four digits

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_swedish_national_identifier finds content that matches the pattern.

  • The checksum passes.

<!-- Sweden National ID -->
<Entity id="f69aaf40-79be-4fac-8f05-fd1910d272c8" patternsProximity="300" recommendedConfidence="85">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_swedish_national_identifier" />
    </Pattern>
</Entity>
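
As a rough illustration of the pattern and checksum above, the sketch below applies the Luhn algorithm, which Swedish personal identity numbers use, to the ten digits that follow any century prefix. It assumes a two-digit century prefix only, and the names `PERSONNUMMER_RE`, `luhn_ok`, and `swedish_id_checksum_ok` are hypothetical; it is not the logic of Func_swedish_national_identifier.

```python
import re

# Optional two-digit century, YYMMDD, optional "-" or "+", four digits.
# Assumption: only the two-digit century prefix is handled here.
PERSONNUMMER_RE = re.compile(r"(?:\d{2})?(\d{6})[-+]?(\d{4})")


def luhn_ok(digits: str) -> bool:
    """Luhn check for an even-length digit string (doubling starts at index 0)."""
    total = 0
    for index, ch in enumerate(digits):
        value = int(ch)
        if index % 2 == 0:
            value *= 2
            if value > 9:
                value -= 9  # same as adding the two digits of the product
        total += value
    return total % 10 == 0


def swedish_id_checksum_ok(candidate: str) -> bool:
    """Return True if candidate fits the layout and its last ten digits pass Luhn."""
    match = PERSONNUMMER_RE.fullmatch(candidate)
    if match is None:
        return False
    return luhn_ok(match.group(1) + match.group(2))
```

The century digits, when present, are matched but left out of the checksum, because the check digit is computed over the ten-digit form.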

Keywords: None

Sweden Passport Number

Format: Eight digits

Pattern: Eight consecutive digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_sweden_passport_number finds content that matches the pattern.

  • One of the following is true:

    • A keyword from Keyword_passport is found.

    • A keyword from Keyword_sweden_passport is found.

<!-- Sweden Passport Number -->
<Entity id="ba4e7456-55a9-4d89-9140-c33673553526" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_sweden_passport_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_passport" />
          <Match idRef="Keyword_sweden_passport" />
        </Any>
    </Pattern>
</Entity>

Keywords:

Keyword_sweden_passport
visa requirements
Alien Registration Card
Schengen visas
Schengen visa
Visa Processing
Visa Type
Single Entry
Multiple Entry
G3 Processing Fees

Keyword_passport
Passport Number
Passport No
Passport #
Passport#
PassportID
Passportno
passport number
パスポート
パスポート番号
パスポートのNum
パスポート#
Numéro de passeport
Passeport n °
Passeport Non
Passeport #
Passeport#
PasseportNon
Passeportn °

SWIFT Code

Format: Four letters followed by 5-31 letters or digits

Pattern: Four letters followed by 5-31 letters or digits:

  • Four-letter bank code (not case sensitive)

  • An optional space

  • 4-28 letters or digits (the Basic Bank Account Number (BBAN))

  • An optional space

  • 1-3 letters or digits (remainder of the BBAN)

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_swift finds content that matches the pattern.

  • A keyword from Keyword_swift is found.

<!-- SWIFT Code -->
<Entity id="cb2ab58c-9cb8-4c81-baf8-a4e106791df4" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_swift" />
        <Match idRef="Keyword_swift" />
    </Pattern>
</Entity>

Keywords:

Keyword_swift
international organization for standardization 9362
iso 9362
iso9362
swift#
swift code
swift number
swiftroutingnumber
swift code
swift number #
swift routing number
bic number
bic code
bic #
bic#
bank identifier code
標準化9362
迅速#
SWIFTコード
SWIFT番号
迅速なルーティング番号
BIC番号
BICコード
銀行識別コードのための国際組織
Organisation internationale de normalisation 9362
rapide #
code SWIFT
le numéro de swift
swift numéro d'acheminement
le numéro BIC
# BIC
code identificateur de banque

Taiwanese ID

Format: One letter (in English) followed by nine digits

Pattern: One letter (in English) followed by nine digits:

  • One letter (in English, not case sensitive)

  • The digit "1" or "2"

  • Eight digits

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_taiwanese_national_id finds content that matches the pattern.

  • A keyword from Keyword_taiwanese_national_id is found.

  • The checksum passes.

<!-- Taiwanese National ID -->
<Entity id="4C7BFC34-8DD1-421D-8FB7-6C6182C2AF03" patternsProximity="300" recommendedConfidence="85">
      <Pattern confidenceLevel="85">
          <IdMatch idRef="Func_taiwanese_national_id" />
          <Match idRef="Keyword_taiwanese_national_id" />
      </Pattern>
</Entity>
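
The pattern and checksum above can be illustrated with the widely documented check for Taiwanese national ID numbers, in which the leading letter maps to a two-digit code and all digits are weighted. This is a sketch under that assumption, with hypothetical names (`TAIWAN_ID_RE`, `LETTER_CODES`, `taiwan_id_checksum_ok`); the source does not describe how Func_taiwanese_national_id works internally.

```python
import re

# One letter, then "1" or "2", then eight digits.
TAIWAN_ID_RE = re.compile(r"[A-Za-z][12]\d{8}")

# Letters listed in order of their codes 10..35 (A=10, ..., I=34, O=35).
LETTER_CODES = {ch: 10 + i for i, ch in enumerate("ABCDEFGHJKLMNPQRSTUVXYWZIO")}

# Weights for the two letter-code digits followed by the nine ID digits.
WEIGHTS = (1, 9, 8, 7, 6, 5, 4, 3, 2, 1, 1)


def taiwan_id_checksum_ok(candidate: str) -> bool:
    """Return True if candidate fits the layout and the weighted sum is a multiple of 10."""
    if TAIWAN_ID_RE.fullmatch(candidate) is None:
        return False
    code = LETTER_CODES[candidate[0].upper()]
    digits = [code // 10, code % 10] + [int(ch) for ch in candidate[1:]]
    total = sum(weight * digit for weight, digit in zip(WEIGHTS, digits))
    return total % 10 == 0
```

For example, `taiwan_id_checksum_ok("A123456789")` passes both the layout test and the checksum, and the letter is accepted in either case.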

Keywords:

Keyword_taiwanese_national_id
身份證字號
身份證
身份證號碼
身份證號
身分證字號
身分證
身分證號碼
身份證號
身分證統一編號
國民身分證統一編號
簽名
蓋章
簽名或蓋章
簽章

Taiwan Passport Number

Format:

  • Biometric passport number: Nine digits

  • Non-biometric passport number: Nine digits

Pattern:

  • Biometric passport number

    • The digit "3"

    • Eight digits

  • Non-biometric passport number: Nine digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_taiwan_passport finds content that matches the pattern.

  • A keyword from Keyword_taiwan_passport is found.

<!-- Taiwan Passport Number -->
<Entity id="e7251cb4-4c2c-41df-963e-924eb3dae04a" recommendedConfidence="75" patternsProximity="300">
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Regex_taiwan_passport"/>
     <Match idRef="Keyword_taiwan_passport"/>
  </Pattern>
</Entity>

Keywords:

Keyword_taiwan_passport
Taiwan passport number
Passport number
Passport no
Passport Num
Passport #
护照
中華民國護照
Zhōnghuá Mínguó hùzhào

Taiwan Resident Certificate (ARC/TARC) Number

Format: 10 letters and digits

Pattern: 10 letters and digits:

  • Two letters (not case sensitive)

  • Eight digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_taiwan_resident_certificate finds content that matches the pattern.

  • A keyword from Keyword_taiwan_resident_certificate is found.

<!-- Taiwan Resident Certificate (ARC/TARC) -->
<Entity id="48269fec-05ea-46ea-b326-f5623a58c6e9" recommendedConfidence="75" patternsProximity="300">
  <Pattern confidenceLevel="75">
     <IdMatch idRef="Regex_taiwan_resident_certificate"/>
     <Match idRef="Keyword_taiwan_resident_certificate"/>
  </Pattern>
</Entity>

Keywords:

Keyword_taiwan_resident_certificate
Resident Certificate
Resident Cert
Resident Cert.
Identification card
Alien Resident Certificate
ARC
Taiwan Area Resident Certificate
TARC
居留證
外僑居留證
台灣地區居留證

U.K. Driver's License Number

Format: Combination of 18 letters and digits in the specified format

Pattern: 18 letters and digits:

  • Five letters (not case sensitive) or the digit "9" in place of a letter

  • One digit

  • Five digits in the date format DDMMY for date of birth

  • Two letters (not case sensitive) or the digit "9" in place of a letter

  • Five digits

Checksum: Yes

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_uk_drivers_license finds content that matches the pattern.

  • A keyword from Keyword_uk_drivers_license is found.

  • The checksum passes.

<!-- U.K. Driver's License Number -->
<Entity id="f93de4be-d94c-40df-a8be-461738047551" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_uk_drivers_license" />
        <Match idRef="Keyword_uk_drivers_license" />
    </Pattern>
</Entity>

Keywords:

Keyword_uk_drivers_license
DVLA
light vans
quad bikes
motor cars
125cc
sidecar
tricycles
motorcycles
photo card licence
learner drivers
licence holder
licence holders
driving licences
driving licence
dual control car

U.K. Electoral Roll Number

Format: Two letters followed by 1-4 digits

Pattern: Two letters (not case sensitive) followed by 1-4 digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_uk_electoral finds content that matches the pattern.

  • A keyword from Keyword_uk_electoral is found.

<!-- U.K. Electoral Number -->
<Entity id="a3eea206-dc0c-4f06-9e22-aa1be3059963" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_uk_electoral" />
        <Any minMatches="1">
          <Match idRef="Keyword_uk_electoral" />
        </Any>
    </Pattern>
</Entity>

Keywords:

Keyword_uk_electoral
council nomination
nomination form
electoral register
electoral roll

U.K. National Health Service Number

Format: 10-17 digits separated by spaces

Pattern: 10-17 digits:

  • Either 3 or 10 digits

  • A space

  • Three digits

  • A space

  • Four digits

Checksum: Yes

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_uk_nhs_number finds content that matches the pattern.

  • One of the following is true:

    • A keyword from Keyword_uk_nhs_number is found.

    • A keyword from Keyword_uk_nhs_number1 is found.

    • A keyword from Keyword_uk_nhs_number_dob is found.

  • The checksum passes.

<!-- U.K. NHS Number -->
<Entity id="3192014e-2a16-44e9-aa69-4b20375c9a78" patternsProximity="300" recommendedConfidence="85">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_uk_nhs_number" />
        <Any minMatches="1">
          <Match idRef="Keyword_uk_nhs_number" />
          <Match idRef="Keyword_uk_nhs_number1" />
          <Match idRef="Keyword_uk_nhs_number_dob" />
        </Any>
    </Pattern>
</Entity>
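
A minimal sketch of the checksum step, assuming the standard modulus 11 check used for NHS numbers and covering only the 10-digit "ddd ddd dddd" form of the pattern above. The names `NHS_RE` and `nhs_checksum_ok` are hypothetical, and the code is not the implementation of Func_uk_nhs_number.

```python
import re

# Three digits, a space, three digits, a space, four digits.
NHS_RE = re.compile(r"(\d{3}) (\d{3}) (\d{4})")


def nhs_checksum_ok(candidate: str) -> bool:
    """Return True if candidate is 'ddd ddd dddd' and the tenth digit is a valid check digit."""
    match = NHS_RE.fullmatch(candidate)
    if match is None:
        return False
    digits = [int(ch) for ch in "".join(match.groups())]
    # Weights 10 down to 2 apply to the first nine digits.
    total = sum(weight * digit for weight, digit in zip(range(10, 1, -1), digits))
    check = 11 - total % 11
    if check == 11:
        check = 0
    # A computed value of 10 can never equal a single digit, so such numbers are invalid.
    return check != 10 and check == digits[9]
```

In a DLP rule the keyword requirement still applies. The checksum only reduces the number of ten-digit strings that qualify as candidates.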

Keywords:

Keyword_uk_nhs_number
national health service
nhs
health services authority
health authority

Keyword_uk_nhs_number1
patient ID
patient identification
patient no
patient number

Keyword_uk_nhs_number_dob
GP
DOB
D.O.B
Date of Birth
Birth Date

U.K. National Insurance Number (NINO)

Format: Nine letters and digits, with each pair of letters and digits optionally separated by spaces or dashes

Pattern: Nine letters and digits, with each pair of letters and digits optionally separated by spaces or dashes:

  • Two letters (not case sensitive), neither of which can be D, F, I, Q, U, or V. Additionally, the second letter can't be O. The following combinations are also not allowed: BG, GB, KN, NK, NT, TN, and ZZ.

  • A space or dash (optional)

  • A space or dash (optional)

  • Two digits

  • A space or dash (optional)

  • Two digits

  • A space or dash (optional)

  • Two digits

  • One letter (A, B, C, or D) or one space

Checksum: No

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_uk_nino finds content that matches the pattern.

  • A keyword from Keyword_uk_nino is found.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_uk_nino finds content that matches the pattern.

  • No keyword from Keyword_uk_nino is found.

<!-- U.K. NINO -->
<Entity id="16c07343-c26f-49d2-a987-3daf717e94cc" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_uk_nino" />
        <Any minMatches="1">
          <Match idRef="Keyword_uk_nino" />
        </Any>
    </Pattern>
     <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_uk_nino" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Keyword_uk_nino" />
        </Any>
    </Pattern>
</Entity>
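
The prefix rules above translate fairly directly into a regular expression. The sketch below assumes the layout given in the Format line (two prefix letters, three pairs of digits, and a one-character suffix, with an optional space or dash before each pair of digits). `NINO_RE` and `is_nino` are hypothetical names, and the expression only approximates what Func_uk_nino may check.

```python
import re

NINO_RE = re.compile(
    r"(?!BG|GB|KN|NK|NT|TN|ZZ)"       # disallowed two-letter combinations
    r"[A-CEGHJ-PR-TW-Z]"              # first letter: not D, F, I, Q, U, or V
    r"[A-CEGHJ-NPR-TW-Z]"             # second letter: same exclusions, and not O
    r"[ -]?\d{2}[ -]?\d{2}[ -]?\d{2}" # three pairs of digits, separators optional
    r"[A-D ]",                        # suffix letter A-D, or a single space
    re.IGNORECASE,
)


def is_nino(candidate: str) -> bool:
    """Return True if the whole string fits the NINO layout sketched above."""
    return NINO_RE.fullmatch(candidate) is not None
```

Because the type has no checksum, a layout match alone yields only the 75% pattern. A keyword from Keyword_uk_nino within the proximity window is what raises the confidence to 85%.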

Keywords:

Keyword_uk_nino
national insurance number
national insurance contributions
protection act
insurance
social security number
insurance application
medical application
social insurance
medical attention
social security
great britain
insurance

U.S. / U.K. Passport Number

Format: Nine digits

Pattern: Nine consecutive digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_usa_uk_passport finds content that matches the pattern.

  • A keyword from Keyword_passport is found.

<Entity id="178ec42a-18b4-47cc-85c7-d62c92fd67f8" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_usa_uk_passport" />
        <Match idRef="Keyword_passport" />
    </Pattern>
</Entity>

Keywords:

Keyword_passport
Passport Number
Passport No
Passport #
Passport#
PassportID
Passportno
passport number
パスポート
パスポート番号
パスポートのNum
パスポート#
Numéro de passeport
Passeport n °
Passeport Non
Passeport #
Passeport#
PasseportNon
Passeportn °

U.S. Bank Account Number

Format: 4-17 digits

Pattern: 4-17 consecutive digits

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The regular expression Regex_usa_bank_account_number finds content that matches the pattern.

  • A keyword from Keyword_usa_Bank_Account is found.

<!-- U.S. Bank Account Number -->
<Entity id="a2ce32a8-f935-4bb6-8e96-2a5157672e2c" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Regex_usa_bank_account_number" />
        <Match idRef="Keyword_usa_Bank_Account" />
    </Pattern>
</Entity>

Keywords:

Keyword_usa_Bank_Account
Checking Account Number
Checking Account
Checking Account #
Checking Acct Number
Checking Acct #
Checking Acct No.
Checking Account No.
Bank Account Number
Bank Account #
Bank Acct Number
Bank Acct #
Bank Acct No.
Bank Account No.
Savings Account Number
Savings Account.
Savings Account #
Savings Acct Number
Savings Acct #
Savings Acct No.
Savings Account No.
Debit Account Number
Debit Account
Debit Account #
Debit Acct Number
Debit Acct #
Debit Acct No.
Debit Account No.

U.S. Driver's License Number

Format: Depends on the state

Pattern: Depends on the state -- for example, New York:

  • Nine digits formatted like ddd ddd ddd will match

  • Nine digits like ddddddddd will not match.

Checksum: No

Definition:

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_new_york_drivers_license_number finds content that matches the pattern.

  • A keyword from Keyword_[state_name]_drivers_license_name is found.

  • A keyword from Keyword_us_drivers_license is found.

A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_new_york_drivers_license_number finds content that matches the pattern.

  • A keyword from Keyword_[state_name]_drivers_license_name is found.

  • A keyword from Keyword_us_drivers_license_abbreviations is found.

  • No keyword from Keyword_us_drivers_license is found.

    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_new_york_drivers_license_number" />
        <Match idRef="Keyword_new_york_drivers_license_name" />
        <Match idRef="Keyword_us_drivers_license" />
    </Pattern>
    <Pattern confidenceLevel="65">
        <IdMatch idRef="Func_new_york_drivers_license_number" />
        <Match idRef="Keyword_new_york_drivers_license_name" />
        <Match idRef="Keyword_us_drivers_license_abbreviations" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Keyword_us_drivers_license" />
        </Any>
    </Pattern>

Keywords:

Keyword_us_drivers_license_abbreviations
DL
DLS
CDL
CDLS
ID
IDs
DL#
DLS#
CDL#
CDLS#
ID#
IDs#
ID number
ID numbers
LIC
LIC#

Keyword_us_drivers_license
DriverLic
DriverLics
DriverLicense
DriverLicenses
Driver Lic
Driver Lics
Driver License
Driver Licenses
DriversLic
DriversLics
DriversLicense
DriversLicenses
Drivers Lic
Drivers Lics
Drivers License
Drivers Licenses
Driver'Lic
Driver'Lics
Driver'License
Driver'Licenses
Driver' Lic
Driver' Lics
Driver' License
Driver' Licenses
Driver'sLic
Driver'sLics
Driver'sLicense
Driver'sLicenses
Driver's Lic
Driver's Lics
Driver's License
Driver's Licenses
identification number
identification numbers
identification #
ID card
ID cards
identification card
identification cards
DriverLic#
DriverLics#
DriverLicense#
DriverLicenses#
Driver Lic#
Driver Lics#
Driver License#
Driver Licenses#
DriversLic#
DriversLics#
DriversLicense#
DriversLicenses#
Drivers Lic#
Drivers Lics#
Drivers License#
Drivers Licenses#
Driver'Lic#
Driver'Lics#
Driver'License#
Driver'Licenses#
Driver' Lic#
Driver' Lics#
Driver' License#
Driver' Licenses#
Driver'sLic#
Driver'sLics#
Driver'sLicense#
Driver'sLicenses#
Driver's Lic#
Driver's Lics#
Driver's License#
Driver's Licenses#
ID card#
ID cards#
identification card#
identification cards#

Keyword_[state_name]_drivers_license_name
State abbreviation (for example, "NY")
State name (for example, "New York")

U.S. Individual Taxpayer Identification Number (ITIN)

Format: Nine digits that start with a "9" and contain a "7" or "8" as the fourth digit, optionally formatted with spaces or dashes

Pattern:

Formatted:

  • The digit "9"

  • Two digits

  • A space or dash

  • A "7" or "8"

  • A digit

  • A space or dash

  • Four digits

Unformatted:

  • The digit "9"

  • Two digits

  • A "7" or "8"

  • Five digits

Checksum: No

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_formatted_itin finds content that matches the pattern.

  • At least one of the following is true:

    • A keyword from Keyword_itin is found.

    • The function Func_us_address finds an address in the right format.

    • The function Func_us_date finds a date in the right date format.

    • A keyword from Keyword_itin_collaborative is found.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_unformatted_itin finds content that matches the pattern.

  • A keyword from Keyword_itin is found.

  • At least one of the following is true:

    • A keyword from Keyword_itin_collaborative is found.

    • The function Func_us_address finds an address in the right format.

    • The function Func_us_date finds a date in the right date format.

<!-- U.S. Individual Taxpayer Identification Number (ITIN) -->
<Entity id="e55e2a32-f92d-4985-a35d-a0b269eb687b" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_formatted_itin" />
        <Any minMatches="1">
          <Match idRef="Keyword_itin" />
          <Match idRef="Func_us_address" />
          <Match idRef="Func_us_date" />
          <Match idRef="Keyword_itin_collaborative" />
        </Any>
    </Pattern>
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_unformatted_itin" />
        <Match idRef="Keyword_itin" />
        <Any minMatches="1">
          <Match idRef="Keyword_itin_collaborative" />
          <Match idRef="Func_us_address" />
          <Match idRef="Func_us_date" />
        </Any>
    </Pattern>
</Entity>
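
The formatted and unformatted layouts above can be approximated with two regular expressions. This is only a sketch: `FORMATTED_ITIN_RE`, `UNFORMATTED_ITIN_RE`, and `classify_itin` are hypothetical names, and the built-in functions Func_formatted_itin and Func_unformatted_itin may apply further checks that the source does not describe.

```python
import re
from typing import Optional

# "9", two digits, separator, "7" or "8", one digit, separator, four digits.
FORMATTED_ITIN_RE = re.compile(r"9\d{2}[ -][78]\d[ -]\d{4}")

# "9", two digits, "7" or "8", five digits, with no separators.
UNFORMATTED_ITIN_RE = re.compile(r"9\d{2}[78]\d{5}")


def classify_itin(candidate: str) -> Optional[str]:
    """Return 'formatted', 'unformatted', or None for a candidate string."""
    if FORMATTED_ITIN_RE.fullmatch(candidate):
        return "formatted"
    if UNFORMATTED_ITIN_RE.fullmatch(candidate):
        return "unformatted"
    return None
```

The distinction matters because the two layouts feed different patterns in the entity: the formatted match is the IdMatch of the 85% pattern, and the unformatted match is the IdMatch of the 75% pattern.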

Keywords:

Keyword_itin
taxpayer
tax ID
tax identification
itin
ssn
tin
social security
tax payer
itins
taxid
individual taxpayer

Keyword_itin_collaborative
License
DL
DOB
Birthdate
Birthday
Date of Birth

U.S. Social Security Number (SSN)

Format: Nine digits, which may be in a formatted or unformatted pattern

Note

If issued before mid-2011, an SSN has strong formatting where certain parts of the number must fall within certain ranges to be valid (but there's no checksum).

Pattern: Four functions look for SSNs in four different patterns:

  • Func_ssn finds SSNs with pre-2011 strong formatting that are formatted with dashes or spaces (ddd-dd-dddd OR ddd dd dddd)

  • Func_unformatted_ssn finds SSNs with pre-2011 strong formatting that are unformatted as nine consecutive digits (ddddddddd)

  • Func_randomized_formatted_ssn finds post-2011 SSNs that are formatted with dashes or spaces (ddd-dd-dddd OR ddd dd dddd)

  • Func_randomized_unformatted_ssn finds post-2011 SSNs that are unformatted as nine consecutive digits (ddddddddd)

Checksum: No

Definition:

A DLP policy is 85% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_ssn finds content that matches the pattern.

  • At least one of the following is true:

    • A keyword from Keyword_ssn is found.

    • The function Func_us_date finds a date in the right date format.

    • The function Func_us_address finds an address in the right format.

A DLP policy is 75% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_unformatted_ssn finds content that matches the pattern.

  • A keyword from Keyword_ssn is found.

  • At least one of the following is true:

    • The function Func_us_date finds a date in the right date format.

    • The function Func_us_address finds an address in the right format.

A DLP policy is 65% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_randomized_formatted_ssn finds content that matches the pattern.

  • The function Func_ssn does not find content that matches the pattern.

  • At least one of the following is true:

    • A keyword from Keyword_ssn is found.

    • The function Func_us_date finds a date in the right date format.

    • The function Func_us_address finds an address in the right format.

A DLP policy is 55% confident that it's detected this type of sensitive information if, within a proximity of 300 characters:

  • The function Func_randomized_unformatted_ssn finds content that matches the pattern.

  • A keyword from Keyword_ssn is found.

  • The function Func_unformatted_ssn does not find content that matches the pattern.

  • At least one of the following is true:

    • The function Func_us_date finds a date in the right date format.

    • The function Func_us_address finds an address in the right format.

<!-- U.S. Social Security Number (SSN) -->
<Entity id="a44669fe-0d48-453d-a9b1-2cc83f2cba77" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_ssn" />
        <Any minMatches="1">
          <Match idRef="Keyword_ssn" />
          <Match idRef="Func_us_date" />
          <Match idRef="Func_us_address" />
        </Any>
    </Pattern>
    <Pattern confidenceLevel="75">
        <IdMatch idRef="Func_unformatted_ssn" />
        <Match idRef="Keyword_ssn" />
        <Any minMatches="1">
          <Match idRef="Func_us_date" />
          <Match idRef="Func_us_address" />
        </Any>
    </Pattern>
    <Pattern confidenceLevel="65">
        <IdMatch idRef="Func_randomized_formatted_ssn" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Func_ssn" />
        </Any>
        <Any minMatches="1">
          <Match idRef="Keyword_ssn" />
          <Match idRef="Func_us_date" />
          <Match idRef="Func_us_address" />
        </Any>
    </Pattern>
    <Pattern confidenceLevel="55">
        <IdMatch idRef="Func_randomized_unformatted_ssn" />
        <Match idRef="Keyword_ssn" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Func_unformatted_ssn" />
        </Any>
        <Any minMatches="1">
          <Match idRef="Func_us_date" />
          <Match idRef="Func_us_address" />
        </Any>
    </Pattern>
</Entity>
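
To make the interplay of IdMatch, Match, and Any elements concrete, the sketch below evaluates two of the patterns from the entity above against a set of evidence names that were found near a candidate. It is a simplified model, not how Exchange evaluates rules: it ignores proximity, assumes minMatches defaults to 1 and maxMatches defaults to the number of child elements, and the function names `pattern_matches` and `best_confidence` are hypothetical.

```python
import xml.etree.ElementTree as ET
from typing import Optional

# The 85% and 65% patterns, copied from the entity definition above.
ENTITY_XML = """
<Entity id="a44669fe-0d48-453d-a9b1-2cc83f2cba77" patternsProximity="300" recommendedConfidence="75">
    <Pattern confidenceLevel="85">
        <IdMatch idRef="Func_ssn" />
        <Any minMatches="1">
          <Match idRef="Keyword_ssn" />
          <Match idRef="Func_us_date" />
          <Match idRef="Func_us_address" />
        </Any>
    </Pattern>
    <Pattern confidenceLevel="65">
        <IdMatch idRef="Func_randomized_formatted_ssn" />
        <Any minMatches="0" maxMatches="0">
          <Match idRef="Func_ssn" />
        </Any>
        <Any minMatches="1">
          <Match idRef="Keyword_ssn" />
          <Match idRef="Func_us_date" />
          <Match idRef="Func_us_address" />
        </Any>
    </Pattern>
</Entity>
"""


def pattern_matches(pattern: ET.Element, found: set) -> bool:
    """Check one Pattern element against the set of idRefs that were found."""
    for child in pattern:
        if child.tag in ("IdMatch", "Match"):
            if child.get("idRef") not in found:
                return False
        elif child.tag == "Any":
            refs = [m.get("idRef") for m in child.findall("Match")]
            hits = sum(ref in found for ref in refs)
            low = int(child.get("minMatches", "1"))         # assumed default
            high = int(child.get("maxMatches", len(refs)))  # assumed default
            if not low <= hits <= high:
                return False
    return True


def best_confidence(entity_xml: str, found: set) -> Optional[int]:
    """Return the highest confidenceLevel among matching patterns, or None."""
    entity = ET.fromstring(entity_xml.strip())
    levels = [
        int(p.get("confidenceLevel"))
        for p in entity.findall("Pattern")
        if pattern_matches(p, found)
    ]
    return max(levels, default=None)
```

With `found = {"Func_randomized_formatted_ssn", "Func_us_date"}` only the 65% pattern matches. Adding `"Func_ssn"` rules that pattern out through its `minMatches="0" maxMatches="0"` group, which is how the definition expresses "does not find content that matches the pattern".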

Keywords:

Keyword_ssn
Social Security
Social Security#
Soc Sec
SSN
SSNS
SSN#
SS#
SSID