Predefined data classes details
Data classes are assigned at the asset level and at the column level.
At asset level, the assigned data classes represent the data classes selected by the analysis, or manually, as the best match for a column.
At column level, data classes are assigned based on column name (scope column) or on column data (scope value) depending on the data class definition:
- Scope column
- Classification is based only on parsing and analysis of the column name, that is, on metadata alone. Depending on how well the column matches the data class, the classifier returns a confidence value between 0.0 and 1.0, where 0.0 means no match, and 1.0 means a perfect match. Alternatively, the classifier can return false instead of 0.0 and true instead of 1.0.
- Scope value
- Classification is based on analysis of the data values found in a column. Depending on whether the value matches the class, the classifier returns true or false. When all values are evaluated, the percentage of values of a column that are not null and match the data class represents the confidence of the data class for the column.
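As a minimal sketch of the scope value calculation described above, the following Python computes a confidence for one column. The function name and the choice to divide by the total number of values (nulls included) are assumptions made for illustration and are not taken from the product. The pattern used is the one listed for Alaska State Driver's License on this page.

```python
import re
from typing import Callable, Optional, Sequence


def value_scope_confidence(
    values: Sequence[Optional[str]],
    matches: Callable[[str], bool],
) -> float:
    """Return the share of values that are not null and match the data class.

    Assumption: the denominator is the total number of values in the column.
    """
    if not values:
        return 0.0
    hits = sum(1 for v in values if v is not None and matches(v))
    return hits / len(values)


# Pattern listed on this page for Alaska State Driver's License.
alaska = re.compile(r"^\d{7}$")

column = ["1234567", "7654321", None, "ABC"]
confidence = value_scope_confidence(column, lambda v: alaska.match(v) is not None)
print(confidence)  # 0.5
```

Two of the four values are non-null and match, so the sketch reports a confidence of 0.5 for this column.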
In addition to the confidence, the priority of the data class is taken into account when data classes are assigned. Unless stated otherwise in the data class details, the default priority for data classes with the matching scope value is 10. For data classes with the matching scope column, the default priority is 0.
Each predefined data class uses a specific type of classifier to apply classification logic:
- A Java classifier can evaluate the column name or the data values of a column to determine the data class (scope column or scope value).
- A Regex classifier evaluates the data values of a column by applying a regular expression to determine whether each value belongs to the data class (scope value).
- A Value list classifier evaluates the data values of a column based on a given list of valid values to determine the data class (scope value).
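The Regex and Value list classifier types can be pictured as simple true/false predicates over a single value. The sketch below is illustrative only: the helper names are hypothetical, and the short lists of province codes and names are sample subsets, not the lists that ship with the product.

```python
import re
from typing import Callable, Iterable


def regex_classifier(pattern: str) -> Callable[[str], bool]:
    """Build a predicate that is true when the value matches the pattern."""
    compiled = re.compile(pattern)
    return lambda value: compiled.search(value) is not None


def value_list_classifier(
    valid_values: Iterable[str], case_sensitive: bool
) -> Callable[[str], bool]:
    """Build a predicate that is true when the value is in the list."""
    if case_sensitive:
        exact = set(valid_values)
        return lambda value: value in exact
    folded = {v.casefold() for v in valid_values}
    return lambda value: value.casefold() in folded


# Pattern listed on this page for Manitoba Province Driver's License.
is_manitoba_dl = regex_classifier(r"^\d{9}$")

# Illustrative subsets only (assumption): the real lists are longer.
is_province_code = value_list_classifier(["QC", "ON", "BC"], case_sensitive=True)
is_province_name = value_list_classifier(["Quebec", "Ontario"], case_sensitive=False)
```

With these predicates, `is_province_code("qc")` is false because the list is treated as case-sensitive, while `is_province_name("QUEBEC")` is true.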
For more information about scopes, classifiers, and matching criteria, see Matching data.
Details of each data class
Find the classification type and scope, evaluation criteria, and an example for each predefined data class.
Account number
Scope
Value
Matching criteria
RegularExpression: .
Column name filter: ^acc(ount)?([ _-])?(num(ber)?|id|no(.)?){1}$
Sample data value
123456
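To see which column names the Account number filter accepts, the sketch below applies the filter exactly as printed above. Whether the product matches column names case-insensitively, and how the filter is combined with the value expression, is not stated on this page, so the helper name and the case-sensitive matching are assumptions.

```python
import re

# Column name filter as printed for the Account number data class.
ACCOUNT_FILTER = re.compile(r"^acc(ount)?([ _-])?(num(ber)?|id|no(.)?){1}$")


def column_name_passes(name: str) -> bool:
    """Return True when the column name satisfies the filter as printed."""
    return ACCOUNT_FILTER.search(name) is not None


for name in ["account_number", "acc-id", "accno", "acct_no", "customer_id"]:
    print(name, column_name_passes(name))
```

Names such as `account_number`, `acc-id`, and `accno` pass, while `acct_no` and `customer_id` do not.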
Address Line 1
Scope
Column
Matching criteria
Java class: com.ibm.infosphere.classification.impl.AddressLineClassifier
Data type: string
Data minimum length: 4
Data maximum length: 100
Column name filter: addr.{0,15}(1|one)$
Default priority: 12
Sample data value
Address Line 2
Scope
Column
Matching criteria
Java class: com.ibm.infosphere.classification.impl.AddressLineClassifier
Data type: string
Data minimum length: 4
Data maximum length: 100
Column name filter: addr.{0,15}(2|two)$
Default priority: 12
Sample data value
Address Line 3
Scope
Column
Matching criteria
Java class: com.ibm.infosphere.classification.impl.AddressLineClassifier
Data type: string
Data minimum length: 4
Data maximum length: 100
Column name filter: addr.{0,15}(3|three)$
Default priority: 12
Sample data value
Note: You must specify the data class names exactly. In the case of the Address Line 3 class name, two spaces exist between Address and Line.
Airport Code
Scope
Value
Matching criteria
List of airport codes; case-sensitive
Data type: string
Data minimum length: 3
Data maximum length: 3
Sample data value
Alabama State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[a-zA-Z]{1}\d{6}$|^\d{7}$
Data type: string
Data minimum length: 7
Data maximum length: 7
Sample data value
1234567
Alaska State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{7}$
Data type: string
Data minimum length: 7
Data maximum length: 7
Sample data value
1234567
Alberta Province Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{6}[-]?\d{3}$
Data type: numeric, string
Data minimum length: 9
Data maximum length: 10
Sample data value
123456-123
American Express Card
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.AMEXClassifier
Data type: numeric, string
Data minimum length: 16
Data maximum length: 18
Sample data value
3400-000000-00009
Arizona State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^([0-6]\d{2}|7[0-6]\d|77[0-2])([ -.]?)(\d{2})\2(\d{4})$|^[a-zA-Z]\d{8}$
Data type: string
Data minimum length: 9
Data maximum length: 11
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
A12345678
Arkansas State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{9}$
Data type: string
Data minimum length: 9
Data maximum length: 9
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
012345678
BIC
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.BICClassifier
Data type: string
Data minimum length: 9
Data maximum length: 9
Sample data value
DEUTDEDBDUE
Boolean
Scope
Value
Matching criteria
List of values: 0, 1, True, False, Yes, No
Data type: numeric or string
Default priority: 16
Sample data value
True
British Columbia Province Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{7}$
Data type: numeric, string
Data minimum length: 7
Data maximum length: 7
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
1234567
California State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[a-zA-Z]{1}[0-9]{7}$
Data type: string
Data minimum length: 8
Data maximum length: 8
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
C1234567
Canada Post Code
Scope
Value
Matching criteria
RegularExpression: ^[A-Z]\d[A-Z][ -]?\d[A-Z]\d$
Data type: string
Data minimum length: 6
Data maximum length: 6
Sample data value
H3A 0B1
Canada Province Code
Scope
Value
Matching criteria
List of Canada province codes; case-sensitive
Data type: string
Data minimum length: 2
Data maximum length: 2
Default priority: 14
Sample data value
QC
Canada Province Name
Scope
Value
Matching criteria
List of Canada province names; case-insensitive
Data type: string
Data minimum length: 2
Data maximum length: 25
Default priority: 12
Sample data value
Quebec
Canadian Social Insurance Number (SIN)
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.CanadianSINClassifier
Data type: numeric, string
Data minimum length: 9
Data maximum length: 20
Sample data value
046-454-286
City
Scope
Value
Matching criteria
List of city names; case-insensitive
Data type: string
Data minimum length: 2
Data maximum length: 58
Default confidence threshold: 50%
Default priority: 7
Sample data value
Los Angeles
Code
Scope
Column
Matching criteria
Java class: com.ibm.infosphere.classification.impl.CodeClassifier
Default priority: -10
Sample data value
Colorado State Driver’s License
Scope
Value
Matching criteria
RegularExpression: (?:(^[0-9]{2}-?[0-9]{3}-?[0-9]{4}$)|(^[a-zA-Z]{1}[0-9]{3,6}$))
Data type: string
Data minimum length: 4
Data maximum length: 11
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
12-345-2222
Color
Scope
Value
Matching criteria
List of colors; case-insensitive
Data type: string
Data minimum length: 3
Data maximum length: 41
Sample data value
Blue
Commercial and Government Entity Code
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.CAGECodeClassifier
Data type: string
Data minimum length: 5
Data maximum length: 5
Column name filter: cage|fscm|nscm|entity|code
Sample data value
1ASDY
Computer Host Name
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.HostNameClassifier
Data type: string
Data minimum length: 4
Data maximum length: 255
Sample data value
www.example.com
Connecticut State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[0-9]{9}$
Data type: numeric, string
Data minimum length: 9
Data maximum length: 9
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
123456789
Country Code
Scope
Value
Matching criteria
List of country codes; case-sensitive
Data type: string
Data minimum length: 2
Data maximum length: 3
Default priority: 13
Sample data value
USA
Country Name
Scope
Value
Matching criteria
List of country names; case-insensitive
Data type: string
Data minimum length: 2
Data maximum length: 50
Default priority: 12
Sample data value
India
Credit Card Expiration Date
Scope
Column
Matching criteria
Java class: com.ibm.infosphere.classification.impl.CreditCardExpDateAndValidationNumberClassifier
Sample data value
12/2018
Credit Card Number
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.CreditCardClassifier
Data type: numeric, string
Data minimum length: 13
Data maximum length: 20
Sample data value
4146 6643 9004 5458
Credit Card Validation Number
Scope
Column
Matching criteria
Java class: com.ibm.infosphere.classification.impl.CreditCardExpDateAndValidationNumberClassifier
Sample data value
1234
Currency
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.CurrencyClassifier
Data type: string
Data minimum length: 2
Data maximum length: 25
Sample data value
$12,345.67
Current Procedural Terminology
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.CPTClassifier
Data type: string
Data minimum length: 5
Data maximum length: 5
Column name filter: CPT|medical procedure code|medical procedure|medicalcode|current procedural terminology
Sample data value
Customer Number
Scope
Value
Matching criteria
RegularExpression: .
Column name filter: ^cust(omer)?([ _-])?(num(ber)?|id|no(.)?){1}$
Sample data value
3141596
Date
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.DateTimeClassifier
Sample data value
12-30-2015
Date of Birth
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.DOBClassifier
Column name filter: dob$|birth(day)?|geburtsdatum|na(issance|cimiento|scita)|urodzenia|(生ま(れた日)?|誕生日)|出生(年月)?
Sample data value
12-30-2015
Delaware State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[0-9]{1,7}$
Data type: numeric, string
Data minimum length: 1
Data maximum length: 7
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
1234567
Diners Club Card
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.DinersClubClassifier
Data type: numeric, string
Data minimum length: 15
Data maximum length: 18
Sample data value
5520111111111121
Discover Card
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.DiscoverClassifier
Data type: numeric, string
Data minimum length: 17
Data maximum length: 18
Sample data value
6220264390045758
Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[a-zA-Z0-9*-]{1,19}$
Data type: string
Data minimum length: 1
Data maximum length: 19
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
1234567
DUNS
Scope
Value
Matching criteria
RegularExpression: ^(\d{2})([ -]?)(\d{3})([ -]?)(\d{4})$
Data type: string
Data minimum length: 9
Data maximum length: 11
Column name filter: duns|universal number|universal_number
Sample data value
12-345-6789
Email Address
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.EmailClassifier
Data type: string
Data minimum length: 6
Data maximum length: 254
Sample data value
Employment Status
Scope
Value
Matching criteria
List of employment statuses; case-insensitive
Data type: string
Data minimum length: 2
Data maximum length: 50
Sample data value
employee
Ethnicity
Scope
Value
Matching criteria
List of ethnicities; case-insensitive
Data type: string
Data minimum length: 3
Data maximum length: 22
Sample data value
Hispanic
Eye Color
Scope
Value
Matching criteria
List of eye colors; case-insensitive
Data type: string
Data minimum length: 3
Data maximum length: 14
Column name filter: eye|eye_color|eyecolor
Sample data value
Hazel
First Name
Scope
Column
Matching criteria
Java class: com.ibm.infosphere.classification.impl.GNMFirstNameClassifier
Data type: string
Column name filter: ^(?i)(given|f(irst)?)([ _.-])?name$
Default confidence threshold: 50%
Default priority: 10
Sample data value
James
Florida State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[a-zA-Z]{1}[0-9]{3}-[0-9]{3}-[0-9]{2}-[0-9]{3}-[0-1]{1}$|^[a-zA-Z]{1}[0-9]{12}$
Data type: string
Data minimum length: 13
Data maximum length: 17
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
X123-123-33-229-0
Fortune 1000 Company
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.Fortune1000Classifier
Data type: string
Data minimum length: 2
Data maximum length: 50
Sample data value
French INSEE Number
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.FranceINSEEClassifier
Data type: numeric, string
Data minimum length: 15
Data maximum length: 15
Sample data value
151022A10204375
Gender
Scope
Value
Matching criteria
List of values: M, F, Male, Female
Data type: string
Data minimum length: 1
Data maximum length: 6
Default priority: 16
Sample data value
F
Geographic Coordinates
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.GeographicCoordinatesClassifier
Data type: string
Data minimum length: 3
Data maximum length: 44
Sample data value
49° 13" N; 1°10’00.012" E
Georgia State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^([0-6]\d{2}|7[0-6]\d|77[0-2])([ -.]?)(\d{2})\2(\d{4})$|^[0-9]{7,9}$
Data type: numeric, string
Data minimum length: 7
Data maximum length: 11
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
123-45-6789
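Several driver's license patterns on this page begin with the same alternative, which uses the backreference `\2` so that the second separator must repeat the first. The sketch below exercises only that first alternative, copied as printed; the constant and function names are hypothetical.

```python
import re

# First alternative shared by several driver's license patterns on this page.
# As printed, [ -.] is a character range from space to '.', so it admits
# characters such as '+' in addition to space, hyphen, and period.
SSN_STYLE = re.compile(r"^([0-6]\d{2}|7[0-6]\d|77[0-2])([ -.]?)(\d{2})\2(\d{4})$")


def matches_ssn_style(value: str) -> bool:
    """Return True when the value matches the nine-digit, three-part form."""
    return SSN_STYLE.match(value) is not None


for candidate in ["123-45-6789", "123456789", "123-45 6789", "773-45-6789"]:
    print(candidate, matches_ssn_style(candidate))
```

The values `123-45-6789` and `123456789` match. The value `123-45 6789` does not, because the separators differ, and `773-45-6789` does not, because the first group only allows 000 through 772.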
Germany Vehicle Registration Number
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.GermanyCarClassifier
Data type: string
Data minimum length: 4
Data maximum length: 9
Sample data value
BB-XY1066
Hair Color
Scope
Value
Matching criteria
List of hair colors; case-insensitive
Data type: string
Data minimum length: 3
Data maximum length: 14
Column name filter: hair|hair_color|haircolor
Sample data value
Black
Hawaii State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^([0-6]\d{2}|7[0-6]\d|77[0-2])([ -.]?)(\d{2})\2(\d{4})$|^[hH]{1}[0-9]{8}$
Data type: string
Data minimum length: 9
Data maximum length: 11
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
H12345678
Health Insurance Claim Number
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.HICNClassifier
Data type: string
Data minimum length: 6
Data maximum length: 15
Sample data value
WD-000-00-0000
Hobby/Leisure Activity
Scope
Value
Matching criteria
List of hobbies; case-insensitive
Data type: string
Data minimum length: 3
Data maximum length: 38
Column name filter: hobb(y$|ies$)|leisure([ _])activit(y$|ies$)
Sample data value
Tennis
Honorific
Scope
Value
Matching criteria
List of honorifics; case-insensitive
Data type: string
Data minimum length: 1
Data maximum length: 38
Sample data value
Mr
IBAN
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.IBANClassifier
Data type: string
Data minimum length: 14
Data maximum length: 42
Sample data value
GB87 BARC 2065 8244 9716 55
ICD-10
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.ICD10Classifier
Data minimum length: 3
Data maximum length: 7
Sample data value
D36.7
Idaho State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^([0-6]\d{2}|7[0-6]\d|77[0-2])([ -.]?)(\d{2})\2(\d{4})$|^[a-zA-Z]{2}[0-9]{6}[a-zA-Z]{1}$
Data type: string
Data minimum length: 9
Data maximum length: 11
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
AA123456X
Identifier
Scope
Column
Matching criteria
Java class: com.ibm.infosphere.classification.impl.IdentifierClassifier
Default priority: -10
Sample data value
Illinois State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[a-zA-Z]{1}[0-9]{3}-[0-9]{4}-[0-9]{4}$|^[a-zA-Z]{1}[0-9]{11}$
Data type: string
Data minimum length: 12
Data maximum length: 14
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
A123-4567-8999
INCO Terms (International Commercial Terms)
Scope
Value
Matching criteria
List of INCO terms; case-insensitive
Data type: string
Data minimum length: 3
Data maximum length: 3
Sample data value
FCA
Indiana State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[0-9]{4}-[0-9]{2}-[0-9]{4}$|^[a-zA-Z]{1}[0-9]{9}$|^[0-9]{10}$
Data type: string
Data minimum length: 10
Data maximum length: 12
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
1234-56-7890
Indicator
Scope
Column
Matching criteria
Java class: com.ibm.infosphere.classification.impl.IndicatorClassifier
Default priority: -10
Sample data value
Individual Taxpayer Identification Number (ITIN)
Scope
Value
Matching criteria
RegularExpression: ^(9\d{2})([ -]?)(?!93|89)([789][0-9])([ -]?)(\d{4})$
Data minimum length: 9
Data maximum length: 11
Column name filter: itin|tax
Sample data value
913-72-2222
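The ITIN expression above uses a negative lookahead, `(?!93|89)`, to reject two specific middle groups. The following sketch, with hypothetical names, runs the expression as printed against the sample value and a few near misses.

```python
import re

# Expression listed on this page for the ITIN data class.
ITIN = re.compile(r"^(9\d{2})([ -]?)(?!93|89)([789][0-9])([ -]?)(\d{4})$")


def looks_like_itin(value: str) -> bool:
    """Return True when the value matches the ITIN expression as printed."""
    return ITIN.match(value) is not None


for candidate in ["913-72-2222", "913722222", "913-93-2222", "813-72-2222"]:
    print(candidate, looks_like_itin(candidate))
```

The sample value `913-72-2222` and its unseparated form match. The value `913-93-2222` is rejected by the lookahead, and `813-72-2222` is rejected because the first group must start with 9.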
International Mobile Equipment Identity (IMEI)
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.IMEINumberClassifier
Data type: string
Data minimum length: 15
Data maximum length: 15
Column name filter: imei
Sample data value
490154203237518
International Securities Identification Number (ISIN)
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.ISINClassifier
Data type: numeric, string
Data minimum length: 10
Data maximum length: 20
Sample data value
GB0002634946
International Standard Book Number (ISBN)
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.ISBNClassifier
Data type: numeric, string
Data minimum length: 10
Data maximum length: 20
Sample data value
978 0 306 40615 7
International Standard Industrial Classification
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.ISICClassifier
Data type: string
Data minimum length: 5
Data maximum length: 5
Column name filter: (unsic)$|(isic)$|^((industr(ial|ies|y))[ ._-]?(code))$
Sample data value
C3319
Internet Protocol Address
Scope
Value
Matching criteria
RegularExpression: ^\s*(?:(?:25[0-5]|2[0-4][0-9]|[01]?[0-9][0-9]?).){3}(?:25[0-5]|2[0-4][0-9]|[01]?[0-9][0-9]?)\s*$
Data type: string
Data minimum length: 7
Data maximum length: 15
Sample data value
127.127.127.002
Internet Protocol Version 6 Address
Scope
Value
Matching criteria
RegularExpression: ^\s*((([0-9A-Fa-f]{1,4}:){7}([0-9A-Fa-f]{1,4}|:))|(([0-9A-Fa-f]{1,4}:){6}(:[0-9A-Fa-f]{1,4}|((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3})|:))|(([0-9A-Fa-f]{1,4}:){5}(((:[0-9A-Fa-f]{1,4}){1,2})|:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3})|:))|(([0-9A-Fa-f]{1,4}:){4}(((:[0-9A-Fa-f]{1,4}){1,3})|((:[0-9A-Fa-f]{1,4})?:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3}))|:))|(([0-9A-Fa-f]{1,4}:){3}(((:[0-9A-Fa-f]{1,4}){1,4})|((:[0-9A-Fa-f]{1,4}){0,2}:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3}))|:))|(([0-9A-Fa-f]{1,4}:){2}(((:[0-9A-Fa-f]{1,4}){1,5})|((:[0-9A-Fa-f]{1,4}){0,3}:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3}))|:))|(([0-9A-Fa-f]{1,4}:){1}(((:[0-9A-Fa-f]{1,4}){1,6})|((:[0-9A-Fa-f]{1,4}){0,4}:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3}))|:))|(:(((:[0-9A-Fa-f]{1,4}){1,7})|((:[0-9A-Fa-f]{1,4}){0,5}:((25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)(.(25[0-5]|2[0-4]\d|1\d\d|[1-9]?\d)){3}))|:)))(%.+)?\s*$
Data type: string
Data minimum length: 3
Data maximum length: 39
Sample data value
fe80:0:0:0:204:61ff:fe9d:f156
Iowa State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^([0-6]\d{2}|7[0-6]\d|77[0-2])([ -.]?)(\d{2})\2(\d{4})$|^[0-9]{3}[a-zA-Z]{2}[0-9]{4}$
Data type: string
Data minimum length: 9
Data maximum length: 11
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
123XX4567
Ireland Eircode
Scope
Value
Matching criteria
RegularExpression: ^[ACD-FHKNPRTV-Y]\d[\dW][ -]?[\dACD-FHKNPRTV-Y]{4}$
Data type: string
Data minimum length: 7
Data maximum length: 8
Sample data value
D02 NY52
ISO 3166-2 Code
Scope
Value
Matching criteria
List of ISO 3166-2 codes of states or provinces; case-insensitive
Data type: string
Data minimum length: 4
Data maximum length: 6
Sample data value
IN-KA
Italian Fiscal Code
Scope
Value
Matching criteria
RegularExpression: ^([A-Z]{3})([ -]?)([A-Z]{3})\2([0-9L-NP-V]{2})([A-EHLMPRST])([0-9LNP-V]{2})\2([A-ILMZ][0-9L-NP-V]{3})([A-Z])$
Data type: string
Data minimum length: 16
Data maximum length: 16
Sample data value
MRTMTT25D09F205Z
Japan Credit Bureau (JCB)
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.JapanCBClassifier
Data type: numeric, string
Data minimum length: 17
Data maximum length: 18
Sample data value
35283095185620637
Kansas State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^([0-6]\d{2}|7[0-6]\d|77[0-2])([ -.]?)(\d{2})\2(\d{4})$|^[kK]{1}[0-9]{2}-[0-9]{2}-[0-9]{4}$|^[kK]{1}[0-9]{8}$
Data type: string
Data minimum length: 9
Data maximum length: 11
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
K94-12-3456
Kentucky State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[a-zA-Z]{1}[0-9]{2}-[0-9]{3}-[0-9]{3}$|^[a-zA-Z]{1}[0-9]{8}$
Data type: string
Data minimum length: 9
Data maximum length: 11
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
A23-145-678
Language Code or Name
Scope
Value
Matching criteria
List of languages; case-insensitive
Data type: string
Data minimum length: 2
Data maximum length: 37
Column name filter: lang|locale|language
Sample data value
EN
Last Name
Scope
Column
Matching criteria
Java class: com.ibm.infosphere.classification.impl.GNMLastNameClassifier
Data type: string
Column name filter: ^l(ast)?([ -])?name$|surname$|family(([ -])?)name$
Default confidence threshold: 50%
Default priority: 7
Sample data value
Smith
Latitude
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.LatitudeClassifier
Data type: numeric, string
Data minimum length: 1
Data maximum length: 20
Column name filter: lat$|lat_|_lat$|latitud(ine|e|o)?|breitengrad|breddekreds|breedtegraad|breiddegrad|breiddargráða|enlem|πλάτος|широт(ы|a)|קו רוחב|عرض جغرافي|緯度|纬度
Sample data value
49° 13"
Legal Marital/Civil Status.
Scope
Value
Matching criteria
List of marital statuses; case-insensitive
Data type: string
Data minimum length: 6
Data maximum length: 23
Sample data value
Single
Note: You must specify the data class names exactly. In the case of the Legal Marital/Civil Status. class name, a full stop must be used at the end of the name.
Longitude
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.LongitudeClassifier
Data type: numeric, string
Data minimum length: 1
Data maximum length: 21
Column name filter: long$|long_|_long$|longitud(ine|e|o)?|längengrad|laengengrad|længdekreds|laengdekreds|lengtegraad|lengdegrad|lengdargráða|boylam|μήκος|долгот(ы|a)|קו אורך|طول جغرافي|経度|经度
Sample data value
1°10’00.012" E
Louisiana State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^00[0-9]{7}$
Data type: string
Data minimum length: 9
Data maximum length: 9
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
003456789
Mac Address
Scope
Value
Matching criteria
RegularExpression: ^(?:[0-9A-Fa-f]{2}([:-]))(?:[0-9A-Fa-f]{2}\1){4}[0-9A-Fa-f]{2}$|^(?:[0-9A-Fa-f]{4}.){2}[0-9A-Fa-f]{4}$
Data minimum length: 14
Data maximum length: 17
Sample data value
12:34:56:78:9F
Maine State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[0-9]{7}$
Data type: numeric, string
Data minimum length: 7
Data maximum length: 7
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
1234567
Manitoba Province Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{9}$
Data type: numeric, string
Data minimum length: 9
Data maximum length: 9
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
123456789
Maryland State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[a-zA-Z]{1}-[0-9]{3}-[0-9]{3}-[0-9]{3}-[0-9]{3}$|^[a-zA-Z]{1}[0-9]{12}$
Data type: string
Data minimum length: 13
Data maximum length: 17
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
A-123-456-789-999
Massachusetts State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^([0-6]\d{2}|7[0-6]\d|77[0-2])([ -.]?)(\d{2})\2(\d{4})$|^[sS]{1}[0-9]{8}$
Data type: string
Data minimum length: 9
Data maximum length: 11
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
S12345678
Master Card
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.MasterCardClassifier
Data type: numeric, string
Data minimum length: 17
Data maximum length: 18
Sample data value
5285696282092972
Michigan State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[a-zA-Z]{1}\d{12}$|^[a-zA-Z]{1} [0-9]{3} [0-9]{3} [0-9]{3} [0-9]{3}$
Data type: string
Data minimum length: 13
Data maximum length: 17
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
A012345678912
Middle Name
Scope
Column
Matching criteria
Java class: com.ibm.infosphere.classification.impl.GNMFirstNameClassifier
Data type: string
Column name filter: ^m(iddle)?([ _.-])?name$
Default confidence threshold: 50%
Default priority: 10
Sample data value
James
Minnesota State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[a-zA-Z]{1}\d{12}$|^[a-zA-Z]{1}-[0-9]{3}-[0-9]{3}-[0-9]{3}-[0-9]{3}$
Data type: string
Data minimum length: 13
Data maximum length: 17
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
A012345678912
Missouri State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{9}$|^[a-zA-Z]{1}[0-9]{5,9}$
Data type: string
Data minimum length: 6
Data maximum length: 10
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
012345678
Montana State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[a-zA-Z]{9}$|^\d{13}$
Data type: string
Data minimum length: 9
Data maximum length: 13
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
0123456789123
Month
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.MonthClassifier
Data type: numeric, string
Data minimum length: 1
Data maximum length: 10
Sample data value
January
Name Suffix
Scope
Value
Matching criteria
List of name suffixes; case-insensitive
Data type: string
Data minimum length: 2
Data maximum length: 16
Sample data value
PhD
Nebraska State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[a-zA-Z]{1}[0-9]{3,8}$
Data type: string
Data minimum length: 4
Data maximum length: 9
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
C12345678
Nevada State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[xX]{1}\d{8}$|^\d{10}$|^\d{12}$
Data type: string
Data minimum length: 9
Data maximum length: 12
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
0123456789
New Brunswick Province Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[0-9]{1,7}$
Data type: string
Data minimum length: 1
Data maximum length: 7
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
0123456
New Hampshire State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{2}[a-zA-Z]{3}\d{5}$
Data type: string
Data minimum length: 10
Data maximum length: 10
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
01ABC56789
New Jersey State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[a-zA-Z]{1}\d{14}$|^[a-zA-Z]{1}\d{4} \d{5} \d{5}$|^[a-zA-Z]{1}\d{4}-\d{5}-\d{5}$
Data type: string
Data minimum length: 15
Data maximum length: 17
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
A01234567891234
New Mexico State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{9}$
Data type: string
Data minimum length: 9
Data maximum length: 9
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
012345678
New York State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{9}$|^[a-zA-Z]{1}\d{18}$|^\d{3} \d{3} \d{3}$
Data type: string
Data minimum length: 9
Data maximum length: 19
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
012345678
Newfoundland and Labrador Province State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[a-zA-Z]{1}\d{9}$
Data type: string
Data minimum length: 10
Data maximum length: 10
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
A012345678
NoClassDetected
Scope
Matching criteria
Neither the column name nor the column values match any of the available data classes.
Sample data value
North Carolina State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[0-9]{1,12}$
Data type: string
Data minimum length: 1
Data maximum length: 12
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
123456789999
North Dakota State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^([0-6]\d{2}|7[0-6]\d|77[0-2])([ -.]?)(\d{2})\2(\d{4})$|^[a-zA-Z]{3}-[0-9]{2}-[0-9]{4}$|^[a-zA-Z]{3}[0-9]{6}$
Data type: string
Data minimum length: 9
Data maximum length: 11
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
XYZ-11-2222
Nova Scotia Province Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[a-zA-Z]{2}[0-9]{6}$
Data type: string
Data minimum length: 8
Data maximum length: 8
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
XY123456
Ohio State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[a-zA-Z]{2}[0-9]{6}$
Data type: string
Data minimum length: 8
Data maximum length: 8
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
XY123456
Oklahoma State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^([0-6]\d{2}|7[0-6]\d|77[0-2])([ -.]?)(\d{2})\2(\d{4})$|^[a-zA-Z]{1}[0-9]{9}$
Data type: string
Data minimum length: 9
Data maximum length: 11
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
X123456789
Ontario Province Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^([a-zA-Z]{1}\d{4})([-]?)(\d{5})\2(\d{5})$
Data type: string
Data minimum length: 15
Data maximum length: 17
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
A1234-56123-99999
Oregon State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{1,9}$
Data type: string
Data minimum length: 1
Data maximum length: 9
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
012345678
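As a rough illustration of how value-scope criteria such as these can combine, the following sketch applies the Oregon length limits and regular expression to each value and derives a column confidence. The function names are hypothetical, and the choice of non-null values as the denominator is an assumption; the product's actual computation may differ.

```python
import re

# Matching criteria copied from the Oregon State Driver's License entry.
PATTERN = re.compile(r"^\d{1,9}$")
MIN_LEN, MAX_LEN = 1, 9


def value_matches(value: str) -> bool:
    """Return True if one value satisfies the length limits and the regex."""
    return MIN_LEN <= len(value) <= MAX_LEN and PATTERN.match(value) is not None


def column_confidence(values) -> float:
    """Share of non-null values that match (assumed denominator: non-null values)."""
    non_null = [v for v in values if v is not None]
    if not non_null:
        return 0.0
    return sum(value_matches(v) for v in non_null) / len(non_null)


column = ["012345678", "42", None, "A1234", "1234567890"]
print(column_confidence(column))  # 2 of the 4 non-null values match
```

In this sketch, "A1234" fails the regular expression and "1234567890" exceeds the maximum length, so only half of the non-null values count toward the confidence.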
Organization Name
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.GNMOrganizationClassifier
Data type: string
Default confidence threshold: 60%
Default priority: 7
Sample data value
IBM
Passport Number
Scope
Value
Matching criteria
RegularExpression: ^[A-Z0-9<]{9}[0-9]{1}[A-Z]{3}[0-9]{7}[A-Z]{1}[0-9]{7}[A-Z0-9<]{14}[0-9]{2}$
Data type: string
Data minimum length: 6
Data maximum length: 254
Sample data value
L898902C<3UTO6908061F9406236ZE184226B<<<<<14
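The sample value is 44 characters long and follows a fixed sequence of segments. As a sketch, assuming the expression reads as eight bracketed character classes with the quantifiers 9, 1, 3, 7, 1, 7, 14, and 2, the following splits a value into those segments. The capture groups and the function name are added for illustration only and are not part of the data class definition.

```python
import re

# Assumed reading of the Passport Number expression, one capture group per segment.
PASSPORT = re.compile(
    r"^([A-Z0-9<]{9})([0-9]{1})([A-Z]{3})([0-9]{7})"
    r"([A-Z]{1})([0-9]{7})([A-Z0-9<]{14})([0-9]{2})$"
)


def split_passport_value(value: str):
    """Return the eight segments of a matching value, or None if it does not match."""
    match = PASSPORT.match(value)
    return list(match.groups()) if match else None


segments = split_passport_value("L898902C<3UTO6908061F9406236ZE184226B<<<<<14")
print(segments)
# ['L898902C<', '3', 'UTO', '6908061', 'F', '9406236', 'ZE184226B<<<<<', '14']
```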
Pennsylvania State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{8}$|^\d{2} \d{3} \d{3}$
Data type: string
Data minimum length: 8
Data maximum length: 10
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
01234567
Percentage
Scope
Value
Matching criteria
RegularExpression: ^\(?[+-]? ?[0-9]{1,10}[,.]?[0-9]{0,10} ?(%|percent|pct)\)?$
Data type: string
Data minimum length: 2
Data maximum length: 25
Sample data value
45%
Person Name
Scope
Column
Matching criteria
Java class: com.ibm.infosphere.classification.impl.GNMFullNameClassifier
Data type: string
Column name filter: _?name|नाम|名称|nom|nome|όνομα|nomine|имя|이름|име|naam
Default confidence threshold: 50%
Default priority: 7
Sample data value
John Doe
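To make the column name filter more concrete, here is a minimal sketch that tests column names against the Person Name filter. It assumes the filter is applied as a case-insensitive substring search; this section does not state how the product applies it, and the function name is hypothetical.

```python
import re

# Column name filter copied from the Person Name entry.
# Assumption: case-insensitive search anywhere in the column name.
NAME_FILTER = re.compile(
    r"_?name|नाम|名称|nom|nome|όνομα|nomine|имя|이름|име|naam",
    re.IGNORECASE,
)


def column_name_passes(column_name: str) -> bool:
    """Return True if the filter matches somewhere in the column name."""
    return NAME_FILTER.search(column_name) is not None


for name in ["CUSTOMER_NAME", "nom_client", "имя_клиента", "order_id"]:
    print(name, column_name_passes(name))
```

Under this substring assumption, a column such as nominal_rate would also pass because it contains "nom", which is one reason a name filter alone is not a reliable indicator of the data class.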
Political Party
Scope
Value
Matching criteria
List of political parties; case-insensitive
Data type: string
Data minimum length: 2
Data maximum length: 99
Column name filter: politic
Sample data value
PDP
Prince Edward Island Province State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{6}$
Data type: numeric, string
Data minimum length: 6
Data maximum length: 6
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
123456
Quantity
Scope
Column
Matching criteria
Java class: com.ibm.infosphere.classification.impl.QuantityClassifier
Default priority: -10
Sample data value
100
Quebec Province Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^([a-zA-Z]{1}\d{4})([-]?)(\d{6})(\2)(\d{2})$
Data type: string
Data minimum length: 13
Data maximum length: 15
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
A1234-222222-00
Relationship
Scope
Value
Matching criteria
List of relationship types; case-insensitive
Data type: string
Data minimum length: 3
Data maximum length: 26
Sample data value
Friendship
Religion
Scope
Value
Matching criteria
List of religions; case-insensitive
Data type: string
Data minimum length: 3
Data maximum length: 19
Sample data value
Christian
Rhode Island State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{7}$|^[vV]{1}\d{6}$
Data type: string
Data minimum length: 7
Data maximum length: 7
Sample data value
0123456
Routing Transit Number
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.RTNClassifier
Data type: numeric, string
Data minimum length: 9
Data maximum length: 9
Sample data value
121000358
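The logic of RTNClassifier is not described in this section. As an illustration of the kind of check that a nine-digit routing number allows, the following sketch applies the publicly documented ABA checksum. That the predefined classifier performs exactly this check is an assumption, and the function name is hypothetical.

```python
import re


def rtn_checksum_ok(value: str) -> bool:
    """ABA checksum: 3*(d1+d4+d7) + 7*(d2+d5+d8) + (d3+d6+d9) must be a multiple of 10."""
    if re.fullmatch(r"[0-9]{9}", value) is None:
        return False
    d = [int(c) for c in value]
    total = (
        3 * (d[0] + d[3] + d[6])
        + 7 * (d[1] + d[4] + d[7])
        + (d[2] + d[5] + d[8])
    )
    return total % 10 == 0


print(rtn_checksum_ok("121000358"))  # True: the weighted sum is 70
print(rtn_checksum_ok("121000359"))  # False: the weighted sum is 71
```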
Saskatchewan Province State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{8}$
Data type: string
Data minimum length: 8
Data maximum length: 8
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
12345678
South Carolina State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{9}$
Data type: string
Data minimum length: 9
Data maximum length: 9
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
123456789
South Dakota State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^([0-6]\d{2}|7[0-6]\d|77[0-2])([-.]?)(\d{2})\2(\d{4})$|^\d{6}$|^\d{8}$
Data type: string
Data minimum length: 6
Data maximum length: 11
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
123456
Spanish Fiscal Identification Number
Scope
Value
Matching criteria
RegularExpression: (X?)[ -]?(\d{7})[ -]?([A-HJ-NP-TV-Z])
Data type: string
Data minimum length: 8
Data maximum length: 11
Sample data value
3124124N
State/Province Name
Scope
Value
Matching criteria
List of state and province names; case-insensitive
Data type: string
Data minimum length: 2
Data maximum length: 43
Sample data value
San Salvador
Temperature
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.TemperatureClassifier
Data type: string
Data minimum length: 2
Data maximum length: 25
Sample data value
20°C
Tennessee State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{8,9}$
Data type: string
Data minimum length: 8
Data maximum length: 9
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
123456789
Texas State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{8}$
Data type: string
Data minimum length: 8
Data maximum length: 8
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
12345678
Text
Scope
Column
Matching criteria
Java class: com.ibm.infosphere.classification.impl.TextClassifier
Default priority: -10
Sample data value
Put your TV viewing into overdrive with scenes that jump off your screen when you add 3D HDTVs to your home-theater system.
UK National Insurance Number
Scope
Value
Matching criteria
RegularExpression: ^([A-CEGHJ-PR-TW-Z][A-CEGHJ-NPR-TW-Z])([ -.]?)(\d{2})\2?(\d{2})\2?(\d{2})(\2([ABCD]))?$
Data type: string
Data minimum length: 8
Data maximum length: 13
Sample data value
CA 123456 A
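The backreference \2 in this expression ties later separators to the first one. The sketch below runs the expression as written against a few hand-made values to show the effect; the helper name is hypothetical and the test values are invented for illustration.

```python
import re

# Expression copied from the UK National Insurance Number entry.
NINO = re.compile(
    r"^([A-CEGHJ-PR-TW-Z][A-CEGHJ-NPR-TW-Z])([ -.]?)(\d{2})\2?(\d{2})\2?(\d{2})(\2([ABCD]))?$"
)


def is_nino_shaped(value: str) -> bool:
    """Return True if the value matches the expression."""
    return NINO.match(value) is not None


# Group 2 captures the first separator; \2 then requires the same character
# wherever a later separator appears.
print(is_nino_shaped("CA 123456 A"))    # True
print(is_nino_shaped("CA-12-34-56-A"))  # True
print(is_nino_shaped("CA 12-34-56 A"))  # False: mixed separators
print(is_nino_shaped("QQ123456A"))      # False: Q is not an allowed prefix letter
# Note: inside the brackets, " -." is a range from space to period, so
# characters such as "#" are also accepted as the separator.
print(is_nino_shaped("CA#123456#A"))    # True
```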
UK Post Code
Scope
Value
Matching criteria
RegularExpression: ^[ABD-HJLNP-UW-Z][A-Z]?\d[A-Z\d]?[ -]?\d[ABD-HJLNP-UW-Z]{2}$
Data type: string
Data minimum length: 5
Data maximum length: 8
Sample data value
L1 8LW
UK Province Code
Scope
Value
Matching criteria
List of UK province codes; case-sensitive
Data type: string
Data minimum length: 2
Data maximum length: 3
Sample data value
HAW
Uniform Resource Locator
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.URLClassifier
Data type: string
Data minimum length: 10
Data maximum length: 1000
Sample data value
Universal Product Code (UPC)
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.UPCClassifier
Data type: numeric, string
Data minimum length: 12
Data maximum length: 12
Sample data value
123456789999
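The sample value happens to carry a valid UPC-A check digit. The sketch below shows the standard UPC-A check digit calculation; whether UPCClassifier applies this check is an assumption, and the function names are hypothetical.

```python
import re


def upc_a_check_digit(first_eleven: str) -> int:
    """Compute the UPC-A check digit from the first 11 digits."""
    digits = [int(c) for c in first_eleven]
    odd_positions = sum(digits[0::2])   # positions 1, 3, ..., 11
    even_positions = sum(digits[1::2])  # positions 2, 4, ..., 10
    return (10 - (3 * odd_positions + even_positions) % 10) % 10


def upc_a_valid(value: str) -> bool:
    """Return True for 12 digits whose last digit equals the computed check digit."""
    if re.fullmatch(r"[0-9]{12}", value) is None:
        return False
    return upc_a_check_digit(value[:11]) == int(value[11])


print(upc_a_valid("123456789999"))  # True: 3*34 + 29 = 131, so the check digit is 9
print(upc_a_valid("123456789998"))  # False
```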
US County
Scope
Value
Matching criteria
List of US county names; case-insensitive
Data type: string
Data minimum length: 3
Data maximum length: 35
Default priority: 8
Sample data value
Adams
US Employer Identification Number
Scope
Value
Matching criteria
RegularExpression: ^(0[1-6]|1[0-6]|2[0-7]|[35][0-9]|[468][0-8]|7[1-7]|9[0-589])[ -]?\d{7}$
Data type: string
Data minimum length: 9
Data maximum length: 10
Column name filter: EMPLOYER|EIN
Sample data value
99-1234567
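The first alternation in this expression restricts the two-digit prefix. A minimal sketch, using the expression as written, lists which prefixes it accepts; the helper name and the test values are illustrative only.

```python
import re

# Expression copied from the US Employer Identification Number entry.
EIN = re.compile(
    r"^(0[1-6]|1[0-6]|2[0-7]|[35][0-9]|[468][0-8]|7[1-7]|9[0-589])[ -]?\d{7}$"
)


def is_ein_shaped(value: str) -> bool:
    """Return True if the value matches the expression."""
    return EIN.match(value) is not None


# Enumerate every two-digit prefix and keep the ones the expression accepts.
accepted_prefixes = [
    f"{p:02d}" for p in range(100) if is_ein_shaped(f"{p:02d}-1234567")
]
print(len(accepted_prefixes), accepted_prefixes)

print(is_ein_shaped("99-1234567"))  # True: the sample value
print(is_ein_shaped("07-1234567"))  # False: 07 is not an accepted prefix
```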
US National Drug Code
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.USNDCClassifier
Data type: string
Data minimum length: 10
Data maximum length: 20
Column name filter: DRUG|NDC
Sample data value
1234-5678-90
US Phone Number
Scope
Value
Matching criteria
RegularExpression: ^(\+?1\s*[-/.]?)?((\(\d{3}\))|(\d{3}))\s*[-/.]?\s*(\d{3})\s*[-/.]?\s*(\d{4})\s*(([xX]|[eE][xX][tT]).?\s*(\d+))*$
Data type: string
Data minimum length: 9
Data maximum length: 16
Sample data value
1 (234) 567-8901
US Social Security Number
Scope
Value
Matching criteria
RegularExpression: ^([1-578]\d{2}|0[1-9]\d|00[1-9]|6[0-57-9]\d|66[0-57-9])([ -.]?)([1-9]\d|0[1-9])\2([1-9]\d{3}|0[1-9]\d{2}|00[1-9]\d|000[1-9])$
Data type: numeric, string
Data minimum length: 9
Data maximum length: 11
Sample data value
123-45-6789
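The alternations in this expression exclude several number ranges: area numbers 000, 666, and 900 to 999, group number 00, and serial number 0000. The following sketch exercises the expression against hand-made values to show those exclusions; the helper name and the values are illustrative.

```python
import re

# Expression copied from the US Social Security Number entry.
SSN = re.compile(
    r"^([1-578]\d{2}|0[1-9]\d|00[1-9]|6[0-57-9]\d|66[0-57-9])"
    r"([ -.]?)([1-9]\d|0[1-9])\2"
    r"([1-9]\d{3}|0[1-9]\d{2}|00[1-9]\d|000[1-9])$"
)


def is_ssn_shaped(value: str) -> bool:
    """Return True if the value matches the expression."""
    return SSN.match(value) is not None


candidates = [
    "123-45-6789",  # sample value
    "123456789",    # no separators
    "000-45-6789",  # area 000 is excluded
    "666-45-6789",  # area 666 is excluded
    "900-45-6789",  # areas starting with 9 are excluded
    "123-00-6789",  # group 00 is excluded
    "123-45-0000",  # serial 0000 is excluded
    "123 45-6789",  # second separator must repeat the first one
]
for candidate in candidates:
    print(candidate, is_ssn_shaped(candidate))
```

Only the first two candidates match.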
US Social Security Number Last 4
Scope
Value
Matching criteria
RegularExpression: ^([1-9]\d{3}|0[1-9]\d{2}|00[1-9]\d|000[1-9])$
Data minimum length: 4
Data maximum length: 4
Column name filter: ssn(4)?$|(ssn|social(.?security)?|socsec)(.*4)?
Sample data value
US Standard Industrial Classification
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.USSICClassifier
Data minimum length: 3
Data maximum length: 4
Column name filter: SIC|USSIC|Standard Industrial Classification
Sample data value
1234
US State Capital Name
Scope
Value
Matching criteria
List of US state capital names; case-insensitive
Data type: string
Data minimum length: 5
Data maximum length: 14
Sample data value
Montgomery
US State Code
Scope
Value
Matching criteria
List of US state codes; case-sensitive
Data type: string
Data minimum length: 2
Data maximum length: 2
Default priority: 14
Sample data value
DE
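The difference between case-sensitive and case-insensitive value lists can be sketched as follows. The three codes are an illustrative subset, not the list shipped with the data class, and the function name is hypothetical.

```python
# Illustrative subset only; the predefined list is not reproduced in this section.
STATE_CODES = {"DE", "MA", "NY"}


def matches_value_list(value: str, valid_values, case_sensitive: bool) -> bool:
    """Return True if the value is in the list, honoring the case-sensitivity setting."""
    if case_sensitive:
        return value in valid_values
    return value.casefold() in {v.casefold() for v in valid_values}


print(matches_value_list("DE", STATE_CODES, case_sensitive=True))   # True
print(matches_value_list("de", STATE_CODES, case_sensitive=True))   # False
print(matches_value_list("de", STATE_CODES, case_sensitive=False))  # True
```

With a case-sensitive list such as this one, a lowercase column of state codes would not count toward the confidence of the data class.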
US State Name
Scope
Value
Matching criteria
List of US state names; case-insensitive
Data type: string
Data minimum length: 4
Data maximum length: 20
Default priority: 12
Sample data value
Massachusetts
US Street Name
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.StreetClassifier
Data type: string
Data minimum length: 7
Data maximum length: 50
Sample data value
8475 NW St
US Zip Code
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.USZipCodeClassifier
Data type: numeric, string
Data minimum length: 5
Data maximum length: 10
Sample data value
02201-1020
Utah State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{4,9}$
Data type: numeric, string
Data minimum length: 4
Data maximum length: 9
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
123456789
Vehicle Identification Number (VIN)
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.VehicleIdNumber
Data type: string
Data minimum length: 17
Data maximum length: 17
Sample data value
1JCCM85E5BT001312
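The ninth character of the sample value is consistent with the North American VIN check digit scheme. The sketch below implements that scheme for illustration; the internals of the VehicleIdNumber class are not documented here, so treating it as equivalent is an assumption, and all names in the code are hypothetical.

```python
import re

# Transliteration values for letters (I, O, and Q are not used in VINs).
VALUES = dict(zip(
    "ABCDEFGHJKLMNPRSTUVWXYZ",
    [1, 2, 3, 4, 5, 6, 7, 8, 1, 2, 3, 4, 5, 7, 9, 2, 3, 4, 5, 6, 7, 8, 9],
))
VALUES.update({str(i): i for i in range(10)})

# Position weights; position 9 (the check digit itself) has weight 0.
WEIGHTS = [8, 7, 6, 5, 4, 3, 2, 10, 0, 9, 8, 7, 6, 5, 4, 3, 2]


def vin_check_digit(vin: str) -> str:
    """Weighted sum modulo 11; a remainder of 10 is written as X."""
    remainder = sum(VALUES[c] * w for c, w in zip(vin, WEIGHTS)) % 11
    return "X" if remainder == 10 else str(remainder)


def vin_valid(value: str) -> bool:
    """Return True for 17 allowed characters whose 9th character is the check digit."""
    vin = value.upper()
    if re.fullmatch(r"[A-HJ-NPR-Z0-9]{17}", vin) is None:
        return False
    return vin_check_digit(vin) == vin[8]


print(vin_valid("1JCCM85E5BT001312"))  # True: weighted sum 214, and 214 mod 11 is 5
```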
Vermont State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{8}$|^\d{7}A$
Data type: string
Data minimum length: 8
Data maximum length: 8
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
1234567A
Virginia State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^([0-6]\d{2}|7[0-6]\d|77[0-2])([-.]?)(\d{2})\2(\d{4})$|^[A-Za-z]{1}\d{8}$
Data type: string
Data minimum length: 9
Data maximum length: 11
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
A12345678
VISA Card
Scope
Value
Matching criteria
Java class: com.ibm.infosphere.classification.impl.VisaClassifier
Data type: numeric, string
Data minimum length: 17
Data maximum length: 18
Sample data value
4024007121595481
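Card numbers are commonly validated with the Luhn checksum, and the sample value passes it. The following sketch combines a leading-digit test with the Luhn check. It is not taken from VisaClassifier, whose logic is not described in this section, and the function names are hypothetical.

```python
import re


def luhn_ok(number: str) -> bool:
    """Luhn checksum: double every second digit from the right, sum, and test mod 10."""
    total = 0
    for index, char in enumerate(reversed(number)):
        digit = int(char)
        if index % 2 == 1:
            digit *= 2
            if digit > 9:
                digit -= 9
        total += digit
    return total % 10 == 0


def looks_like_visa(value: str) -> bool:
    """Assumed criteria: digits only, leading 4, and a valid Luhn checksum."""
    compact = value.replace(" ", "").replace("-", "")
    if re.fullmatch(r"4[0-9]+", compact) is None:
        return False
    return luhn_ok(compact)


print(looks_like_visa("4024007121595481"))  # True: the Luhn sum is 50
print(looks_like_visa("4024007121595482"))  # False: the Luhn sum is 51
```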
Washington DC State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{7}$|^\d{9}$
Data type: numeric, string
Data minimum length: 7
Data maximum length: 9
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
1234567
Washington State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^(?=.*[A-Za-z]{2})([a-zA-Z]{2}[A-Za-z*]{5}\d{3}[A-Za-z0-9]{2})$
Data type: string
Data minimum length: 12
Data maximum length: 12
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
DOE**MJ501A1
West Virginia State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[a-zA-Z]{1}\d{6}$|^\d{7}$
Data type: string
Data minimum length: 7
Data maximum length: 7
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
A123456
Wisconsin State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^[a-zA-Z]{1}\d{3}-\d{4}-\d{4}-\d{2}$
Data type: string
Data minimum length: 17
Data maximum length: 17
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
Z123-1234-5678-12
Wyoming State Driver’s License
Scope
Value
Matching criteria
RegularExpression: ^\d{6}-\d{3}$
Data type: string
Data minimum length: 10
Data maximum length: 10
Column name filter: d(.)?l(.)?([ -]?(number|no(.)?))?$|driv(ing|er(s|'s)?)[ -]license|license
Sample data value
123456-123