How to Classify Sensitive Data

Use the information in this article to help you classify sensitive data.

Use the following archTIS guidelines and recommendations when classifying sensitive data.

Note that classification is a multi-step process.
Classification is not something that is setup for your entire environment and performed in one attempt.

Establish a Classification Schema - Start with what you know

Keep your classification schema simple
- Layer upon layer of classifications can lead to confusion
- Classifying data as "not-sensitive" is simple and often overlooked, but can be very useful
- Don't use complex naming conventions
Avoid over-labeling
- Use labels and terms users can understand and which are easily defined
Create a checklist for each data type
- Which data types are provided - Spirion data types (Social Security number, credit card number, phone number, etc.)?
- Which data types are missing and must be created (custom data types)?
- - Custom data types are created by you
  - Custom data type examples include: keywords, sensitive data definitions, regular expressions, dictionary data types, etc. such as: AWS Access Key Keyword - Pattern, CUI, EIN, ePHI, HSA Number, HSA Validator, IMEI, IPv4, IPv6, mother's maiden name, Asana client secret, etc.

Leverage Sensitive Data Definitions - Existing Spirion or Custom data types such as Social Security number, credit card number, phone number, IMEI, ABN, etc.
Below is an example of a custom data type called "PII" (personally identifiable information), which contains multiple native, Spirion-provided data types

PII data type example with definition logic

To start, apply classifications to the data in your Spirion database
- Examine the senstive data (SSN, CCN, etc.) in the files your scans have discovered and decide how the sensitive data should be classified
- Next, decide how to classify the files which contain the sensitive data
- Lastly, what are the logical remediation steps to take on this data? Redact, shred, notify users? Is classifying the data or file the final step?
Choose a group or department to test your classification schema on such as HR, Finance, a small database, or onedrive account, etc.
Decide on a reasonable level of error and understand thresholds

Understand the service tasks related to classifications and workflows
Start with easy operators - Equals, Greater Than
Apply classifications to Spirion database file records first, and later to the files themselves: the setting which controls this option in Scan Playbooks is shown in the screenshot below.
When the file itself is classified, the classification is embedded into the file's metadata and follows the file wherever it goes