How to Classify Sensitive Data

Use the information in this article to help you classify sensitive data.

Use the following archTIS guidelines and recommendations when classifying sensitive data.

  • Note that classification is a multi-step process.
  • Classification is not something that is setup for your entire environment and performed in one attempt.

Establish a Classification Schema - Start with what you know

  • Keep your classification schema simple
    • Layer upon layer of classifications can lead to confusion
    • Classifying data as "not-sensitive" is simple and often overlooked, but can be very useful
    • Don't use complex naming conventions
  • Avoid over-labeling
    • Use labels and terms users can understand and which are easily defined
  • Create a checklist for each data type
    • Which data types are provided - Spirion data types (Social Security number, credit card number, phone number, etc.)?
    • Which data types are missing and must be created (custom data types)?
      • Custom data types are created by you
      • Custom data type examples include: keywords, sensitive data definitions, regular expressions, dictionary data types, etc. such as: AWS Access Key Keyword - Pattern, CUI, EIN, ePHI, HSA Number, HSA Validator, IMEI, IPv4, IPv6, mother's maiden name, Asana client secret, etc.

  • Leverage Sensitive Data Definitions - Existing Spirion or Custom data types such as Social Security number, credit card number, phone number, IMEI, ABN, etc.
  • Below is an example of a custom data type called "PII" (personally identifiable information), which contains multiple native, Spirion-provided data types

PII data type example with definition logic

Limit the Scope of What is Classified

  • To start, apply classifications to the data in your Spirion database
    • Examine the senstive data (SSN, CCN, etc.) in the files your scans have discovered and decide how the sensitive data should be classified
    • Next, decide how to classify the files which contain the sensitive data
    • Lastly, what are the logical remediation steps to take on this data? Redact, shred, notify users? Is classifying the data or file the final step?
  • Choose a group or department to test your classification schema on such as HR, Finance, a small database, or onedrive account, etc.
  • Decide on a reasonable level of error and understand thresholds

Test Your Classifications Before Deploying Them Across Your Environment

  • Understand the service tasks related to classifications and workflows
  • Start with easy operators - Equals, Greater Than
  • Apply classifications to Spirion database file records first, and later to the files themselves: the setting which controls this option in Scan Playbooks is shown in the screenshot below.
  • When the file itself is classified, the classification is embedded into the file's metadata and follows the file wherever it goes


Was this article helpful?