Explain Data Sources and Entity Sources Flashcards
Data sources are the
primary input for personal information discovery for
specific data subjects within a specific organization
Data sources are currently not auto-discovered – they
must be explicitly specified to be used, but we
have a discovery tool that can be leveraged for this purpose
Data Sources can be
structured, semi-structured and unstructured
Entity sources are based on
(structured) data sources
At a minimum, a data sources must be
reachable by a BigID scanner via the standard
protocol for said Data Source
Supported Unstructured File Types
DOCX DOTX XLSX PPTX PUB PDF TXT CSV LOG ZIP GZ Parquet
To associate discovered personal information with specific entities, it is necessary to
add and configure at least one Entity Source
Entity sources provide a
Unique ID, Display Name and Residency for each entity. The Unique ID is mandatory and must be unique for each entity (it can be a composite); others are optional
BigID can also discover other identifiable attributes in configured data sources based on
the attributes initially specified in the entity sources (using ‘enrichment)
If your data repositories do not contain a sufficiently unique identifier field to use as a Unique ID for Entity Sources, you can
create a Composite Unique ID from several fields.
Without an entity source configured:
- There is no correlation to entities
- You cannot connect an entity to its data
- Access requests cannot be fulfilled
- Other personal data finding logic will be limited