Sourcing Data in Microsoft Azure Flashcards
What is defined as information that either does NOT have a pre-defined data model or is NOT organized in a pre-defined manner?
Data at rest
Unstructured data
Semi-structured data
Streaming data
Unstructured data
What is defined as data that is continuously generated by different sources?
Streaming data
Structured data
Semi-structured data
Unstructured data
Streaming data
What is defined as data that was NOT collected by your organization?
Data at rest
Streaming data
Intermediate data
External data
External data
What is an example of streaming data?
Data warehouse
Sensor data at intervals
Spreadsheet
Blob storage
Sensor data at intervals
What should be the first step in the requirements gathering process?
Asking why
Locating data
Exploring data
Assessing data
Asking why
If you have representative data that is a good fit, what could be missing, making it low-quality data?
Completeness
Politics
Profiles
Standardization
Completeness
What should be considered when determining how to migrate data to Azure?
Host environment
Externality of data
Network capacity
Data structure
Network capacity
What is the name of the offline data transfer offering from Microsoft Azure?
Data Box
Data Transfer
Azure Data Transfer
Offline Transfer
Data Box
What is a pipeline in Azure Data Factory?
Key-value pairs of read-only configuration
Used to store temporary values and can also be used in conjunction with parameters
A logical grouping of activities that performs a unit of work
Connection information that’s needed for Data Factory to connect to external resources
A logical grouping of activities that performs a unit of work
What is supported on Azure HD Insight as a cluster type?
Azure Datalake
SQL Data Warehouse
Hazelcast Jet
Apache Kafka
Apache Kafka