Sourcing Data in Microsoft Azure Flashcards

1
Q

What is defined as information that either does NOT have a pre-defined data model or is NOT organized in a pre-defined manner?

Data at rest

Unstructured data

Semi-structured data

Streaming data

A

Unstructured data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is defined as data that is continuously generated by different sources?

Streaming data

Structured data

Semi-structured data

Unstructured data

A

Streaming data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is defined as data that was NOT collected by your organization?

Data at rest

Streaming data

Intermediate data

External data

A

External data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is an example of streaming data?

Data warehouse

Sensor data at intervals

Spreadsheet

Blob storage

A

Sensor data at intervals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What should be the first step in the requirements gathering process?

Asking why

Locating data

Exploring data

Assessing data

A

Asking why

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

If you have representative data that is a good fit, what could be missing, making it low-quality data?

Completeness

Politics

Profiles

Standardization

A

Completeness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What should be considered when determining how to migrate data to Azure?

Host environment

Externality of data

Network capacity

Data structure

A

Network capacity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the name of the offline data transfer offering from Microsoft Azure?

Data Box

Data Transfer

Azure Data Transfer

Offline Transfer

A

Data Box

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a pipeline in Azure Data Factory?

Key-value pairs of read-only configuration

Used to store temporary values and can also be used in conjunction with parameters

A logical grouping of activities that performs a unit of work

Connection information that’s needed for Data Factory to connect to external resources

A

A logical grouping of activities that performs a unit of work

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is supported on Azure HD Insight as a cluster type?

Azure Datalake

SQL Data Warehouse

Hazelcast Jet

Apache Kafka

A

Apache Kafka

How well did you know this?
1
Not at all
2
3
4
5
Perfectly