Data Platforms Flashcards

1
Q

What is Data-Driven Innovation?

A

Use of data and analytics to foster new products

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Analytics?

A

A catch-all term for different business intelligence (BI) and application-related initiatives

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Advanced Analytics?

A

(Semi-)Autonomous examination of data to discover deeper insights

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Augmented Analytics?

A

Use of technologies such as machine learning and AI to assist with data preparation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a Data Platform?

A

A centralized infrastructure facilitating ingestion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the main challenge with raw data in a Data Platform?

A

Raw data is difficult to obtain

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a Database?

A

A structured and persistent collection of information about some aspect of the real world

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a Data Warehouse (DWH)?

A

A collection of data that supports decision-making processes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is OLTP?

A

Online Transaction Processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is OLAP?

A

Online Analytical Processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the difference between OLTP and OLAP?

A

OLTP involves constant transactions and short-term data retention

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a Schemaless Database?

A

A type of database with no predefined schema

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a Data Lake?

A

A central repository for storage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the differences between Data Warehouses and Data Lakes?

A

Data Warehouses are schema-on-write and curated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a Data Lakehouse?

A

A data management architecture combining the flexibility of data lakes with the management and ACID transactions of data warehouses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is Data Provenance?

A

The description of the origins of data and the process by which it arrived at the database

17
Q

What is the role of a Data Steward?

A

Ensures that data governance processes are followed

18
Q

What is Data Versioning?

A

Managing changes to data collections with revision/version numbers

19
Q

What is Data Compression?

A

The process of encoding information using fewer bits. Lossless compression removes redundancy without losing data; lossy compression removes less important information.

20
Q

What is Data Profiling?

A

Methods to analyze data sets to derive metadata such as data types

21
Q

What is Entity Resolution?

A

Finding records that refer to the same entity across different data sources to ensure consistency and avoid duplication.

22
Q

What is a Data Catalog?

A

An organized inventory of data at a metadata level

23
Q

What is Data Fabric?

A

A design concept that connects different clouds (private

24
Q

What is Data Mesh?

A

A distributed data architecture with domain-oriented data ownership

25
Q

What is the difference between Data Mesh and Data Fabric?

A

Data Mesh focuses on decentralization and organizational change