Data Engineer Whiteboard Flashcards

1
Q

What are the high-level whirteboard areas

A

Ask questions
Dataflow
Data Integrity/Quality
Monitoring and alerting
Testing - unit/load
Scalability
Infosec compliance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What questions should you ask?

A

Restate problem
Scope
Edge cases
Any tests?
Ask questions about the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Summerize Data Engineering from data chaos to data insights

A

Data - raw data chaos
Analytics - data analytics unlocks patterns, tends, and correlations
Insights - Data insights. Actionable indights emerge, driving informed decision making
Wisdom - organizational wisdom . The origanization gains wisdom, leveraging insight for growth

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what are the data engineering flowchart sections?

A

Source, ingestion, transformation, serving, analytics, ML, reverse ETL, storage and undercurents

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the undercurrents?

A

Security, data management, data ops, data architecture, orchestration, software engineering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the data engineering best practices?

A

Proactive data monitoring, schema drift management, continuous documentation, data security measures, version control and backups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Summary of fundamentals of data engineering

A

Data engineering The discipline focuses on preparing “big data” for analytical or operational uses.
Use cases
Data engineering lifecycle
Data pipelines
Batch vs. stream processing
Data engineering best practices
Data engineering vs. artificial intelligence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is data engineering?

A

The discipline focuses on preparing “big data” for analytical or operational uses.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the Data engineering lifecycle

A

The stages, from data ingestion to analytics, encompass integration, transformation, warehousing, and maintenance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are data pipelines?

A

A visual flow of the entire data engineering process, highlighting how data moves through each stage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is batch vs stream processing?

A

Distinguishing between processing data in large sets (batches) versus real-time (stream) processing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the Data engineering best practices?

A

Established methods and strategies in data engineering to ensure data integrity, efficiency, and security.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Data engineering vs. artificial intelligence?

A

Differentiating the process of preparing data for AI applications from using AI to enhance data engineering tasks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly