Chapter 4 Flashcards

1
Q

The main objects in databases are …?

A

tables. Why?

2
Q

What is one of the major differences between application databases and data warehouses?

A

how tables are created. Why?

3
Q

Tables in application databases are made to support …?

A

applications. Why?

4
Q

Tables in application databases are mostly …?

A

static. Why?

5
Q

Tables in a data warehouse are …?

A

dynamic. Why?

6
Q

Tables are …?

A

products, collections of business logic, and data flows. Why?

7
Q

What is the definition of orchestration?

A

a set of configurations to automate tasks, jobs, and their dependencies. Why?

8
Q

What can we use to automate the table creation process in GCP?

A

Cloud Composer. Why?

9
Q

What are common data pipeline issues?

A

data duplication, handling task dependencies, managing connections, handling late data. Why?
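
One common way to guard against data duplication when a task is rerun is to make the load idempotent: replace the whole partition for a run date instead of appending to it. The sketch below is an illustration only, with an in-memory dict standing in for a table; `load_partition` and the sample rows are hypothetical and not taken from the chapter.

```python
def load_partition(table, run_date, rows):
    """Replace the partition for run_date instead of appending to it."""
    # Overwriting by key means a retry or rerun cannot create duplicates.
    table[run_date] = list(rows)
    return table


table = {}
load_partition(table, "2024-01-01", [{"id": 1}, {"id": 2}])
# Rerunning the same load (for example, after a retry) keeps a single copy.
load_partition(table, "2024-01-01", [{"id": 1}, {"id": 2}])
total_rows = sum(len(rows) for rows in table.values())

# Late data: reload the affected date with the complete set of rows.
load_partition(table, "2024-01-01", [{"id": 1}, {"id": 2}, {"id": 3}])
rows_after_late_data = len(table["2024-01-01"])
print(total_rows, rows_after_late_data)
```

The same reload-by-date approach is one way to absorb late data: the affected date is loaded again in full, so the late rows appear without duplicating the earlier ones.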

10
Q

Cloud Composer is …?

A

a managed Airflow service in GCP. Why?

11
Q

What is unique about Airflow?

A

you can use Python scripts to manage workflows. Why?

12
Q

With respect to workflow management tools, the three main components are …?

A

Handling task dependencies, Scheduler, System Integration.

13
Q

Every Airflow task is made for …?

A

scheduled execution; Airflow is not a one-time job runner. Why?
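
To illustrate the difference between a scheduled workflow and a one-time job, the sketch below lists the run dates a daily schedule would produce between a start and an end date. It is plain Python, not Airflow's scheduler; `daily_runs` and the dates are hypothetical.

```python
from datetime import date, timedelta


def daily_runs(start, end):
    """List one run date per day from start up to, but not including, end."""
    days = (end - start).days
    return [start + timedelta(days=i) for i in range(max(days, 0))]


# A one-time job would run once; a daily schedule yields one run per day.
runs = daily_runs(date(2024, 1, 1), date(2024, 1, 4))
print(runs)
```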

14
Q

Airflow is not a …?

A

UI-based drag-and-drop tool. Why?

15
Q

Airflow is designed for you to …?

A

write the workflow using code. Why?

16
Q

All configurations can be managed in a …?

A

git repository. Why?

17
Q

What does DAG stand for?

A

Directed acyclic graph

18
Q

What is a DAG?

A

it is a collection of tasks that are chained together with their dependencies. Why?
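
To make "tasks chained together with their dependencies" concrete, here is a minimal sketch in plain Python (not Airflow itself) that uses the standard library's `graphlib` to order tasks so each one runs only after the tasks it depends on. The task names and both helper functions are hypothetical.

```python
from graphlib import CycleError, TopologicalSorter

# Hypothetical tasks: each key maps to the set of tasks it depends on.
dependencies = {
    "extract": set(),
    "load": {"extract"},
    "transform": {"load"},
}


def run_order(deps):
    """Return the tasks in an order that respects every dependency."""
    return list(TopologicalSorter(deps).static_order())


def has_cycle(deps):
    """Report whether the graph contains a cycle (a DAG must not)."""
    try:
        run_order(deps)
        return False
    except CycleError:
        return True


order = run_order(dependencies)
print(order)
```

The "acyclic" part matters: if two tasks depended on each other, no valid run order would exist, which is what `has_cycle` detects.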

19
Q

What is the term for when a DAG is running?

A

DAG run. Why?

20
Q

What is an operator with respect to a DAG?

A

operators are a collection of connections to different systems. Why?

21
Q

Airflow uses operators to …?

A

define tasks. Why?

22
Q

A DAG is …?

A

Airflow's term for a job configuration. Why?

23
Q

Before you can use Airflow/Cloud Composer you will need to …?

A

set up your Cloud Composer environment. Why?

24
Q

Cloud Composer cost is based on …?

A

cluster hours. Why?

25
Q

For checking task dependencies, you have two visualization options. What are they?

A

Tree view and Graph view. Why?