Cloud Dataflow Flashcards

1
Q

What is Cloud Dataflow?

A

A fully managed service for building batch and streaming data processing pipelines, in which data is collected, transformed, and then output.
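
The collect, transform, output flow can be sketched in plain Python. This is only an illustration of the three stages, assuming a tiny in-memory data set; it does not use the Apache Beam or Dataflow APIs, and the function names (collect, transform, output, run_pipeline) are hypothetical.

```python
from collections import Counter
from typing import Iterable, Iterator, List


def collect() -> Iterator[str]:
    """Collect stage: stand-in for a real source such as a file or message queue."""
    yield from ["alpha beta", "beta gamma", "alpha beta"]


def transform(lines: Iterable[str]) -> Counter:
    """Transform stage: split each line into words and count occurrences."""
    counts: Counter = Counter()
    for line in lines:
        counts.update(line.split())
    return counts


def output(counts: Counter) -> List[str]:
    """Output stage: format results for a sink (here, a sorted list of strings)."""
    return [f"{word}: {n}" for word, n in sorted(counts.items())]


def run_pipeline() -> List[str]:
    # Chain the three stages in order: collect -> transform -> output.
    return output(transform(collect()))


if __name__ == "__main__":
    for row in run_pipeline():
        print(row)
```

In a real Dataflow job, each stage would be a pipeline step and the service would distribute the work across machines; here everything runs in one process.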

2
Q

What are the key features of Cloud Dataflow? (7)

A
  1. Based on Apache Beam
  2. Processes data on multiple machines in parallel
  3. Handles streaming data, e.g. from Cloud Pub/Sub
  4. Handles batch or archived data, e.g. from BigQuery
  5. Serverless
  6. Templates for ease of replication
  7. Best choice if not using Apache Hadoop or Spark
3
Q

Where does Cloud Dataflow deliver its output?

A

BigQuery, Cloud Machine Learning, Cloud Bigtable

4
Q

3 example use cases for Cloud Dataflow

A
  1. Analytical dashboards
  2. Forecasting sales trends
  3. ETL