Big Data Flashcards

1
Q

BigQuery

A
  • serverless, high scalable and cost-effective cloud data warehouse
  • analyze large datasets quickly and easily
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Cloud Dataproc

A
  • managed Hadoop and Spark service to process large datasets
  • good choice for running batch and streaming data processing jobs
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Cloud Dataflow

A
  • fully- managed real-time data processing service
  • for batch and streaming Big Data processing
  • good choice for running real-time data processing jobs
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Cloud Data Fusion

A
  • cloud-native, fully-managed, enterprise data integration service
  • good choice for integrating data from a variety of sources
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Cloud Pub/Sub

A

-fully-managed, real-time messaging service between apps
- good choice to send and receive messages reliably and efficiently

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Cloud Dataprep

A

-data preparation tool to clean, transform and integrate data for analysis
- prepare data for ML and big data analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Cloud Datalab

A

-interactive data analysis and exploration tool to visualize and analyze data in a web browser
- good choice if you want to analyze data without writing code

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Cloud Data Catalog

A

-metadata management service to organize, discover and manage data assets
- good choice if you need to manage large number of data assets and make them accessible to their users

How well did you know this?
1
Not at all
2
3
4
5
Perfectly