GCP BigData General Flashcards

1
Q

What are the GCP BigData services?

A
  • Dataproc
  • Dataflow
  • Bigquery
  • Cloud Pub/Sub
  • Datalab
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Dataproc?

A

It is managed,

  • Hadoop
  • Mapreduce
  • Pig
  • Hive
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Dataflow?

A

Stream and batch processing pipelines.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is BigQuery?

A

It is a data wherehouse with analytical capabilities.

Stream data at 199K rows er second

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

For GCP Big Data service, do you need to provision and manage resources?

A

No Google takes care of this for you.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How long will it take to deploy a hadoop cluster using Dataproc?

A

90sec

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When deploying hadoop with Dataproc can I decided on the instance size and memory?

A

Yes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When running hadoop with Dataproc am I fixed to the size of the cluster I am currently using?

A

No, you can scale up or down as needed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

I need to monitor my Dataproc hadoop cluster, what options do I have?

A

Use stackdriver

How well did you know this?
1
Not at all
2
3
4
5
Perfectly