Quizlet #4 Flashcards

1
Q

Data warehouse

A

Assembles data from multiple sources, including databases. Built to enable rapid analysis of large, multi-dimensional datasets. Serves as the central hub for all business data

2
Q

BigQuery

A

BigQuery is serverless: resources, such as compute power, are automatically provisioned behind the scenes as needed to run your queries. Businesses therefore do not pay for compute power unless they are actually running a query

3
Q

Pub/Sub and Dataflow

A

Work together to bring unstructured data into the cloud and transform it into semi-structured data
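
A minimal sketch of the unstructured-to-semi-structured step, in plain Python. It does not use the Pub/Sub or Dataflow client libraries; the log format, the `parse_log_line` helper, and the field names are hypothetical and chosen only for illustration.

```python
import json
import re

# Hypothetical raw log format: "<timestamp> <LEVEL> <free-text message>"
LOG_PATTERN = re.compile(
    r"^(?P<timestamp>\S+)\s+(?P<level>[A-Z]+)\s+(?P<message>.*)$"
)


def parse_log_line(line: str) -> dict:
    """Turn one unstructured text line into a semi-structured record."""
    match = LOG_PATTERN.match(line.strip())
    if match is None:
        # Keep unparseable input instead of dropping it silently.
        return {"raw": line, "parsed": False}
    record = match.groupdict()
    record["parsed"] = True
    return record


# Stand-in for messages arriving from a real-time ingestion service.
incoming = [
    "2024-01-05T10:00:00Z INFO user signed in",
    "garbled line without structure",
]

records = [parse_log_line(line) for line in incoming]
print(json.dumps(records, indent=2))
```

In a real pipeline the ingestion service would deliver the lines and the processing service would apply a transform like this at scale; here a list stands in for both.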

4
Q

Data lake

A

Repository for raw data that tends to serve many purposes. Sometimes holds ‘back-up’ data, which helps businesses build resilience against unexpected harm affecting their data. Also holds historical data that is not relevant to day-to-day business operations

5
Q

Pub/Sub

A

Service for real-time ingestion of data

6
Q

Dataflow

A

Service for large-scale processing of data

7
Q

Cloud storage benefits

A

Any amount of data, low latency, accessible from anywhere

8
Q

Cloud Storage

A

Multi-regional storage ideal for serving content to users worldwide

9
Q

Regional Storage

A

Offered by Cloud Storage; ideal when an organization wants to use the data locally. It gives added throughput and performance by storing data in the same region as your compute infrastructure

10
Q

Looker

A

Business intelligence solution that sits on top of any analytics database and makes it simple to describe your data and define business metrics

11
Q

Artificial intelligence

A

Term that describes any kind of machine capable of acting autonomously

12
Q

Machine Learning

A

Uses standard algorithms or standard models to analyze data in order to derive predictive insights and make repeated decisions at scale

13
Q

Data cleanliness

A

Data needs to be cleaned because unclean data prevents the model from making accurate predictions or understanding data behavior. Also referred to as data consistency.
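
A minimal sketch of what cleaning might look like before training, in plain Python. The record layout (`id`, `country`, `amount`) and the `clean_records` helper are hypothetical; real cleaning rules depend on the dataset.

```python
def clean_records(records):
    """Normalize inconsistent values, then drop incomplete and duplicate rows."""
    cleaned = []
    seen = set()
    for row in records:
        # Normalize formatting so " us " and "US" count as the same value.
        country = (row.get("country") or "").strip().upper()
        amount = row.get("amount")
        if not country or amount is None:
            continue  # skip rows with missing fields
        key = (row["id"], country, float(amount))
        if key in seen:
            continue  # skip rows that duplicate an earlier one
        seen.add(key)
        cleaned.append(
            {"id": row["id"], "country": country, "amount": float(amount)}
        )
    return cleaned


raw = [
    {"id": 1, "country": " us ", "amount": "10.5"},
    {"id": 1, "country": "US", "amount": 10.5},  # duplicate once normalized
    {"id": 2, "country": "de", "amount": None},  # missing value
    {"id": 3, "country": "DE", "amount": 7},
]

cleaned = clean_records(raw)
print(cleaned)
```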

14
Q

Data completeness

A

Availability of sufficient data about the world to replace human knowledge

15
Q

Qualities of good data

A

Coverage, cleanliness, completeness

16
Q

Data coverage

A

Scope of the domain and all possible scenarios the data can account for
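
One rough way to check coverage is to count how many of the scenarios you expect the model to face actually appear in the data. The sketch below assumes each example carries a `scenario` label; that field, the scenario names, and the `coverage_report` helper are hypothetical.

```python
from collections import Counter


def coverage_report(examples, expected_scenarios):
    """Report which expected scenarios appear in the data and which do not."""
    expected = set(expected_scenarios)
    counts = Counter(example["scenario"] for example in examples)
    missing = sorted(expected - set(counts))
    covered_fraction = 1 - len(missing) / len(expected)
    return {
        "counts": dict(counts),
        "missing": missing,
        "covered_fraction": covered_fraction,
    }


expected = ["weekday", "weekend", "holiday", "sale_event"]
examples = [
    {"scenario": "weekday"},
    {"scenario": "weekday"},
    {"scenario": "weekend"},
]

report = coverage_report(examples, expected)
print(report)
```

Here two of the four expected scenarios never appear, so a model trained on this data has nothing to learn from for holidays or sale events.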

17
Q

Limitations on a good ML model

A

Lack of availability of better data, mistaken expectations, poor execution of program design and implementation

18
Q

TensorFlow

A

A comprehensive, flexible ecosystem of tools, libraries and community resources that lets researchers push innovations in ML and lets developers easily build and deploy ML-powered applications, paying only for what is being used