Analytics Flashcards

1
Q

analytics

A

the act of querying or processing your data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

data warehouse

A

a data storage solution that aggregates massive amounts of historical data from disparate sources

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

uses of a data warehouse

A

querying, reporting, analytics and business intelligence

- not used for transaction processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Redshift

A

AWS’s data warehousing solution

  • improves speed and efficiency when querying
  • handles exabyte-scale data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Use cases for Redshift

A
  • to consolidate multiple databases for reporting

- when you want to run a relational database that doesn’t need to do transactions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Athena

A

a query service for Amazon S3

  • can analyze S3 data using SQL
  • pay per query
  • serverless
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Glue

A

data integration service that prepares your data for analytics

  • ETL service
  • prepares and loads data
  • helps you better understand your data
  • serverless
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

ETL

A

Extract, Transform and Load

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

data integration

A

the process of preparing and combining data for analytics, machine learning, and application development. It involves multiple tasks, such as discovering and extracting data from various sources; enriching, cleaning, normalizing, and combining data; and loading and organizing data in databases, data warehouses, and data lakes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Kinesis

A

allows you to analyze data and video streams in real time

- supports: audio, video, application logs, website clickstreams, IoT

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Use case for Kinesis

A

analyze logs in near real time for application monitoring or fraud detection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

EMR

A

Elastic Map Reduce

  • helps you process large amounts of big data
  • analyze data using Hadoop
  • works with big data frameworks like Apache Spark
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Data Pipeline

A

helps you move data between compute and storage services either running on AWS or on-premises

  • moves data at specific intervals or based on conditions
  • sends notifications of success or failure
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

use case for Data Pipeline

A

to move data from S3 to Redshift

How well did you know this?
1
Not at all
2
3
4
5
Perfectly