Analytics - Athena Flashcards

1
Q

Interactive Query service for S3 where you can write SQL queries.

No need to load data, it stays in S3

Uses Presto under the hood

Serverless

a) RDS
b) Athena
c) S3

A

b) Athena

Supports many data formats.

Like human readable: CSV, TSV, JSON, Columnar/splittable Orc, Parquet, splittable Avo, compression Snapy, Zlib, LZO, Gzip.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

I want to perform adhoc queries of web logs. Which AWS service is better to use?

a) RDS
b) Athena
c) S3

A

B) Athena

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Which AWS service is better for querying staging data before loading to redshift

a) RDS
b) Athena
c) S3

A

b_athena

So like getting a bigger picture before you acutally commit data into a data warehouse.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Which aws service is better for analyzing CloudTrail/CloudFront/VPC/ELB logs in S3

a) RDS
b) Athena
c) S3

A

b) Athena

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Which aws service is better for integration with jypter, zepplin, rstudio notebooks

a) RDS
b) Athena
c) S3

A

b) is has odcb and jdbc interfaces so you can treat athena like any other rdbms

You can also make use of amazon quicksight visualization tool into athena.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How does Athena work with Glue?

For example, you have a Glue ____________________ populating the glue catalog for your s3 data.

a) RDS
b) Athena
c) S3
d) Glue Crawler

A

d) Glue Crawler

Extract columns and table definitions out of it for you. You can use the glue console to refine that definiiton as needed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Once you have a ___________________________ published for your S3 data, Athena will see it automatically and it can build a table from it.

a) RDS
b) Athena
c) S3
d) Glue data catalog

A

d) Anytime athena sees something in your glue data catalog in your account. Its going to make a table for that for you. So you can query it jus tlike you would any other sql database.

Other analytics tool can use that catalog to visulaize or analyze data like rds, redhsit, emr, redhsift spectrum. Services using apcahe give metastore.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

IN Athena you can control query access and track costs by ________________

a) RDS
b) Athena
c) S3
d) Workgroups

A

d) Workgroups

For example, specific types of quereis,

Each workgroup can have its own, Query hisotry, Data limits, IAM policies and encryption settings.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

in Athena how can you save lots of money by using ____________________.

ORC, Parquet. Save 30-90% and get better performance.

a) RDS
b) COlumnar formats
c) S3

A

Columnar formats

Partitions can also help lower the costs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

IN Athena pay as you go model. Do failed queries get charged?

A

No - only successful or cancelled queries count.

No charge for DDL operations (alter/create/drop)

But you are chaged by $5 per TB scans

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

True or Falase

Athena can encrypt results at rest in S3 staging directory

A

True

either server side - SSE-S3 (s3 managed key)

or SS-KMS (KMS Key)

or client side

CSE-KMS (KMS key)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

True or false

You can have cross account access in S3 bucket policy for Athena.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

True or Falase.

TLS encrypts in transit (between Athena and S3)

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Should you use Highly Formatted report/ Visaulisation for Athena?

A

No - Use QuickSight

AThena is jsut a sql query engine.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Should you use Athena for ETL?

A

No

Thats was Glue is for.

Can also do that with apache saprk for larger data sets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Does using a smaller number of files help athena perform betten than large number of small files?

A

Yes

Consider using partitions aswell. Use MSCK REPAIR table command to go back and add metadata to be able to add paritions to existing tables.

Use columnar data (ORC, Parquet)

17
Q

IN athena concurnet users can safley make row level modicaitons

A

Yes - athena acid transactions supported.

18
Q

ACID transactions are powered by apcahe____________

a) RDS
b) iceburg

A

Iceburg

Just add ‘table_type; = ‘ICEBERG’ in your create table command.

19
Q

True or False athena supports time travel operations. Where you can recover data recently deleted with a select statement.

A

True

20
Q

True or false you can set up a govenred table in lake formation in order to have acid support in Athena

A

True

can also just create using a table with iceberb

21
Q

True or falase periodic compaction can help preserver performance in ACID tranasctions for athena

A

True

For example, OPTIMIZE table REWRITE DATA using BIN_PACK where catalog = ‘c1’

22
Q
A