General Knowledge Flashcards

(82 cards)

1
Q

What is Redshift?

A

A petabyte-scale, fully managed data warehouse

2
Q

Is Redshift designed for OLAP or OLTP?

A

It is specifically designed for OLAP.

3
Q

A cluster is made up of what?

A

A leader node and one or more compute nodes

4
Q

What is the maximum number of compute nodes you can have in a cluster?

A

128 max per cluster

5
Q

Can clusters have more than one database?

A

Yes, a cluster can contain one or more databases

6
Q

What node stores user data?

A

The compute node

7
Q

What node is responsible for managing communication with the client programs?

A

The leader node

8
Q

What node develops execution plans?

A

The leader node

9
Q

What node has its own memory, CPU, and attached disk storage?

A

The compute node.

10
Q

What are the two node types in Redshift?

A

DS (Dense Storage) node types: use HDDs and are the low-cost option. Available in two sizes: xlarge and 8xlarge.

DC (Dense Compute) node types: used to create high-performance data warehouses. These use SSDs and come in large and 8xlarge sizes.

11
Q

What is a node slice?

A

A partition of a node's memory, CPU, and disk space that processes a portion of the workload assigned to that node.

12
Q

How is the number of node slices determined?

A

By the size of the node

13
Q

What does Redshift Spectrum do?

A

It can query exabytes of data in S3 without loading it.

14
Q

What compression does Redshift Spectrum support?

A

Gzip and Snappy

15
Q

Why is Redshift Spectrum so fast?

A

It uses massively parallel processing (MPP), columnar data storage, and column compression

16
Q

Does Redshift Spectrum scale?

A

Yes, it scales to handle more parallel processing

17
Q

How large is the blocksize for Redshift?

A

1 MB

18
Q

Can you change the column compression after the table is created?

A

No. You cannot change the column compression after the table is created.

19
Q

Does Redshift replicate?

A

Yes, within the cluster

20
Q

Where does Redshift backup to?

A

S3 and it can be asynchronously replicated to another region

21
Q

Does Redshift have automated snapshots?

A

Yes.

22
Q

What happens when a drive or node fails?

A

It is automatically replaced

23
Q

How many AZs can a single cluster span?

A

One. Redshift is a single-AZ service

24
Q

If your Redshift cluster goes offline because the AZ is down and you need to query your data ASAP, how can you make this happen?

A

You can restore the cluster using the data stored in S3 into a new AZ that is not being impacted.

25
How does Redshift scale?
Redshift scales both horizontally and vertically
26
What happens when you scale Redshift, from a process perspective?
A new cluster is created while your old one remains available for reads. The CNAME is flipped, and data is moved in parallel to the new compute nodes.
27
How does the distribution style Auto work?
Redshift chooses the distribution style automatically, based on the size of the table data
28
How does the distribution style Even work?
Rows are distributed across slices in a round-robin fashion
29
How does the distribution style "Key" Work?
Rows are distributed based on one column
30
How does the distribution style All work?
The entire table is copied to every node.
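The four distribution styles above are declared at table creation. A minimal sketch, with hypothetical table and column names:

    -- AUTO: Redshift picks the style based on table size
    CREATE TABLE sales_auto (sale_id INT, store_id INT, amount DECIMAL(10,2)) DISTSTYLE AUTO;
    -- EVEN: rows spread round-robin across slices
    CREATE TABLE sales_even (sale_id INT, store_id INT, amount DECIMAL(10,2)) DISTSTYLE EVEN;
    -- KEY: rows placed according to the values of one column
    CREATE TABLE sales_key (sale_id INT, store_id INT, amount DECIMAL(10,2)) DISTSTYLE KEY DISTKEY (store_id);
    -- ALL: a full copy of the table on every node
    CREATE TABLE stores_all (store_id INT, store_name VARCHAR(100)) DISTSTYLE ALL;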
31
What are Redshift Sort Keys?
They are similar to indexes in a traditional relational database
32
What is a single column sort key?
A sort key defined on a single column, such as a date column
33
What is a compound sort key?
A sort key made up of all the columns listed in the sort key definition, in the order they are listed
34
What is the default sort type in Redshift?
Compound
35
What is an interleaved sort key?
A sort key that gives equal weight to every column.
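The sort key styles above are also declared at table creation. A minimal sketch, with hypothetical table and column names:

    -- Single-column sort key
    CREATE TABLE events_single (event_id INT, event_date DATE, user_id INT) SORTKEY (event_date);
    -- Compound sort key (the default when multiple columns are listed)
    CREATE TABLE events_compound (event_id INT, event_date DATE, user_id INT) COMPOUND SORTKEY (event_date, user_id);
    -- Interleaved sort key (equal weight to each listed column)
    CREATE TABLE events_interleaved (event_id INT, event_date DATE, user_id INT) INTERLEAVED SORTKEY (event_date, user_id);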
36
What is the COPY command?
A command that loads data into Redshift by reading from multiple data files or multiple data streams simultaneously
37
What are the sources of the COPY command?
S3, EMR, DynamoDB, remote hosts with SSH
38
When using S3 as a source for the COPY command, what is required other than IAM permissions?
A Manifest
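A sketch of a COPY from S3 using a manifest file; the table, bucket, manifest path, and IAM role ARN are hypothetical:

    COPY sales
    FROM 's3://my-bucket/manifests/sales.manifest'
    IAM_ROLE 'arn:aws:iam::123456789012:role/MyRedshiftRole'
    MANIFEST;   -- treat the S3 object as a manifest listing the actual data files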
39
What does the "UNLOAD" command do?
It allows you to export data from Redshift into files in S3
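A sketch of an UNLOAD to S3, with a hypothetical query, bucket prefix, and IAM role:

    UNLOAD ('SELECT * FROM sales WHERE sale_date >= ''2023-01-01''')
    TO 's3://my-bucket/exports/sales_'
    IAM_ROLE 'arn:aws:iam::123456789012:role/MyRedshiftRole'
    GZIP            -- compress the output files
    PARALLEL ON;    -- write multiple files in parallel across slices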
40
What is Enhanced VPC Routing?
It forces all traffic from COPY and UNLOAD commands to go through the VPC rather than over the internet
41
What is the prime use case for the COPY command?
To copy data into Redshift from an external source
42
How do you move data within Redshift?
SELECT INTO or CREATE TABLE AS
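A sketch of both forms, using hypothetical table names:

    -- CREATE TABLE AS (CTAS): creates and populates a new table from a query
    CREATE TABLE sales_2023 AS
    SELECT * FROM sales WHERE sale_year = 2023;

    -- SELECT INTO: an equivalent way to materialize a query result into a new table
    SELECT * INTO sales_2024 FROM sales WHERE sale_year = 2024;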
43
Can the "COPY" command decrypt data?
Yes. It can decrypt data as it is loaded from S3. It uses hardware accelerated SSL to keep it fast.
44
The "COPY" command can speed up data transfers by using which compression formats?
GZIP, LZOP, and BZIP2
45
What does the automatic compression option do when using the "COPY" command?
It analyzes data being loaded and figures out the optimal compression to use for storing it.
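A sketch of a COPY that combines compressed input with automatic compression analysis on an initial load into an empty table; the table, file path, and IAM role are hypothetical:

    COPY sales
    FROM 's3://my-bucket/data/sales.csv.gz'
    IAM_ROLE 'arn:aws:iam::123456789012:role/MyRedshiftRole'
    GZIP              -- the source files are gzip-compressed
    COMPUPDATE ON;    -- analyze the data and apply optimal column compression encodings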
46
What is a narrow table?
A table with many rows and few columns
47
What is the best practice to load a narrow table?
Load it with a single COPY transaction, if possible.
48
What are the main steps in copying an encrypted snapshot to another region in AWS?
1. Create a KMS key in the destination region. 2. Create a snapshot copy grant in the destination region. 3. Specify the KMS key ID for which you are creating the copy grant. 4. In the source region, enable copying of snapshots and specify the copy grant you just created.
49
What is DBLINK?
It allows you to connect Redshift to a PostgreSQL database
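One common pattern runs the query from the PostgreSQL side using the standard dblink extension. A rough sketch, with a hypothetical connection string, query, and column list:

    CREATE EXTENSION IF NOT EXISTS dblink;

    SELECT *
    FROM dblink('host=my-cluster.abc123.us-east-1.redshift.amazonaws.com port=5439 dbname=dev user=awsuser password=MySecret1',
                'SELECT store_id, SUM(amount) FROM sales GROUP BY store_id')
         AS t(store_id INT, total NUMERIC);   -- dblink requires an explicit column definition list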
50
Can you use the COPY command to load data from EC2 / EMR?
Yes, you would use the remote host / SSH option
51
How do you automate data going in and out of tables in Redshift?
AWS Data Pipeline
52
What tool can be used to help migrate data into Redshift?
The AWS Database Migration Service
53
What is Redshift Workload Management (WLM)?
It prioritizes short, fast queries so they are not blocked by long, slow ones. This is configured using query queues: one for long-running queries and another for short ones.
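With manual WLM, one way to route a query to a specific queue from SQL is to label the session with a query group that the queue is configured to match; the group name and query below are hypothetical:

    SET query_group TO 'reports';   -- route subsequent queries to the queue matching 'reports'
    SELECT COUNT(*) FROM sales;
    RESET query_group;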
54
What is concurrency scaling?
It automatically adds capacity to handle an increase in concurrent read queries.
55
What is the maximum number of queues you can configure in Automatic Workload Management?
8 in total. The default is 5 with even memory allocation
56
What is the default concurrency for Manual Workload Management?
Five queries at once in a single queue
57
What is the maximum concurrency level for Manual Workload Management?
Fifty
58
What is query queue hopping in Manual Workload Management?
Queries that timeout in one queue can "hop" to the next to try again. The second queue could have a higher timeout value.
59
Can you use CREATE TABLE AS Statements with Short Query Acceleration (SQA)?
Yes. It can only be used with CREATE TABLE AS statements and read-only queries
60
Can you configure how long "short" is for Short Query Acceleration?
Yes. You can.
61
What does "VACUUM" do in Redshift?
It recovers space from deleted rows
62
What does "VACUUM FULL" do?
This is the default. It re-sorts rows and reclaims space from deleted rows.
63
What does "VACUUM DELETE" do?
It only reclaims space from deleted rows
64
What does "VACUUM SORT ONLY" do?
It re-sorts the table but does not reclaim disk space
65
What does "VACUUM REINDEX" do?
It re-analyzes the interleaved sort key columns and then performs a full vacuum
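A sketch of the four variants above, run against a hypothetical table named sales:

    VACUUM FULL sales;          -- default: re-sort rows and reclaim space from deleted rows
    VACUUM DELETE ONLY sales;   -- reclaim space from deleted rows only
    VACUUM SORT ONLY sales;     -- re-sort rows without reclaiming space
    VACUUM REINDEX sales;       -- re-analyze interleaved sort keys, then run a full vacuum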
66
What is Elastic Resize?
It allows you to quickly add or remove nodes of the same type
67
What node instance type has decoupled compute and storage?
The RA3 node type. It uses SSD-based managed storage.
68
What is Redshift Data Lake Export?
You can query Redshift and unload the results to a data lake in S3 in Apache Parquet format, which is fast and compact
69
What is Cross-Region Data Sharing?
The ability to share data across Redshift clusters without needing to copy it. This works across accounts and Regions. It only works with RA3 node types.
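A sketch of the data sharing SQL, with hypothetical share, table, and namespace identifiers:

    -- On the producer cluster
    CREATE DATASHARE salesshare;
    ALTER DATASHARE salesshare ADD SCHEMA public;
    ALTER DATASHARE salesshare ADD TABLE public.sales;
    GRANT USAGE ON DATASHARE salesshare TO NAMESPACE 'consumer-namespace-guid';

    -- On the consumer cluster
    CREATE DATABASE sales_share_db FROM DATASHARE salesshare OF NAMESPACE 'producer-namespace-guid';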
70
What is AQUA?
AQUA (Advanced Query Accelerator) sits between Redshift and S3 and provides accelerated query performance, up to 10 times faster, at no extra cost
71
What certificates are required if you want to use an HSM?
The client and server certificate are required
72
Can you enable HSM-encryption on your existing cluster after creation?
No. You need to create a new encrypted cluster first and then migrate data to it.
73
What do the "GRANT" and "REVOKE" commands do?
They provide a way to manage access at the table level for a user or group.
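A sketch with a hypothetical table and user group:

    GRANT SELECT ON TABLE sales TO GROUP analysts;
    REVOKE INSERT, UPDATE, DELETE ON TABLE sales FROM GROUP analysts;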
74
Can Redshift be serverless?
Yes. It automatically provisions and scales capacity for your workloads, and you pay only when it is in use
75
What are good uses for Redshift serverless?
Ad hoc business analytics, and dev and test environments
76
How is Redshift serverless billed?
It is billed by Redshift Processing Units (RPUs) per second, plus storage.
77
Does Redshift Spectrum work with Serverless?
No.
78
Does Workload Management work with Serverless?
No
79
Does serverless have a public endpoint?
No. You can only connect from within the VPC
80
Can you query Redshift through the AWS console?
Yes. You can use the Query Editor
81
What is an external schema?
It allows you to connect to an external data source, such as an AWS Glue Data Catalog or an RDS instance, and query the data without loading it into your data warehouse
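A sketch of creating an external schema over an AWS Glue Data Catalog database and querying it in place; the schema, database, IAM role, and table names are hypothetical:

    CREATE EXTERNAL SCHEMA spectrum_schema
    FROM DATA CATALOG
    DATABASE 'spectrum_db'
    IAM_ROLE 'arn:aws:iam::123456789012:role/MyRedshiftRole'
    CREATE EXTERNAL DATABASE IF NOT EXISTS;   -- create the Glue database if it does not already exist

    SELECT COUNT(*) FROM spectrum_schema.sales_events;   -- data stays in S3; nothing is loaded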
82
Can Redshift audit logging support KMS?
No, only S3 managed keys