Databases and Analytics Flashcards

1
Q

Amazon Relational Database Service (RDS)

A
  • RDS uses EC2 instances, so you must choose an instance
    family/type
  • Relational databases are known as Structured Query Language
    (SQL) databases
  • RDS is an Online Transaction Processing (OLTP) type of database
  • Easy to set up, highly available, fault tolerant, and scalable
2
Q

RDS Encryption

A
  • Can encrypt your Amazon RDS instances and snapshots at rest
  • Encryption uses AWS Key Management Service (KMS)
3
Q

Which database engines does RDS support?

A

SQL Server, Oracle, MySQL, PostgreSQL, Aurora,
MariaDB

4
Q

RDS scaling measures and DR?

A
  • Scales up by increasing instance size (compute and storage)
  • Read replicas option for read heavy workloads (scales out for
    reads/queries only)
  • Disaster recovery with Multi-AZ option
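
A minimal sketch of how an application might use read replicas to scale out reads: writes go to the primary, while SELECT statements are spread across the replicas in round-robin order. The `ReadWriteRouter` class and the endpoint names are hypothetical and only illustrate the routing idea; they are not part of any AWS SDK.

```python
import itertools


class ReadWriteRouter:
    """Sends reads to replicas (round-robin) and everything else to the primary."""

    def __init__(self, primary, replicas):
        self.primary = primary
        # Cycle through replicas forever; None means no replicas are configured.
        self._replicas = itertools.cycle(replicas) if replicas else None

    def endpoint_for(self, sql):
        is_read = sql.lstrip().upper().startswith("SELECT")
        if is_read and self._replicas is not None:
            return next(self._replicas)
        return self.primary


# Hypothetical endpoint names, for illustration only.
router = ReadWriteRouter("primary.example", ["replica-1.example", "replica-2.example"])
targets = [
    router.endpoint_for("SELECT * FROM orders"),
    router.endpoint_for("SELECT * FROM users"),
    router.endpoint_for("INSERT INTO orders VALUES (1)"),
]
print(targets)
```

Because replicas serve queries only, every statement that is not a SELECT still lands on the primary, which is why read replicas help read-heavy workloads but not write-heavy ones.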
5
Q

Amazon Aurora

A
  • Amazon Aurora is an AWS database offering in the RDS
    family
  • Amazon Aurora is a MySQL- and PostgreSQL-compatible relational database built for the cloud
  • Amazon Aurora features a distributed, fault-tolerant, self-healing storage system that auto-scales up to 128TB per database instance
6
Q

Amazon DynamoDB

A
  • Fully managed NoSQL database service
  • Key/value store and document store
  • It is a non-relational, key-value type of database
  • Fully serverless service
  • Push button scaling
7
Q

Amazon DynamoDB features and benefits

A

  • Serverless - Fully managed, fault-tolerant service
  • Highly available - 99.99% availability SLA (99.999% for Global Tables)
  • NoSQL type of database with name/value structure - Flexible schema, good for when data is not well structured or is unpredictable
  • Horizontal scaling - Seamless scalability to any scale with push button scaling or Auto Scaling
  • DynamoDB Accelerator (DAX) - Fully managed in-memory cache for DynamoDB that increases performance (microsecond latency)
  • Backup - Point-in-time recovery down to the second within the last 35 days; on-demand backup and restore
  • Global Tables - Fully managed multi-region, multi-master solution

8
Q

Amazon Redshift

A
  • Redshift is a SQL-based data warehouse used for analytics
    applications
  • Redshift is a relational database that is used for Online
    Analytical Processing (OLAP) use cases
  • Redshift uses Amazon EC2 instances, so you must choose an
    instance family/type
  • Redshift always keeps three copies of your data
  • Redshift provides continuous/incremental backups
9
Q

Amazon EMR

A
  • Managed cluster platform that simplifies running big data
    frameworks including Apache Hadoop and Apache Spark
  • Used for processing data for analytics and business
    intelligence
  • Can also be used for transforming and moving large amounts
    of data
  • Performs extract, transform, and load (ETL) functions
10
Q

Amazon ElastiCache

A
  • Fully managed implementations of Redis and Memcached
  • ElastiCache is a key/value store
  • In-memory database offering high performance and low
    latency
  • Can be put in front of databases such as RDS and DynamoDB
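
Putting a cache in front of a database is often done with the cache-aside pattern, sketched below under simplifying assumptions. A plain dictionary stands in for the in-memory cache and `SlowDatabase` is a hypothetical stand-in for a backing store such as RDS or DynamoDB; neither class is an ElastiCache API.

```python
import time


class SlowDatabase:
    """Hypothetical stand-in for a backing store such as RDS or DynamoDB."""

    def __init__(self, rows):
        self.rows = dict(rows)
        self.reads = 0  # counts how often the database is actually queried

    def get(self, key):
        self.reads += 1
        return self.rows.get(key)


class CacheAside:
    """Checks the in-memory cache first and falls back to the database on a miss."""

    def __init__(self, database, ttl_seconds=60.0):
        self.database = database
        self.ttl_seconds = ttl_seconds
        self._cache = {}  # key -> (value, expiry timestamp)

    def get(self, key):
        entry = self._cache.get(key)
        if entry is not None and entry[1] > time.monotonic():
            return entry[0]  # cache hit: the database is not touched
        value = self.database.get(key)  # cache miss: read from the database
        self._cache[key] = (value, time.monotonic() + self.ttl_seconds)
        return value


db = SlowDatabase({"user:1": "Ada"})
cache = CacheAside(db)
first = cache.get("user:1")   # miss, goes to the database
second = cache.get("user:1")  # hit, served from memory
print(first, second, db.reads)
```

The second lookup never reaches the database, which is where the lower latency comes from; the time-to-live bounds how stale a cached value can get.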
11
Q

Amazon Athena

A
  • Athena queries data in S3 using SQL
  • Can be connected to other data sources with Lambda
  • Data can be in CSV, TSV, JSON, Parquet and ORC formats
  • Uses a managed Data Catalog (AWS Glue) to store
    information and schemas about the databases and tables
12
Q

AWS Glue

A
  • Fully managed extract, transform and load (ETL) service
  • Used for preparing data for analytics
  • AWS Glue runs the ETL jobs on a fully managed, scale-out
    Apache Spark environment
  • Works with data lakes (e.g. data on S3), data warehouses
    (including Redshift), and data stores (including RDS or EC2
    databases)
13
Q

Amazon Kinesis Data Streams

A
  • Producers send data, which is stored in shards for 24 hours by
    default (retention can be extended up to 365 days)
  • Consumers process the data and save to another service
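
Kinesis Data Streams picks a shard for each record by taking the MD5 hash of its partition key and mapping the resulting 128-bit number onto the shards' hash key ranges. The sketch below illustrates that idea, assuming the hash space is split evenly across shards (real streams can have uneven ranges after resharding). The `shard_for` function is hypothetical, not an SDK call.

```python
import hashlib

HASH_SPACE = 2 ** 128  # MD5 yields a 128-bit value


def shard_for(partition_key: str, shard_count: int) -> int:
    """Returns the index of the shard whose hash range contains the key.

    Assumes shard_count equal-sized hash key ranges (an illustration only).
    """
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    hash_key = int.from_bytes(digest, "big")
    return hash_key * shard_count // HASH_SPACE


# Records with the same partition key always land in the same shard,
# so a consumer reading that shard sees them in order.
for key in ["sensor-a", "sensor-b", "sensor-a"]:
    print(key, "-> shard", shard_for(key, 4))
```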
14
Q

Amazon Kinesis Data Firehose

A
  • No shards, completely automated and elastically scalable
  • Saves data directly to another service such as S3, Splunk,
    Redshift, or Elasticsearch
15
Q

Amazon Kinesis Data Analytics

A
  • Provides real-time SQL processing for streaming data
16
Q

AWS Data Pipeline

A

AWS Data Pipeline
* Processes and moves data between different AWS compute and
storage services
* Save results to services including S3, RDS, DynamoDB, and EMR

17
Q

Amazon QuickSight

A

Amazon QuickSight
* Business intelligence (BI) service
* Create and publish interactive BI dashboards for Machine
Learning-powered insights

18
Q

Amazon Neptune

A

Amazon Neptune
* Fully managed graph database service

19
Q

Amazon DocumentDB

A

Amazon DocumentDB
* Fully managed document database service (non-relational)
* Supports MongoDB workloads
* Queries and indexes JSON data

20
Q

Amazon QLDB

A
  • Fully managed ledger database for immutable change history
  • Provides cryptographically verifiable transaction logging
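
One way to picture "cryptographically verifiable" logging is a hash chain, where each entry's SHA-256 hash covers the previous entry's hash, so altering any past record breaks verification. The sketch below is a simplified illustration of that general idea, not QLDB's actual journal design; the `Ledger` class and `GENESIS` value are hypothetical.

```python
import hashlib
import json

GENESIS = "0" * 64  # placeholder hash that precedes the first entry


def entry_hash(prev_hash, record):
    """Hashes a record together with the hash of the entry before it."""
    payload = json.dumps(record, sort_keys=True).encode("utf-8")
    return hashlib.sha256(prev_hash.encode("ascii") + payload).hexdigest()


class Ledger:
    """Append-only list of (record, hash) pairs linked into a chain."""

    def __init__(self):
        self.entries = []

    def append(self, record):
        prev = self.entries[-1][1] if self.entries else GENESIS
        self.entries.append((record, entry_hash(prev, record)))

    def verify(self):
        prev = GENESIS
        for record, stored in self.entries:
            if entry_hash(prev, record) != stored:
                return False  # this record or an earlier one was altered
            prev = stored
        return True


ledger = Ledger()
ledger.append({"account": "A", "amount": 10})
ledger.append({"account": "A", "amount": -3})
print(ledger.verify())
```

Replacing an earlier record without recomputing every later hash makes `verify()` return False, which is the property that makes the change history tamper-evident.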
21
Q

Amazon Managed Blockchain

A
  • Fully managed service for creating and joining public and
    private blockchain networks using Hyperledger Fabric and Ethereum
22
Q

AWS Migration Hub

A
  • Provides a single location to track the progress of application
    migrations across multiple AWS and partner solutions
23
Q

AWS Database Migration Service (DMS)

A
  • AWS Database Migration Service helps you migrate
    databases to AWS quickly and securely.
  • The source database remains fully operational during the
    migration, minimizing downtime to applications that rely on
    the database
24
Q

AWS Server Migration Service (SMS)

A
  • Migrates servers and virtual machines to Amazon EC2
  • Agentless service which makes it easier and faster for you to
    migrate thousands of on-premises workloads to AWS
  • Automate, schedule, and track incremental replications of
    live server volumes
25
Q

AWS DataSync

A
  • Online data transfer service
  • Transfer data between on-premises and AWS storage
    services
26
Q

Snowball Family

A
  • AWS Snowball and Snowmobile are used for migrating large volumes of data to AWS
  • Uses a secure storage device for physical transportation
  • Snowball (50TB or 80TB) “petabyte scale”
  • Snowball Edge (100TB) “petabyte scale”
  • Snowmobile: “exabyte scale” with up to 100PB per
    Snowmobile

Types

Snowball Edge Compute Optimized
Snowball Edge Storage Optimized
Snowcone

27
Q

Snowball Edge Compute Optimized

A
  • Provides block and object storage and optional GPU
  • Edge computing use cases
28
Q

Snowball Edge Storage Optimized

A
  • Provides block storage and Amazon S3-compatible object storage
  • Use for local storage and large-scale data transfer
29
Q

Snowcone

A
  • Small device used for edge computing, storage and data transfer
  • Can transfer data offline or online with AWS DataSync agent
30
Q

Amazon Rekognition

A
  • Add image and video analysis to your applications
  • Identify objects, people, text, scenes, and activities in images
    and videos
31
Q

Amazon Transcribe

A
  • Add speech to text capabilities to applications
  • Recorded speech can be converted to text before it can be
    used in applications
32
Q

Amazon Translate

A
  • Neural machine translation service that delivers fast, high-quality, and affordable language translation
  • Localize content such as websites and applications for your
    diverse users
33
Q

Amazon SageMaker

A
  • Helps data scientists and developers to prepare, build, train,
    and deploy high-quality machine learning (ML) models
34
Q

Amazon Comprehend

A
  • Natural-language processing (NLP) service
  • Uses machine learning to uncover information in unstructured
    data
35
Q

Amazon Lex

A
  • Conversational AI for chatbots
  • Build conversational interfaces into any application using voice
    and text
36
Q

Amazon Polly

A
  • Turns text into lifelike speech
  • Create applications that talk, and build entirely new categories of
    speech-enabled products
37
Q

Amazon WorkSpaces

A
  • Managed Desktop-as-a-Service (DaaS) solution
  • Provision either Windows or Linux desktops
38
Q

Amazon AppStream 2.0

A
  • Fully managed non-persistent application streaming service
  • Alternative to popular products such as Citrix XenApp
39
Q

Amazon WorkLink

A
  • Provides secure, one-click access to your internal websites and
    web apps using mobile phone browsers
  • Does not require a VPN client or app
40
Q

Amazon WorkDocs

A
  • Fully managed, secure content creation, storage, and
    collaboration service
  • Create, edit, and share content that’s centrally stored on
    AWS
41
Q

AWS IoT Core

A
  • Lets you connect IoT devices to the AWS cloud without the
    need to provision or manage servers