Udemy lecture 7: Databases & analytics Flashcards
What is a relational database?
Relational database is when you make a link to multiple tables (ex. a student made 1 table with student ID, Dept ID, Name, Email, & then a second table was made linking to the first one where in the second table it starts with Dept ID, then gives futher information) (think of it like an excel sheet)
In relational databases it uses the __________ language to perform queries or lookups
SQL (Whenever you hear SQL think of relational databases)
______________ databases are nonrelational databases
NoSQL
____________ databases are purpose built for specific data models & have flexible schemas for building modern applications
NoSQL
What are some benefits of NoSQL databases?
- Flexible- easy to evolve data model
- Scalability- designed to scale out by using distributed clusters
- High-performance- optimized for a specific data model
-Highly functional- types optimized for data model
What does JSON stand for?
Javascript object notation
NoSQL can have its data in _________ format
JSON
Data can be _______ in the JSON format
Nested (storing data using in a structure way, but the fields (information) can change over time so have to change that information (support for new types of arrays))
What is AWS responsibility related to databases
- Responsible for the entire database in terms of patching
- Automated backup & restore, operations, upgrades
- Monitoring, alerting
-AWS offers to manage different databses
______ is a relational database
RDS
What does RDS stand for?
Relational database service
What is a Relational database service?
A managed database service for database that will use SQL as a query language, & it will allow you to create databases in the cloud that will be managed by AWS
_________ is a proprietary database from AWS
Aurora
What are the advantages to using RDS than deploying a database on EC2?
- Automated provisioning, OS patching
- Continuous backups & restore to specific timestamps (point-in-time restore)!
-Monitoring dashboards
-Read replicas for improved read performance
-Multi-AZ setup for DR (disaster recovery) - Maintenance windows for upgrades
- Scaling capability (vertical & horizontal)
With RDS databases you can’t connect ________ to it
SSH
What are the two kinds of database technologies that aurora supports?
- PostgreSQL
- MySQL
Aurora is supposed to be _________ optimized to yield better performances
Cloud
Aurora storage grows automatically from __________________
From 10 gigabytes to 128 terabytes
__________ & ___________ are the two ways to create relational databases on AWS
RDS & Aurora (They are both managed & aurora is more cloud-native whereas RDS is going to be running on the technologies you know that is a managed service)
The __________ option for Amazon Aurora is where the database instantiation is going to be automated
Serverless (also has auto scaling based on your usage)
Both ________ & ___________ are supported as engines of aurora serverless database
PostgreSQL & MySQL
Aurora serverless is great for _____________ workloads
Infrequent/unpredictable workloads
If your see Aurora with no management overhead then think of ______________
Aurora serverless
___________ can scale the read workload of your database
RDS read replicas (can create up to 15 replicas & data is only written to the main database)
__________ is useful to have in case of AZ outage or main database has problems (high availability)
failover database (so its bascially multi AZ)
In the ___________ data is only read/written to the main database & can only have one other AZ as a ________
Failover
You can use read replicas in multi- regions & use you it for a ___________ in case of region issue & local performance improve, less latency but also has a replication cost
Disaster recovery
____________ is used to get managed Redis or Memcached databses
Elasticache