5 Vs of big data Flashcards
https://explore.skillbuilder.aws/learn/course/19747/data-engineering-on-aws-foundations;lp=2195
Which big data characteristic deals with having accurate, precise and trusted data?
Veracity
Which big data characteristic deals with the speed in whcih data is generated, dist, and collected.
Veracity
Which big data chracteristic deals with speed in which data is generated, distributed and collected?
Velocity
type of volume data sources used ofr ingestion and storage which includes the following: Customer inforamation, Online product purchases and services contracts
Transactional data
Type of volume data sources for ingestion and storage like Internet browser cache
Temporary Data
Type of volume data sources for ingestion and storage like images, text messages, email messages
objects
Data stored as documents or key-value pairs
semi-structured
Data stored as files
unstructured
Data stored in tables
structured
a PDF is type of what data - unstructured, structured or semi-structured
unstructured.
Structured data are stored within ______
RDBMS - like SQL. Goal is optimized storage
fixed schema is a type of weakness for which database management system
RDBMS - need to consider data types used. Need to consider storage, hardware capabilities. There can be issues with storing unstructured data.
Good transactional latency is a Strength for which database management technology
RDBMS
type of non relational database that stores in a single table. The values are associated with a specific key.
Key-value pair
Database type built to store semi structure data for rapid retrieval and collection.
Non-relational database - like NoSQL often storing data as documents or key-value pairs.
The ability to link keys directly with value without having to index or join. This is a strength for which Database type is this?
Non_relational, Key-value pair
Being able to Query values stored in a single blob is weakness for which non relational database type
Key value pair.
Which non relational database type stores semi structured data in the form of files. Like CSV, PDF
Document stores
Data not being instantly updated is a weakness of which Database type - Relational or Non Relational
Non-Relational NoSQL Databases. Has low transactional latency.
OLAP columar data is stored on by
row by row or column by column
Column by column
Comparing OLTP with OLAP Which is better at sequential reads and writes - columnar or row based
columnar
Comparing OLAP with OLTP which database type does this key characteristic cover: Collection of documents single table with keys and values
Colunmar
COmparing OLTP with OLAP which database type handles horizontal scaling.
Colunmar
Which AWS RDBMS services provides serverless, scalable HA for MYSQL and Postgres?
Aurora