WEEK 10 Flashcards
IN WHAT WAY IS A DW SUBJECT OREIENTED
it is roganized by a fact table and dimensional table
which system is for daily oeprations & has high frequency of update operaitons
OLTP
where is data not updated
olap
what is the osurce of data for a depementa data mart
EDW
WHICH Of thefollowing can be relational db
ods, edw, data mart, operational db
what is a logical dm composed of
views
which of the following is typically created for a specific project and can be rmouved if no longer eneeded
data mart
what is data called one it is cleaned and loaded into EDW
RECONCLIED DDATA
charactieristcis of big data: 5 V’s
volume: much larger quantitiy of data than typical for relaitonal db
variety: lots of differnt data types and formats
velocity: data comes at very fast rate! (mobile sensors, web click stream)
veracity: trad data quality methods dont apply, how to judge accuracy and relevance?
value: big data is menaingless if it does not provide value toward some meanignufl goal
schema on read vs schema on write
schema on read (USING DATA FOR BANA)
- data model determiner later, depends on how you want to use it
- caputre and store the data and worry about how you want ot use it late
- DATA MARTS ARE LIKE THIS! you create scehma when you are doing a rpoject
schema on write (STORING DATA)
- preexsiting data model
-this is how tradiitonal db are designed (relaitonal db)
data lake
a large integraated repositoray for internal and external data that does not follow a prediefned schema
capture everything ,dive in anywhere, flexbile access, use ai to pulldata that ou ened
trad database design: schema on write
gather requrements and strcture > format data model > database schema > db use based on the predefined schemaa
big data appraoch: schema on read
collect large amts of data witg locally defined structures (JSON/XML…) > store data on lake > analyze store data to identify ways to structure> structure or org data during analysis process
NoSQL
not only swl
what does noSql mean
caetgory of recently intrduced data stroe and retrevial rech not based on the rleiatonal model
SCALING OUT rather than SCALING UP