WEEK 10 Flashcards

1
Q

IN WHAT WAY IS A DW SUBJECT OREIENTED

A

it is roganized by a fact table and dimensional table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

which system is for daily oeprations & has high frequency of update operaitons

A

OLTP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

where is data not updated

A

olap

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is the osurce of data for a depementa data mart

A

EDW

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

WHICH Of thefollowing can be relational db

A

ods, edw, data mart, operational db

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is a logical dm composed of

A

views

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

which of the following is typically created for a specific project and can be rmouved if no longer eneeded

A

data mart

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is data called one it is cleaned and loaded into EDW

A

RECONCLIED DDATA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

charactieristcis of big data: 5 V’s

A

volume: much larger quantitiy of data than typical for relaitonal db

variety: lots of differnt data types and formats

velocity: data comes at very fast rate! (mobile sensors, web click stream)

veracity: trad data quality methods dont apply, how to judge accuracy and relevance?

value: big data is menaingless if it does not provide value toward some meanignufl goal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

schema on read vs schema on write

A

schema on read (USING DATA FOR BANA)
- data model determiner later, depends on how you want to use it
- caputre and store the data and worry about how you want ot use it late
- DATA MARTS ARE LIKE THIS! you create scehma when you are doing a rpoject

schema on write (STORING DATA)
- preexsiting data model
-this is how tradiitonal db are designed (relaitonal db)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

data lake

A

a large integraated repositoray for internal and external data that does not follow a prediefned schema

capture everything ,dive in anywhere, flexbile access, use ai to pulldata that ou ened

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

trad database design: schema on write

A

gather requrements and strcture > format data model > database schema > db use based on the predefined schemaa

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

big data appraoch: schema on read

A

collect large amts of data witg locally defined structures (JSON/XML…) > store data on lake > analyze store data to identify ways to structure> structure or org data during analysis process

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

NoSQL

A

not only swl

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what does noSql mean

A

caetgory of recently intrduced data stroe and retrevial rech not based on the rleiatonal model

SCALING OUT rather than SCALING UP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

NOSQL CHARCATERISITCS

A

NATURAL FOr cloud environeemnt (sacling out)

suports schema on read (big data is happy)

largely open source

not ACID complaint (atomicity, conssitency, isolations and durability ) this is transaction processing

BASE- bascially available, soft state, eventually consistent

17
Q

NoSQL: key value stores

A

this is a simple pair of a key associated colelction of values

key is usually a string! db has no knowledgee of hte sturcture or meaning of the values

REDIS

18
Q

Document stroes NOSQL

A

like a key-value stroe, but documetn goes further than value.

Doc is structued so specific elecments can be manipuated separelty

MONGODB

19
Q

Wide colymn stores NOSQL

A

rows and columns

distribution of data based on both key values (records) and columns, using column groups/families; key is two dimensional

APACHE CASSANDRA

20
Q

Graph oriented db NOSQL

A

maintain ingo regarding the relations between data items! nodes with properties

conenceiotns betwen nodes/ relationships can also have proeprties

Neo4j

21
Q

mongo db

A

document-store db

BSON based storage (binary json)

collections: equivalent to tables in relational db, set of docs intended to be stored togehter

documents :equivalent to rows in relational db, docs do not need to have the same structure (unliek rows), _id property fo runiquely identify a row

relationships: _id property serves as primary key, another doc can have a foreing key as anotehr JSON property

22
Q

MOngo db what is table and what is row

A

collection=table
document =row

23
Q

ask if mongo db will on final, if yes then from 2:30 onwards:

A

https://ucalgary.zoom.us/rec/play/w-ozhS2JaZaj3_CT0YRHmU70ROoJCYw8aSQNwS5lEft-EiBnhPEhojS-Uk9cxqV5BkH7ZijL-hmJRIDI._rRt0Z1vyrQzztMe

24
Q

what is a cluster on mongo db

A

cluster is the server

25
Q

WHAT ARE JSON FILES!!

A
26
Q
A