Overview of Data Repositories Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

Definition of Data Repository

A

-general term used to refer to data that has been collected, organized, and isolated
-isolate data, make reporting and analytics more efficient and credible, and serve as a data archive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Databases

A

collection of data (or information) designed for the input, storage, search and retrieval, and modification of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Database Management System (DBMS)

A

-set of programs that creates and maintains the database
-allows you to store, modify, and extract information from the database using a function called querying

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Different types of databases determined by (5)

A

-data type and structure
-querying mechanisms
-latency requirements
-transaction speeds
-intended use of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Relational Databases (RDBMSes) vs flat files

A

-data organized in tabular format (rows and columns) following a well-defined structure and schema (similar to flat files)
-RDBMSes are optimized for data operations and query involving many tables and much larger data volumes (unlike flat files)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Structured Query Language (SQL)

A

standard querying language for relational databases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Non-relational Databases (NoSQL or “Not only SQL”)

A

-used to process big data
-stores data in a schema-less or free-form fashion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

3 V’s of Big Data

A

Volume, Velocity, and Variety (aka scale, speed, and flexibility)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Big Data

A

-cloud computing
-Internet of Things (IoT)
-social media proliferation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Data Repository is comprised of

A

-a small or large database infrastructure with one or more databases that collect, manage, and store data sets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Data Repository uses

A

-used in business operations or mined for reporting and data analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Data Warehouse

A

works as a central repository that merges information coming from disparate sources and consolidates it through the ETL process into one comprehensive database for analytics and business intelligence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

ETL process

A

-the extract, transform, and load process
Extract data from different data sources
Transform data into a clean and usable state
Load data into enterprise’s data repository

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Data Warehousing and Data Marts were historically

A

relational as much of traditional enterprise data resided in RDBMes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

However, Data Warehousing and Data Marts are now

A

now include non-relational data repositories (with emergence of NoSQL technologies and new data sources)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Big Data Stores

A

-data repository that includes distributed computational and storage infrastructure to store, scale, and process very large data sets