Data to create and preserve value for organisations Flashcards

1
Q

Data Engineering

A

Practice of designing and building systems for collecting, storing and analysing large sets of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Extraction, Transformation & Loading (ETL)

A

Three stages in blending data from multiple sources into a destination system, for example a data warehouse.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Extraction

A

Process of harvesting data from source databases. Prior to extraction the data needs to be analysed to understand its content, format and structure - Data Profiling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Transformation

A

Takes extracted data and changes it into a format suitable for destination database and ultimate intended use. Done using code and rules designed to interrogate the source data before converting it to a new format as per the code instructions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Loading

A

When the newly cleaned and prepared data is uploaded into the destination database ready for use.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Data Warehouse

A

Store for data that has been loaded into the ETL process. Data will be held in a systematic and logical way ready for further interrogation and analysis by the business intelligence function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Business Intelligence (BI)

A

Technology driven process of analysing business data to create insightful and actionable information to help improve the operations or products of a business.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Data Mining

A

Important component of BI. The process of uncovering patterns and other valuable information from large sets of data in the data warehouse.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Challenges of ETL

A

Rate of growth
Types and Sources
New Technologies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Data Model

A

Considers data of an organisation in a systematic way. Allows it to be stored and retrieved in an efficient and effective manner.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Advantages of Data Modelling

A

Foundation for handling data
Enforces business rules and helps achieve compliance
Consistency
Quality of data is enhanced

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Three levels of a data modelling process

A

Conceptual - Business oriented and practical, considering the business data and its requirements.

Logical - Begins to develop a technical map of rules and data structures, defining how data will be held and used.

Physical - Considers how defined system requirements will be implemented using a specific database management system (DBMS).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Data Manipulation

A

Process of changing data to make it easier to read.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Data Analysis

A

Process of examining, transforming and arranging a given data set in specific ways in order to study its individual parts and extract useful information.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Data Strategy

A

A coherent approach for organising, governing, analysing and deploying an organisation’s information assets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

The 4 V’s of Big Data

A

Volume
Velocity
Variety
Veracity

17
Q

Data Scientists

A

Individuals with the ability to extract meaning from and interpret data, which requires both tools and methods from statistics and machine learning.