S2-m5 Flashcards

1
Q

The Data Life Cycle

A

Describes the sequential steps all business data must go through from creation, through its use, storage, and final disposal.
1. Definition
2. Capture
3. Preparation
4. Synthesis
5. Analytical and usage
6. Publication
7. Archival
8. Purging

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Definition

A

defining what data a business needs and where to capture or retrieve such data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Capture/Creation

A

Obtain the data by creating internally or capturing data from where it has been created externally

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Cleaning Data

A
  1. Remove unnecessary headings or subtotals
  2. Clean leading zeros and nonprintable characters
  3. Format negative numbers
  4. Identify and correct inconsistencies across data in general
  5. Address inconsistent data types
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Preparation

A

to determine whether the data is complete, clean, encrypted, and user friendly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Embanking Completeness and Integrity of Data

A

any time data is moved it is possible that some of the required data could have been lost. To validate the captured data:
1. Compare number of records that you intend to capture to the number of records in the source database
2. Compare descriptive statistics for numeric fields
3. Validate that field formats are consistent
4. Compare character limits for the attributes in the source file

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Data integration

A

when data is sourced externally, it is critical to design the data architecture to ensure that the data pipeline is integrated with the target location/database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Data Encryption

A

The sensitivity of data and the consideration if integrity would generally require encryptions both in data transit and data storage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Synthesis

A

a bridge between preparation and usage. once you have determined how you intend to use the captured data, you can create calculated fields to prepare that data for quicker usage and analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Analytics and Usage

A

focuses on the data being useful to the internal company-not being shared with external users

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Publication

A

sending monthly statements to clients, publishing financial statements, and sending quotes to customers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Archival

A

data sets are moved from active systems to passive systems for archiving to free up storage resources for the active systems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Purging

A

the end of the life cycle occurs when the data is completely removed from the company’s storage system

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Extract, Transform, and Load

A

When data already exists, whether that data is internal or external, the data must be extracted from its original source, transformed into useful information, and loaded into the tool you choose to use for analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Active Data Collection

A

when you directly ask your users for data. This can occur from survey or interview results as well as forms gathering personal information as users emails, phone #

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Passive Data Collection

A

gather information without direct permission from their users through tracking web sage via cookies or gathering time stamps of when users interact with website

13
Q

What are data complexities when obtaining data from an external source?

A

Integrity, safety, and copyrights, are three complexities to consider when obtaining data from an external source

14
Q
A