Lecture 4 Flashcards

1
Q

Data Wrangling

A

The process of cleaning, structuring, and transforming raw data into a format suitable for analysis.

2
Q

Aspects of Data Wrangling
(2CTIR)

A
  1. Data Collection
  2. Data Cleaning
  3. Data Transformation
  4. Data Integration
  5. Data Reduction
3
Q

Data Collection

A

The process of gathering data from various sources.

4
Q

Data Cleaning

A

Address missing values, duplicates, and inconsistencies in the dataset.
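
A minimal sketch of the three cleaning tasks named above, in plain Python with no libraries. The records, the field names (`name`, `city`, `age`), and the choice to fill a missing age with the mean are illustrative assumptions, not part of the lecture.

```python
# Hypothetical records; field names and values are made up for illustration.
rows = [
    {"name": "Ana", "city": "Lisbon ", "age": 31},
    {"name": "Ben", "city": "lisbon", "age": None},   # missing value, inconsistent city
    {"name": "Ana", "city": "Lisbon ", "age": 31},    # exact duplicate of the first row
]


def clean(records):
    """Fill missing ages, normalize city spelling, and drop duplicate rows."""
    # Mean of the ages that are present, used to fill the gaps (an assumed strategy).
    known = [r["age"] for r in records if r["age"] is not None]
    mean_age = sum(known) / len(known)

    cleaned, seen = [], set()
    for r in records:
        fixed = {
            "name": r["name"],
            "city": r["city"].strip().title(),  # resolve whitespace/case inconsistencies
            "age": r["age"] if r["age"] is not None else mean_age,
        }
        key = tuple(fixed.items())  # hashable fingerprint of the cleaned row
        if key not in seen:         # keep only the first copy of each row
            seen.add(key)
            cleaned.append(fixed)
    return cleaned


cleaned_rows = clean(rows)
```

Filling with the mean is only one option; dropping incomplete rows or using the median are equally common choices depending on the data.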

5
Q

Data Transformation

A

Convert data types, handle categorical variables, and normalize numerical features.
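
The three steps in this answer can be sketched in plain Python as follows. The column names (`height_cm`, `color`) and the specific techniques (min-max scaling, one-hot encoding) are assumptions chosen for illustration; the lecture does not say which methods it has in mind.

```python
# Hypothetical raw records: numbers stored as strings, one categorical column.
rows = [
    {"height_cm": "150", "color": "red"},
    {"height_cm": "175", "color": "blue"},
    {"height_cm": "200", "color": "red"},
]


def transform(records):
    """Convert types, one-hot encode `color`, and min-max scale `height_cm`."""
    heights = [float(r["height_cm"]) for r in records]  # type conversion: str -> float
    lo, hi = min(heights), max(heights)
    categories = sorted({r["color"] for r in records})  # stable column order

    out = []
    for r, h in zip(records, heights):
        # Min-max normalization to [0, 1]; assumes hi > lo.
        row = {"height_scaled": (h - lo) / (hi - lo)}
        # One-hot encoding: one 0/1 column per category.
        for c in categories:
            row[f"color_{c}"] = 1 if r["color"] == c else 0
        out.append(row)
    return out


transformed = transform(rows)
```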

6
Q

Data Integration

A

Combine data from multiple sources or tables, if necessary.
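
One common form of integration is joining two tables on a shared key. The sketch below is a plain-Python inner join; the tables (`customers`, `orders`), the key `customer_id`, and the helper `inner_join` are hypothetical names used only for illustration.

```python
# Two hypothetical tables that share the key `customer_id`.
customers = [
    {"customer_id": 1, "name": "Ana"},
    {"customer_id": 2, "name": "Ben"},
]
orders = [
    {"order_id": 10, "customer_id": 1, "total": 25.0},
    {"order_id": 11, "customer_id": 1, "total": 40.0},
    {"order_id": 12, "customer_id": 3, "total": 5.0},  # no matching customer
]


def inner_join(left, right, key):
    """Return one merged row per (left, right) pair that agrees on `key`."""
    # Index the left table by key so each right row is matched in one lookup.
    index = {}
    for row in left:
        index.setdefault(row[key], []).append(row)
    # Rows of `right` with no match in `left` are dropped (inner-join semantics).
    return [{**l, **r} for r in right for l in index.get(r[key], [])]


joined = inner_join(customers, orders, "customer_id")
```

Here order 12 and customer 2 both disappear from the result because neither has a partner in the other table; an outer join would keep them with gaps instead.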

7
Q

Data Reduction

A

Reduce the dataset’s dimensionality through techniques like feature selection or extraction.
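
As one simple instance of feature selection, the sketch below drops columns whose variance does not exceed a threshold, since a constant column carries no information. The data, the column names, and the helper `select_by_variance` are assumptions for illustration; feature extraction methods such as PCA are a different technique and are not shown.

```python
from statistics import pvariance

# Hypothetical numeric dataset; column "b" is constant.
rows = [
    {"a": 1.0, "b": 5.0, "c": 0.0},
    {"a": 2.0, "b": 5.0, "c": 10.0},
    {"a": 3.0, "b": 5.0, "c": 20.0},
]


def select_by_variance(records, threshold=0.0):
    """Keep only the columns whose population variance exceeds `threshold`."""
    keep = [
        col
        for col in records[0]
        if pvariance([r[col] for r in records]) > threshold
    ]
    # Rebuild each row with the surviving columns only.
    return [{col: r[col] for col in keep} for r in records]


reduced = select_by_variance(rows)
```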

8
Q

Data Wrangling vs. Data Cleaning

A

Data Wrangling: the broader process of collecting, cleaning, and transforming raw data into a format suitable for analysis.

Data Cleaning: a subset of data wrangling that handles missing values, duplicates, and inconsistencies in the dataset.
