Working With Data Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

Data Types

Which data categories exist, and what are the differences?

A

Non-Parametric:
- Categorical or discrete data; can be names, ID’s. They do not represent numerical values.

Parametric:
- Numerical data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Data Types

Which category is the Data Type ‘Nominal’ in and what is it?

A

Category: Non-Parametric

  • Nominal: Simple Names or labels; e.g. car brand, gender.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Data Types

Which category is the Data Type ‘Ordinal’ in and what is it?

A

Category: Non-Parametric

Ordinal: Ordered data; e.g Ranks in competition or military, first, second, third; level of education, high school, AP degree, bachelor’s degree.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Data Types

Which category is the Data Type ‘Interval’ in and what is it?

A

Category: Parametric

Numerical data with even interval

Interval: Numerical data representing real values evenly spaced or continuous.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Data Types

Which category is the Data Type ‘Ratio’ in and what is it?

A

Category: Parametric

As interval data, but with a true zero representation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Taxonomies of Data

Structured Data

A

Data in well-defined structures and atomic elements.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Taxonomies of Data

Unstructured Data

A

Data that doesn’t follow a strict structure and is hard to represent in tables and columns, e.g. text, images, and video.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is ‘Categorical Data’?

A

Same as Non-Parametric data.

  • Labels and categorical data: Simple names and labels such as gender, high
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Taxonomies of Data

Quantitative Data

A

Same as Parametric

Interval data, representing real values that are evenly spaced. e.g. 1 is half of 2 and 4 is double of what 2 is,.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What do we use NumPy for

A

Python module for high-performance nummerical computation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What do we use Pandas for

A

Python module for high-performance data processing.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly