D1. Data Basics Flashcards

1
Q

Properties of categorical data

A

Qualitative: often represented using binary indicators (e.g. 1=employed, 0=unemployed).

Ordinal: order is meaningful but differences/ratios are not (e.g. 0=poor, 1=fair, 2=good).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Properties of numerical data

A

Value is cardinally meaningful.

Discrete: number of individuals in the household.

Continuous: height, age, etc. Many variables in reality are treated as continuous even if they are discrete (quantity, wage).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define support

A

The set of values which a variable can take.

Discrete: usually an integer

Continuous: any real number (often within an interval).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is multivariate data?

A

Measuring multiple variables at once: multivariate variables (e.g. height, eye colour, weight or 2017, 2018, 2019).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is cross-sectional data?

A

Data on one attribute measured in N cases (e.g. people).

{X1, X2, …, XN} = {Xi}i = 1, 2, …, N

^ We ‘lay them out’ up and down on a real number line

If two attributes are measured, 2D number line.

If three, 3D number line.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is time-series data?

A

Series of data points indexed in time order.

{X1, X2, …, XT} = {Xt}t = 1, 2, …, T

Time is an implicit second attribute

Series is a sequence of bivariate observations {time, value}

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What sigma notations do we use for addition and multiplication?

A

Σ = add, Π = multiply

How well did you know this?
1
Not at all
2
3
4
5
Perfectly