Data And Data Preparation Flashcards

1
Q

What are descriptive statistics?

A

The summer of important aspects of a data set 

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Set unemployment rate in the Dow Jones industrial average, or example of what statistical branch

A

Descriptive statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the branch of statistic that draws conclusions from a sample of data called?

A

Inferential statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

When is it appropriate to use cross-sectional data?

A

When the time of measurement doesn’t matter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

When is it appropriate to use timeseries data?

A

When the timing matters, and you only have one thing to measure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the difference between structured an unstructured data?

A

Structure data here is like in the database. They have rows and columns, while unstructured data is just a bunch of data. Apparently like 80% of data is unstructured nowadays. This is crazy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the three characteristics of big data?

A

Volume velocity, and variety

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How can qualitative and quantitative data be described?

A

As is categorical and numerical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the difference between discrete and continuous variables?

A

Continuous variables can be anything while discrete variables have a limited selection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Which measurement scales are used for categorical variables

A

Nominal and ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Which measurement scales are used for numerical variables

A

Interval and ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does it mean that interval, scale variables don’t have a meaningful zero

A

That the zero does not represent an absence of what’s being measured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How can you handle missing values in a dataset

A

Either by the omission strategy, removing the unit entirely or by the imputation strategy, replacing the missing value with the average, or some other relevant variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is subsetting?

A

To extract a relevant portion of the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are some ways of preparing data?

A

Counting sorting and subsetting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is NPS

A

Net prompter score which shows air how likely people are to recommend this service