General Flashcards

1
Q

What is Z Score

A

used for data that is of normal distribution, just tells you how many standard distributions away from the mean a point is

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Z distribution

A

this is a normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Standard Deviation

A

The average deviation from the mean in a distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Variance

A

The Standard Deviation, Squared

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Interactively edit a data frame

A

fix() or edit()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

See data type of object

A

class()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

See Data type of column

A

you can use class(Dataframe$columnname)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Return subset of dataframe that meets certain conditions

A

subset(Dataframe, Dataframe[, 2] == ‘Hamel’)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the kurtosis and skew of normal distribution

A

kurtosis = 3, skew = 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Interpret Kurtosis

A

if kurtosis > 3 then has a sharper peak

if kurtosis < 3 then has flatter peak

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Interpret Skew

A

If skewness is less than −1 or greater than +1, the distribution is highly skewed.
If skewness is between −1 and −½ or between +½ and +1, the distribution is moderately skewed.
If skewness is between −½ and +½, the distribution is approximately symmetric.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Different type of correlation measurements

A
  1. Pearson: when x & y are continous
  2. Point bi-serial: when 1 var is continous, 1 is dichotomous
  3. Phi coefficient: when both vars are dichotomous
  4. Spearman: when both vars are ordinal (ranked data)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Homoscedasticity

A

Means the residual is not related to the variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Assumptions when interpreting the Pearson correlation coefficient

A
  1. Normal distribution for x and y
  2. Linear relatinoship
  3. Homoskedasticity
  4. Relaibility
  5. Validity
  6. Random Sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is reliability

A

how closely does your sample reflect the true population?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Ways to measure reliability

A

1) Parallel tests
2) Inter-item tests
3)