Midterm Flashcards

1
Q

Descriptive statistics

A

just a way to describe the data - charts and graphs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Inferential statistics

A

allows you to make predictions from the data

  • raw data put through statistical tests to come up with a conclusion about a population
  • allows generalizations from a sample group
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

nominal level of measurement

A

named categories

-yes or no, race, gender, country, ethnicity, hair color

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

ordinal level of measurement

A

categorical data that’s ranked or ordered - has innate order

example: good, fair, poor; strongly agree to strongly disagree
* Weight could also be ordinal like boxes with >50kg, <50kg, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

interval level of measurement

A

equal distance b/w each value

Difference b/w 30 and 35 degrees Celsius is the same measurement as b/w 40-45 degrees Celsius

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

ratio level of measurement

A

same attributes as interval measurement, but has absolute 0 and no negative values

*example: length in cm - can’t have negative cm measurement

 Place of absolute zero and no negative values
 Ex. Length in cm – can have zero but no negative
 Interval and ratio can be difficult to distinguish b/w (she will not make us do this)
 Weight could be ordinal and ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Data classified as discrete

A

 Can take on one value out of a limited number of options
• Number of kids (1,2,3, etc. but can’t have 1.2)
• Heart rate, number of pregnancies, number of hospital admissions, number of students in a class, shoe sizes, number of questions answered correctly

 Dichotomous – a specific discrete variable where there are only two values
• Gender (M or F) Limited number of options (only 2 options)

Under 65 or over 65 would also be dichotomous discrete and ordinal…wouldn’t be interval or ratio because there is not an equal distance b/w each

Yes or no (dichotomous, nominal)

*all dichotomous data is nominal?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Data classified as continuous

A

 Can take on any number w/I a range
 Only limited by precision of measurement tool used
 Look at height – only cm marks 158cm tall, 159 cm tall  but height really could be more precise
o Some tests you can only use discrete or continuous – be able to pick this out in a study

*ex. standiometer can measure height to the 1/10 or the 1/100

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

central tendency

measured with what level of data?

A

gives you typical value (average), and three ways you can determine this are mean, median, and mode. Gives us a point value, one number that represents the whole data set.

Are measured in interval and ratio level data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Mean can only be calculated with?

A

mean is the average - doesn’t work with categories, and can only be calculated with interval or ratio level data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Median can be calculated with?

A

middle

calculated with ordinal, interval, ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Mode can be calculated with?

A

any type of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Dispersion/variability

A

how closely the numbers cluster around the mean, median, and mode

aka variance from the standard score - the range and spread of the data from the center

Gives information about the spread of scores and indicates how well a measure of central tendency represents the “middle/average” value in the data set

So, you will often see a median reported and then a standard deviation (so this is using central tendency & dispersion).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Variance, Standard deviation - Under DISPERSION

What level of data do you typically see these with?

What does a small SD tell you?

A

 Variance and standard deviation you will often see with interval or ratio level data. Don’t need to be able to calculate a standard deviation. She wants us to be able to look at one and figure out what it is telling us. So, if you have an article that tells you the standard deviation of something and it’s really small – that tells you that all the data points cluster closely together, and they are all close to the mean or the median.
 Variance and standard deviation are related. The square root of variance is standard deviation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Variance

A

the average difference b/w the data values and mean of a data set

*the average degree to which each point differs from the mean – the average of all data points

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

SD

A

standard deviation = the avg amount that data values will vary from the mean - how closely values are clustered

*example: SD small - data is close together and variance is small, if SD is large then there are more variables in the data so it’s more spread out

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Range

A

 Very simple measure of dispersion
 Calculate by taking the max value in the data set and subtract from the minimum value = range. Smaller number you have for a range the closer the data set is, and the more clustered and less variable.

Ex: 9,3,2,6,7,8,7,5 so 9-2 = 7 = RANGE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Interquartile range

A

difference b/w the 75th percentile and the 25th percentile

ex. 11222333445 mean is 3, 25% is 2 and 75% is 4 so, 4-2 = 2 which is the interquartile range

The range and interquartile range proves a rough estimate of the variability of a data set but doesn’t use all of the data values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Frequency distribution - curtosis of curve leptokurtic

A

thin - peaked curve

shows what continuous data looks like

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Frequency distribution - curtosis of curve mesokurtic

A

more normal curve

*shows what continuous data looks like

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Frequency distribution - curtosis of curve platykurtic

A

flat curve

*shows what continuous data looks like

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Describe the normal attributes of a normal distribution - bell curve

A

frequency distribution of data in which the data values are equally distributed around the center of the data point; normal bell curve; mean, median, and mode equal; symmetrical-not skewed

68% - within 1 SD of mean
95% - within 2 SD of mean
99% - within 3 SD of mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Kurtosis

A

measure of how peaked/flat a distribution is

24
Q

Skewness

A

measure of whether the set is symmetrical or off center

25
Q

Distribution is said to be normal when both measures of skewness and kurtosis fall b/w

A

-1 and 1

and, not normal if fall below -1 or above 1

26
Q

What does the level of significance mean?

When does researcher determine this value? What type of error is this?

A

researcher determines level before collecting data

% of the time that researcher will conclude that there is a statistically significant difference b/w groups or relationship b/w variables when there truly isn’t

Type I error

  • aka the probability of incorrectly rejecting the null hypothesis
  • level of significance (a)
27
Q

type II error

A

probability of concluding there is no difference b/w groups or NO relationship b/w variables when there truly is – probability of accepting the null when it is not true

*fail to reject null hypothesis by error

28
Q

P-value definition

A

probability that the difference, or one larger found, could arise by chance

*the difference b/w the two groups was statistically significant (p= 0.02) since it was less than 0.05 – reject the null hypothesis b/c there is a difference

29
Q

If the p value is less than level of sig…

A

reject the null b/c there IS a difference

30
Q

If the p value is greater than level of sig…

A

fail to reject null (accept the null) b/s there is NO difference

31
Q

When is an independent t-test appropriate to use? What assumptions must be met?

A

when you are comparing the means of two independent groups of subjects

Assumptions: independence, normality, homogeneity of variances, DV level of measurement interval or ratio

32
Q

In an SPSS output what does N stand for?

A

the sample size

33
Q

With Levene’s test for equality of variances - looking at homogeneity of variance…what do we want? Where is the P value for this test listed?

A

Want to fail to reject the null hypothesis, so you want P value to be greater than 0.05

Listed under “SIG”

Remember that if homogeneity of variance is not met, you can’t use a t-test

For t-test always use the line equal variances assumed*

34
Q

Use what formula to report t-test findings

A

t(df) = t, test p -value

t(22) = 2.225, p = 0.037

35
Q

Degrees of freedom - df = what?

A

you would add up your total number of people (12) + (12) = 24 and then subtract # 0f groups (2) so 24-2= 22 is your t value

36
Q

M & SD come from which chart t-test?

A

group statistics chart

37
Q

when is a paired t-test appropriate to use? what is a paired t test also called?

A

when you want to compare the means of two paired groups of subjects

ex. pre/post test, twin studies, husband/wives

you can’t assume independence b/c not two independent groups - somehow groups are connected*

don’t have to worry about homogeneity of variance for this reason

also called a t test for dependent groups**

38
Q

assumptions needed for t-test for paired data?

A

normality, DV (DEPENDENT VARIABLE) level of measurement interval/ratio

39
Q

The steps of hypothesis testing - 6 steps

A

1- develop null and research hypothesis
2- choose level of sig
3- determine which statistical test is appropriate
4- run analysis to obtain test statistic and p value
5- make decision about rejecting or failing to reject the null hypothesis
6- make a conclusion

40
Q

T test for paired data hypothesis - there will be a difference in IQ b/w the preschool and home group. Is this directional or non-directional?

A

non-directional, so 2 tailed test

41
Q

t - test results for paired data - here you can find

A

mean difference in scores b/w preschool and no preschool…see slide page 3, mod 2 session 4

42
Q

df for paired t test =

A

df = total # of pairs -1

43
Q

When is an ANOVA appropriate to use?

What assumptions must be met?

A

when comparing the means of 3 or more independent groups of subjects

*same as independent t test:
independence of groups, normality of data, homogeneity of variances, DV level of measurement must be interval/ratio

ex. 1 group - printed d/c instructions only; 1 group - verbal d/c instructions only; 1 group - printed and verbal

44
Q

After ANOVA if significance is found (reject the null) then what has to be done?

A

post hoc testing

45
Q

ANOVA degrees of freedom

n=? N=? k =?

A

k = # of groups

n = number in each group, but ONLY if all group sizes are equal and if they’re not you use big N which is the total sample size

N = total sample size

46
Q

df(between) =

47
Q

df(within) =

A

nk-k or N-k

48
Q

df(total) =

A

nk-1 or N-1

49
Q

Show results of ANOVA as

A

F (between, within) = F#, p-value

F(2,51) = 13.630, p < 0.000 (you can’t have p=0.000 so it has to be p<0.000

50
Q

F in ANOVA is

A

the F-statistic is this ratio: F = variation between sample means / variation within the samples.

51
Q

In ANOVA how can you find the total number of people in the study

A

total df + 1

52
Q

Post hoc test

A

if you find significance in ANOVA then must do post hoc to compare all the groups for significance to see which one is different

53
Q

when do you use a two-way ANOVA?

A

compare effects of more than one independent variable on a dependent variable

54
Q

two-way ANOVA assumptions

A

1-independence
2-normality
3-homogeneity of variance
4-DV interval/ratio

55
Q

Two-way ANOVA looks for what effects?

A
  • what is the main effect of IV A on the DV?
  • what is the main effect of IV B on the DV?
  • SO, post hoc testing would be used if we found a statistical sig when examining these two variables…

So, interaction effect…
-what is the interaction effect of IV A and IV B on the DV?

56
Q

Two-way ANOVA test b/w subjects interpreted as?

A

F(df, error#) = F, p-value