Lecture 4 Flashcards

Summarizing Data Collected in the Sample

1
Q

What are the 4 types of variables?

A
  • dichotomous
  • ordinal
  • categorical
  • continuous
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two pieces of information in every frequency distribution table for any type of variable?

A

frequency and relative frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How do you calculate relative frequency (%)?

A

RF = (frequency / total # of responses) x 100%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What two additional pieces of data is calculated for ordinal variables?

A

cumulative and cumulative relative frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the two types of measurements of continuous variables?

A

central tendencies and variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the three central tendency measurements?

A

mean, median, and mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the four variability measurements?

A

range, IQR, variance, and standard deviation (SD)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

If data is missing, is that individual still counted in the total # of persons?

A

No, only participants who provided a value are accounted for in the denominator

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is mean denoted as?

A

x-bar

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the advantages to median?

A
  • not impacted by outliers
  • good index of what is “typical” if distribution is skewed
  • some categorical data can have the median applied
  • good for ordinal data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the main disadvantage of median?

A

does NOT take the data values themself into account (only a position indicator)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the advantages of mode?

A
  • can be applied to any variable type (even nominal)
  • reflects and actual value from the data (easy to understand)
  • useful when there are multiple common values
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the disadvantages to mode?

A
  • ignores majority of other info regrading distribution
  • tends to vary between samples
  • some samples do not have a mode
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

In a normal distribution, where are all the central tendencies found?

A

in the center of the bell-curve

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

In skewed distributions, where are the central tendencies found?

A

positive skew (tail to right)- mode > median > mean
negative skew (tail to left) - mean < median < mode
- in both the mean is what is off centered towards the tail end

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Where are the outliers found in skewed curves?

A

on the lower tail ends

17
Q

What are the two types of variability?

A

heterogeneous (high variability)
homogeneous (low variability)

18
Q

What percentage of the variation does range represent?

A

100%

19
Q

What is the advantage to range?

A

it’s easily understood

20
Q

What are the disadvantages to range?

A
  • only dependent on 2 scores (not all info taken into account)
  • sensitive to outliers
  • tends to vary between samples
  • influenced by sample size
21
Q

What percentage of the variation does the IQR represent?

A

50%

22
Q

What are the advantages to the IQR?

A
  • reduced influence of outliers (only the middle 50% is looked at)
  • uses more information than range
  • helpful for evaluating/identifying outliers
  • appropriate for ordinal measures
23
Q

What are the disadvantages of the IQR?

A
  • not easy to compute
  • not well understood
  • doesn’t take all values into account
24
Q

What is deviance?

A

the spread of the scores (how different each score is from the center of a distribution)

25
Q

What is the deviance equation?

A

deviance = xᵢ - X̄

26
Q

What is variance?

A

indicates the total dispersion of scores from the MEAN

27
Q

What is the variance equation?

A

variance = (sum of (devience)^2) / N - 1

28
Q

What needs to be done to the variance in order to obtain the standard deviation?

A

square root the value

29
Q

What is the interpretation of the standard deviation?

A
  • high result = values are FAR off from the mean
  • low results = values are not that far off (close to the mean)
30
Q

In a normal distribution, past what point are value considered outliers?

A

anything beyond +/- 1.96 standard deviations

31
Q

When outliers are NOT present, which central tendency and variability values are appropriate?

A

sample mean and the standard deviation

32
Q

When outliers ARE present, which central tendency and variability values are appropriate?

A

median and interquartile range

33
Q

What is the equation for calculating the upper and lower limits to determine mild outliers?

A

lower limit = Q1 - (1.5IQR)
upper limit = Q3 - (1.5
IQR)

34
Q

What is the equation for calculating the upper and lower limits to determine extreme outliers?

A

lower limit = Q1 - (3IQR)
upper limit = Q3 - (3
IQR)

35
Q

What type of graph is used to represent dichotomous or categorical variables?

A

bar charts (frequencies and relative frequencies)

36
Q

What type of graph is used to represent ordinal variables?

A

histograms (frequencies, relative frequencies, cumulative frequencies, and cumulative relative frequencies)

37
Q

What type of graph is used to represent continuous variables?

A

boxplot (central tendencies and variabilities/ranges)