Exam 3 Flashcards

Descriptive Statistics, Frequency Distributions and Frequency Distribution Tables, Measures of Central Tendency, and Measures of Variability.

1
Q

Frequency Distribution

A

Is an organized tabulation of the number of individuals in each category on the scale of measurement.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Absolute Frequency (f)

A

The number of participants that fall in each category.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Relative Frequency (rf)

A

The proportion of participants that fall in each category.

-f/N where N=total number of scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Percent

A

rf*100

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Cumulative Frequency (cf)

A

The number of people that score AT or LOWER than a given score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Cumulative Relative Frequency (crf)

A
  • Use rf column

- Proportion of people that scored AT or BELOW a given score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Cumulative Percent

A

crf*100

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Real Limits and Frequency Distributions

A

Begin 1 unit below lowest score and end 1 unit above highest score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Frequency Graphs

A
  • Grouped or Ungrouped (interval) Scores

- Used when you have discrete variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Frequency Histogram

A

No spaces or gaps between bars.

-Used when the data consist of numerical scores that have been measured on an interval or ratio scale.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Frequency Polygon

A

Dots connected by a continuous line that begins and ends on the x-axis.
-Used when the data consist of numerical scores from an interval or ratio scale.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Bar Graph

A

Much like a histogram, except there are spaces between the bars.
-Used with nominal and ordinal scales.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Symmetrical Distribution

A

It is possible to draw a line down the middle so that one side of the distribution is a mirror image of the other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Skewed Distribution

A

The scores tend to pile up toward one end of the scale and taper off gradually at the other end.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Positively Skewed Distribution

A

When the scores pile up on the left side of the distribution. The tail points toward the positive end of the x-axis (right).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Negatively Skewed Distribution

A

When the scores pile up on the right side of the distribution. The tail points toward the negative end of the x-axis (left).

17
Q

Line Plot

A

Same as polygon, but left and rightmost points are NOT closed at the abscissa.

18
Q

Stem and Leaf

A

Obtains the raw data in the display.

19
Q

Tukey’s Tallies

A

Does not maintain raw data.

20
Q

Heuristic for Grouping Scores

A

1) Determine the number of groups
- 5-15
2) Determine the size (width) of the interval (2,3, or some multiple of 5)
- Highest score-Lowest score
- Divide by number of groups
- Round to nearest commonly used interval size
3) Determine the beginning lowest interval (should be a multiple of width)

21
Q

Measures of Central Tendency

A

Mode, Median, Mean

22
Q

Mode

A

Most commonly occurring score.
Advantages: score that actually occurs in the data set, unaffected by extreme scores, represents most common observation
Disadvantage: may not be representative of entire distribution of scores

23
Q

Median

A

Divides the distribution into equal halves.
(N+1)/2=Median Location
Advantages: takes into account all the data in the distribution, unaffected by extreme scores (use when distributions are skewed), use if have missing values in data set, open-ended distribution, use with an ordinal scale
Disadvantages: value may not exist in the data, does not enter readily into equations and more difficult to work with, treats all scores alike; differences in magnitude not taken into account

24
Q

Mean

A

Arithmetic average of scores.
Advantages: representative of every score in the distribution, closely related to variance and standard deviation
Disadvantages: affected by extreme scores or “outliers” when you have only a limited number of scores in your distribution, value may not exist in data

25
Q

Central Tendency

A

An attempt to find a single score that represents the center of a distribution. Strives to find a number representative of the whole distribution.

26
Q

Variability

A

A qualitative measurement of the differences between scores and the degree to which they are different. Are they spread out or clustered?

27
Q

Biased Variance

A

An average value of the statistic that is equal to the population parameter.

28
Q

Unbiased Variance

A

An average value of the statistic that either over or underestimates the corresponding population parameter.

29
Q

Sum of Squares

A

How each score varies from the mean.
Advantage: takes into account ALL the scores in a distribution.
Disadvantage: size of SS depends on the amount of variability and is influenced by N.

30
Q

Population Variance

A

Average of the SUM of squared deviations from the mean (Mean Squared Distance).
-Takes into account N.

31
Q

Population Standard Deviation

A

Represents an average distance or direction from the mean.

32
Q

Sample Variance

A

Average SUM of squared deviations from the mean.

-Takes into account n.

33
Q

Standard Deviation

A

Represents an average distance or direction from the mean.