2: Descriptive Stats For Distirbutions Part 1 Flashcards

1
Q

Data matrices

A

Data organized into a grid format

  • row= case
  • column=variable
  • cell=value
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Frequency tables

A

Showing the number of cases with each value of a variable

  • calculate %
  • works best for categorical or discrete variables
  • for Quan that take many values report binned within certain ranges
  • hard to extract more detailed information or easily grasp shape of distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Categorical variables (nominal data)

A

Pie graphs visually represent proportions or %

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Stemplots

A

Graphical with all values of a quantitative variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Frequency histograms

A

Display the frequency distribution of one variable
-possible values of variable on X
-frequency of each variable on Y
Height = frequency
-for continuous variables: values binned into ranges
-as number of measurements increases, a frequency histogram approximats a curved shape

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Normal distribution

A

Specific shape of frequency:
-bell shaped
-symmetrical
-

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Positive skew

A

Tail to the right. (Most values on left)

  • floor effects: cluster at low end
  • high outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Negative skew

A

Tail to left (most values on right)

  • ceiling effects (many cluster at high end)
  • low outliers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Central tendency

A
  • center of distributions

- mode,median,mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Describing variability

A

Spread of distribution

-range, standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Mode

A

Most frequently occurring

  • near center if normal
  • may have multiple modes (bimodal) non normal
  • values other than most frequent not considered
  • limited application for continuous variables

Preferred when:

  • discrete data
  • nominal scale
  • multimodal
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Median

A

Value at midpoint

  • resistant to outliers
  • takes more info into account than mode, but still ignores magnitudes

Preferred when;

  • skewed distribution
  • ordinal scale
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Mean

A

Average of all values

  • sensitive to outliers
  • most informative

Preferable when:

  • interval or ratio
  • normal distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Skewed distribution

A

Median remains stable near center,

-outliers pull the mean in the direction of tail

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Multimodal

A

Modes may be most informative

-mean and median may not represent the typical value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Range

A

Max-min

  • very sensitive to outliers
  • values between extremes not accounted for

Preferred if Max and min relevant

17
Q

Interquartile range

A
Defines the range of middle 50% 
-Q1: median lower half
-Q3: median of upper half
These are boudriesnif middle 50%
-very resistant to outliers 
-more info than range but still ignores the magnitude of most values. 
IQR= Q3-Q1

Simple rough estimate

18
Q

Standard deviation

A

Variance: S^2 average squared deviation of each score from mean

Standard deviation is square root of variance
Variance= sum(X-M)^2/N-1

-most widely used
-sensitive to outliers
-take all scores
Preferred in most cases if normal distribution

19
Q

Five number summary

A
Max score
Third quartile
Median
First quartile
Min score
  • resistant measure of center
  • resistant measure of spread
  • sensitive measure of outer limits of spread
20
Q

Box plots

A

Values of variable Y
First and third quartile ends in box
A line in box is median
Whiskers max and min scores