Class 3 Spring 🌷 Flashcards

1
Q

What are the three measures of Central Tendency?

A

Mean, median, mode
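As a quick illustration, all three measures can be computed with Python's standard `statistics` module. The `scores` list is made-up data for this sketch, not something from the class.

```python
import statistics

# Hypothetical data, for illustration only
scores = [2, 3, 3, 5, 7, 10]

mean = statistics.mean(scores)      # sum of the values divided by how many there are
median = statistics.median(scores)  # middle value (here, the average of the two middle values)
mode = statistics.mode(scores)      # value that occurs most often

print(mean, median, mode)
```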

2
Q

What are the measures of Dispersion?

A

Range, IQR, variance, standard deviation

3
Q

What type of data is the mode primarily used for?

A

Categorical data

4
Q

What is the definition of ‘mode’?

A

The value with the most occurrences in the data set

5
Q

What is the formula to calculate the range?

A

Highest number - Lowest number
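A minimal sketch of that formula in Python, using a made-up list of values:

```python
# Hypothetical data, for illustration only
values = [4, 8, 15, 16, 23, 42]

# Range = highest number - lowest number
data_range = max(values) - min(values)

print(data_range)  # 38
```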

6
Q

What is variance represented by?

A

s²

7
Q

What is the relationship between variance and standard deviation?

A

Standard deviation is the square root of variance
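The relationship can be checked numerically. This sketch assumes the sample versions (dividing by n - 1), which is what Python's `statistics.variance` and `statistics.stdev` compute; the data are made up.

```python
import math
import statistics

# Hypothetical data, for illustration only
data = [2, 4, 4, 4, 5, 5, 7, 9]

variance = statistics.variance(data)  # sample variance s^2 (divides by n - 1)
std_dev = statistics.stdev(data)      # sample standard deviation s

# s is the square root of s^2
print(math.isclose(std_dev, math.sqrt(variance)))  # True
```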

8
Q

What does the Interquartile Range (IQR) measure?

A

Dispersion around the median: the spread of the middle 50% of the data (Q3 - Q1)
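A small sketch of computing the quartiles and the IQR with Python's `statistics.quantiles`. The data are made up, and note that textbooks and software use slightly different quartile conventions, so the exact Q1 and Q3 values can differ a little between tools.

```python
import statistics

# Hypothetical data, for illustration only
data = [1, 2, 3, 4, 5, 6, 7, 8, 9]

# n=4 splits the data into quarters and returns the three cut points
q1, median, q3 = statistics.quantiles(data, n=4)

iqr = q3 - q1  # width of the middle 50% of the data

print(q1, median, q3, iqr)
```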

9
Q

How is the median represented in a Box-and-Whisker plot?

A

A dark line drawn across the box

10
Q

What percent of data falls between Q1 and the median in a Box-and-Whisker plot?

A

25% (the whole box, from Q1 to Q3, holds the middle 50%)
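The quartiles split the data into four parts of about 25% each, which can be counted directly. This is a sketch on made-up data (the integers 1 to 100).

```python
import statistics

# Hypothetical data: the integers 1 through 100
data = list(range(1, 101))

q1, median, q3 = statistics.quantiles(data, n=4)

# Share of observations between Q1 and the median (one quarter of the data)
share_q1_to_median = sum(q1 < x <= median for x in data) / len(data)

# Share of observations inside the box, from Q1 to Q3 (the middle half)
share_inside_box = sum(q1 < x <= q3 for x in data) / len(data)

print(share_q1_to_median, share_inside_box)  # 0.25 0.5
```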

11
Q

What is a characteristic of a right (positive) skewed distribution?

A

Tail on the right side

12
Q

Fill in the blank: The _______ is the average squared distance from the mean.

A

Variance
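Written out as a formula, assuming the sample convention (dividing by n - 1 rather than n, so it is only approximately an average), with x̄ the sample mean and n the number of observations:

```latex
s^2 = \frac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar{x})^2,
\qquad
s = \sqrt{s^2}
```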

13
Q

What percent of data typically falls within one standard deviation of the mean?

A

About 68% (roughly 70%) for approximately normal data
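For data that follow a normal distribution, this share can be computed exactly. The sketch below uses `statistics.NormalDist` from Python's standard library; real data are only approximately normal, so treat the numbers as a rule of thumb.

```python
from statistics import NormalDist

z = NormalDist()  # standard normal: mean 0, standard deviation 1

# Probability of landing within 1 and within 2 standard deviations of the mean
within_one = z.cdf(1) - z.cdf(-1)  # roughly 0.68
within_two = z.cdf(2) - z.cdf(-2)  # roughly 0.95

print(round(within_one, 2), round(within_two, 2))
```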

14
Q

What type of data is IQR used with?

A

Numerical data

15
Q

What do the whiskers of a Box-and-Whisker plot represent?

A

The data outside the box; the whiskers attempt to capture the spread of the remaining observations

16
Q

True or False: Outliers are defined by hard-and-fast rules.

A

False
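Even without a hard-and-fast rule, conventions exist. One widely used convention (an assumption here, not a definition from this deck) flags values more than 1.5 × IQR below Q1 or above Q3. A sketch on made-up data:

```python
import statistics

# Hypothetical data, for illustration only; 45 looks suspiciously large
data = [10, 12, 12, 13, 14, 15, 15, 16, 18, 45]

q1, _, q3 = statistics.quantiles(data, n=4)
iqr = q3 - q1

# The 1.5 * IQR "fences" are a convention, not a strict definition of an outlier
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

flagged = [x for x in data if x < lower_fence or x > upper_fence]
print(flagged)  # [45]
```

Whether a flagged value is really an error or just an unusual observation still takes judgment.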

17
Q

What is the purpose of identifying outliers in data?

A

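Outliers can reveal data-entry or measurement errors, or genuinely unusual cases worth investigating, and they can strongly affect statistics such as the mean and standard deviation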

18
Q

What are the shapes/modalities that a distribution can have?

A

Uniform, unimodal, bimodal, multimodal

19
Q

What does skewness describe in a dataset?

A

Asymmetry of the distribution

20
Q

Common examples of right skewed data include _______.

A

People’s incomes, house prices, number of accident claims

21
Q

Which measures of centrality are primarily used for numerical data?

A

Mean and median

22
Q

What is the main question to answer when describing a dataset regarding central tendency?

A

Where is the ‘middle’ of the dataset?

23
Q

What is the primary measure of dispersion for categorical data?

24
Q

What does the term ‘deviation’ refer to in statistics?

A

Distance from the mean

25
Q

What is the first step in building a Box-and-Whisker plot?

A

Drawing a line denoting the median

26
Q

Fill in the blank: The _______ is the typical deviation of observations from the mean.

A

Standard deviation

27
Q

What percent of data typically falls within two standard deviations of the mean?

A

About 95% (for approximately normal data)

28
Q

What statistical notation is used for the standard deviation of a sample?

A

s (the square root of the variance, s²)

29
Q

What is a common method to visualize median and IQR?

A

Box-and-Whisker plots

30
Q

What is the significance of the first and third quartiles in a Box-and-Whisker plot?

A

They define the boundaries of the box representing the middle 50% of the data

31
Q

What is a common example of right/positively skewed data?

A

People’s incomes

Other examples include mileage on used cars, reaction times, house prices, and number of accident claims.

32
Q

What is a common example of left/negatively skewed data?

A

Number of fingers

Most people have ten fingers, but some may lose one or more. The age at death in wealthy countries is also negatively skewed.

33
Q

What are two top choices for visualizing skewed data?

A
  • Histograms
  • Box-and-whisker plots
34
Q

In a skewed distribution, where does the mode typically lie?

A

Under the peak of the distribution.

35
Q

What happens to the mean in a skewed distribution?

A

The mean gets pulled in the direction of the skew.

36
Q

What is the relationship between skewness and the difference between the mean and median?

A

The greater the skewness, the greater the difference between the mean and the median.
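A small sketch of this effect on made-up data: stretching the right tail further pulls the mean away from the median, while the median stays put.

```python
import statistics

# Hypothetical datasets, identical except for the largest value
datasets = {
    "symmetric": [1, 2, 3, 4, 5],
    "mild right skew": [1, 2, 3, 4, 10],
    "strong right skew": [1, 2, 3, 4, 100],
}

# Gap between mean and median for each dataset
gaps = {
    name: statistics.mean(values) - statistics.median(values)
    for name, values in datasets.items()
}

for name, gap in gaps.items():
    print(f"{name}: mean - median = {gap}")
```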

37
Q

If the data are skewed, which measure of central tendency may not provide a good estimate?

A

The mean.

38
Q

Fill in the blank: The median and IQR are only sensitive to numbers near _______.

A

Q1, the median, and Q3.

39
Q

What is the interquartile range (IQR)?

A

A measure of statistical dispersion: Q3 - Q1, the range of the middle 50% of the data.

40
Q

Which measure is likely more useful for understanding a typical individual loan?

A

The median.

41
Q

Which measure is likely more useful for understanding the total amount needed for 1,000 loans?

A

The mean.
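The two loan questions can be contrasted on a small, made-up set of right-skewed loan amounts: the median describes a typical individual loan, while a total is estimated from the mean, because the mean times the number of loans equals the sum.

```python
import statistics

# Hypothetical loan amounts, right-skewed by one large loan
loans = [5_000, 6_000, 7_000, 8_000, 74_000]

typical_loan = statistics.median(loans)  # what a typical borrower takes out
average_loan = statistics.mean(loans)    # pulled upward by the large loan

# Estimated funds needed for 1,000 loans like these
estimated_total = average_loan * 1_000

print(typical_loan, average_loan, estimated_total)
```

Multiplying the median by 1,000 instead would badly underestimate the total here, because it ignores the large loans in the tail.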

42
Q

True or False: In very skewed data, the mean provides a good estimate of the data center.

A

False.

43
Q

What happens to the mean and median in right-skewed data?

A

Median < Mean.

44
Q

What happens to the mean and median in left-skewed data?

A

Mean < Median.
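Both directions can be seen on small made-up datasets, loosely modeled on the finger-count and accident-claim examples from this deck:

```python
import statistics

# Hypothetical left-skewed data: most people have ten fingers, a few have fewer
fingers = [10] * 8 + [9, 7]

# Hypothetical right-skewed data: most people file no claims, a few file several
claims = [0] * 6 + [1, 1, 2, 6]

for name, values in [("fingers", fingers), ("claims", claims)]:
    print(name, "mean:", statistics.mean(values), "median:", statistics.median(values))
```

In the left-skewed list the mean falls below the median; in the right-skewed list it rises above it.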

45
Q

What is the summary statistic for centrality of data in symmetrical data?

A

The mean.

46
Q

What is the summary statistic for data spread?

A

Standard deviation.

47
Q

What statistical tools may not be usable with skewed data?

A
  • t-test
  • ANOVA
48
Q

What does the median represent in skewed data?

A

A better estimate of the center than the mean.

49
Q

What is a characteristic of robust statistics in relation to skewness?

A

They are stable in the presence of extreme observations.
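A minimal sketch of what stable means here, on made-up data: adding a single extreme observation moves the mean a great deal but barely moves the median.

```python
import statistics

# Hypothetical data, for illustration only
data = [10, 11, 12, 13, 14]
with_extreme = data + [1000]  # add one extreme observation

# How far each statistic moves when the extreme value is added
mean_shift = statistics.mean(with_extreme) - statistics.mean(data)
median_shift = statistics.median(with_extreme) - statistics.median(data)

print(f"mean moves by {mean_shift:.1f}, median moves by {median_shift:.1f}")
```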

50
Q

What are examples of potentially skewed datasets?

A
  • Sea Turtle Sizes
  • Stats Test Scores
  • Swim Times