Class 6 Spring 🌷 Flashcards

1
Q

What are the two types of data that can be analyzed after sampling from a larger population?

A

Categorical and numerical data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are some ways to visualize data trends and distribution shape?

A

Table or graph (e.g., frequency plot, bar chart, histogram, box-and-whisker, scatterplot)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What summary statistics are commonly calculated?

A
  • Central tendency (mean, median, mode)
  • Dispersion (range, IQR, standard deviation)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the relationship between the type of table or graph and the nature of the variables?

A

It depends primarily on whether the variables are numerical or categorical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What shape does a normal (Gaussian) distribution always have?

A

Bell-shaped and symmetrical around a central mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does the area under the normal distribution curve always equal?

A

1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does the 68-95-99.7 Rule (Empirical Rule) describe?

A
  • About 68% falls within 1 SD of the mean
  • About 95% falls within 2 SD of the mean
  • About 99.7% falls within 3 SD of the mean
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the formula for calculating a z-score?

A

(observation – population mean) / standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the standard normal distribution notation?

A

N(Β΅ = __, Οƒ = __)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How can z-scores be used in relation to different distributions?

A

To compare how unusual two measurements are, even when looking at different normal distributions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does a z-score represent?

A

The number of standard deviations an observation is above or below the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

True or False: Z-scores can be used for distributions of any shape.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the probability that a randomly sampled US adult is between 180 cm and 185 cm estimated by?

A

The shaded area under the curve

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What statistical function in R is used to calculate the lower tail area based on a z-score?

A

pnorm()

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Fill in the blank: The area under the normal curve always adds up to _______.

A

1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What happens to real-world data distributions compared to the normal distribution?

A

They will never produce a perfect curve

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What does a z-score of more than 3 signify?

A

Very unusual observations

18
Q

What is the mean and standard deviation for SAT scores?

A

Mean = 1500, Standard Deviation = 300

19
Q

What is the mean and standard deviation for ACT scores?

A

Mean = 21, Standard Deviation = 5

20
Q

What is the role of parameters in the normal distribution?

A

Parameters (mean Β΅ and standard deviation Οƒ) describe the normal distribution perfectly

21
Q

What method is used to find the probability of a measurement falling above a particular cutoff value?

A

1 - pnorm(value)

22
Q

What is the probability that a random student scores at least 1190 on her SATs calculated using?

A

1 - pnorm(z-score)

23
Q

What is the mean height of male adults in the US?

A

70.0”

24
Q

What is the standard deviation of heights for male adults in the US?

25
Q

What is the first use of z-scores?

A

To compare distributions by standardizing measurements/observations to z-scores

26
Q

What does the second use of z-scores allow you to calculate?

A

The percentile of a cutoff value

27
Q

What is the third use of z-scores?

A

To calculate the probability that a measurement will fall ABOVE a particular cutoff value

28
Q

What can you calculate using z-scores related to two values?

A

The probability that a measurement will fall BETWEEN two values

29
Q

What does the area below a cutoff value represent?

A

Area to the left

30
Q

How do you calculate the area above a cutoff value?

A

1 - pnorm()

31
Q

How is the area between two values calculated?

A

Area to left of higher value - area to left of lower value

32
Q

What does the 68-95-99.7 Rule describe?

A

Distribution of data in a normal distribution

33
Q

What percentage of data falls within 1 standard deviation of the mean according to the 68-95-99.7 Rule?

34
Q

What percentage of data falls within 2 standard deviations of the mean according to the 68-95-99.7 Rule?

35
Q

What percentage of data falls within 3 standard deviations of the mean according to the 68-95-99.7 Rule?

36
Q

What is considered β€˜unusual’ in terms of standard deviations from the mean?

37
Q

What is considered β€˜very unusual’ in terms of standard deviations from the mean?

38
Q

What are the SAT score ranges for the 68-95-99.7 Rule?

A
  • ~68%: between 1200 and 1800
  • ~95%: between 900 and 2100
  • ~99.7%: between 600 and 2400
39
Q

What function is used in R to find a z-score from an area?

40
Q

Fill in the blank: To find the z-score for a data point that is bigger than 40% of other data points, you would use _______.

A

qnorm(0.4)

41
Q

How can you manually input mean and standard deviation in R?

A

Using qnorm(area, mean=mean_value, sd=sd_value)