Basic Statistics & Hypothesis Testing (2024) Flashcards

1
Q

The mode of a dataset is defined as:

A

The most frequently occurring value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Which measure of central tendency is most affected by extreme values?

A

Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does the range of a dataset represent?

A

The difference between the highest and lowest values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Which of the following best describes standard deviation?

A

A measure of spread in the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

In statistics, what does a histogram represent?

A

The frequency of data within intervals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Which type of data is represented by categories?

A

Categorical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

In a normal distribution, approximately what percentage of data falls within one standard deviation of the mean?

A

68%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the primary purpose of a scatter plot?

A

To analyze the relationship between two variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Which type of variable represents data that can take on any value within a range, like height?

A

Continuous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

If the mean of a dataset is 10 and the median is 15, the data is likely (describe the shape of the curve):

A

Skewed to the right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Which measure of spread is calculated as the difference between the highest and lowest values?

A

Range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the term for a sample that accurately represents the population from which it was drawn?

A

Random sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What type of graph is best for displaying the frequency of categorical data?

A

Bar chart

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

If two events cannot happen at the same time, they are called:

A

Mutually exclusive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

The median is defined as:

A

The middle value when data is ordered

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Which of the following data types is continuous?

A

Gender

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What does a box plot display about a dataset?

A

The quartiles and median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is an outlier in a dataset?

A

A value much higher or lower than the rest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Which of the following is a measure of central tendency?

A

Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the sum of probabilities for all possible outcomes in a probability distribution?

A

100%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

The interquartile range (IQR) is the range of values between:

A

The minimum and maximum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is a bar graph typically used to display?

A

Continuous data

23
Q

Which type of data has meaningful zero points and can be used in calculations?

24
Q

The probability of a single event is defined as:

A

The ratio of the number of favorable outcomes to the total outcomes

25
Which of the following best describes the null hypothesis? A) The hypothesis that there is a significant effect. B) The hypothesis that the results are due to chance. C) The hypothesis that there is no significant difference. D) The hypothesis that all sample means are equal.
The hypothesis that there is no significant difference.
26
In a hypothesis test, if the p-value is less than the significance level (α), you should: A) Fail to reject the null hypothesis. B) Reject the null hypothesis. C) Increase the sample size. D) Change the significance level.
Reject the null hypothesis.
27
A Type I error occurs when: A) The null hypothesis is rejected when it is true. B) The null hypothesis is not rejected when it is false. C) The alternative hypothesis is accepted when it is false. D) There is insufficient evidence to make a decision.
The null hypothesis is rejected when it is true.
28
Which of the following tests is appropriate for comparing means of two independent samples? A) Paired t-test B) Independent t-test C) Chi-square test D) ANOVA
Independent t-test
29
In hypothesis testing, the p-value represents: A) The probability of obtaining a test statistic at least as extreme as the one observed, assuming the null hypothesis is true. B) The probability that the null hypothesis is true. C) The probability that the alternative hypothesis is true. D) The probability of making a Type II error.
The probability of obtaining a test statistic at least as extreme as the one observed, assuming the null hypothesis is true.
30
When performing a hypothesis test for a population proportion, which distribution is generally used? A) Normal distribution B) Chi-square distribution C) t-distribution D) Exponential distribution
Normal distribution
31
The level of significance in hypothesis testing is typically denoted by: A) β B) α C) p D) μ
α
32
A p-value of 0.03 means: A) There is a 3% chance that the null hypothesis is true. B) There is a 3% chance that the results are due to random variation. C) There is a 97% chance that the null hypothesis is true. D) The probability of observing the sample results, given that the null hypothesis is true, is 3%.
The probability of observing the sample results, given that the null hypothesis is true, is 3%.
33
In an ANOVA test, the null hypothesis is: A) The variances of the populations are equal. B) The population means are equal. C) The sample means are different. D) The population variances are different.
The population means are equal.
34
What does a chi-square test measure? A) The mean difference between groups. B) The association between categorical variables. C) The probability of a continuous variable. D) The difference between paired sample means.
The association between categorical variables.
35
The power of a test is defined as: A) The probability of rejecting the null hypothesis. B) The probability of accepting the null hypothesis. C) The probability of correctly rejecting a false null hypothesis. D) The probability of making a Type I error.
The probability of correctly rejecting a false null hypothesis.
36
A Type II error occurs when: A) The null hypothesis is rejected when it is true. B) The null hypothesis is not rejected when it is false. C) The alternative hypothesis is rejected when it is false. D) There is insufficient evidence to make a decision.
The null hypothesis is rejected when it is true.
37
Which of the following would increase the power of a test? A) Increasing sample size B) Increasing significance level C) Reducing variance D) All of the above
All of the above
38
When should you use a paired t-test? A) Comparing two related groups B) Comparing two independent groups C) Comparing frequencies D) Comparing variances
Comparing two related groups
39
The F-distribution is used in which of the following tests? A) Chi-square test B) t-test C) ANOVA D) Z-test
ANOVA
40
What is the critical value in hypothesis testing? A) The probability of making a Type I error B) The threshold to reject the null hypothesis C) The value that must exceed the test statistic D) The effect size needed to reject the null hypothesis
The threshold to reject the null hypothesis
41
The null hypothesis is rejected when: A) The p-value is greater than α B) The test statistic is within the confidence interval C) The p-value is less than α D) The sample mean is equal to the population mean
The p-value is less than α
42
Which of the following would decrease the likelihood of a Type I error? A) Increasing sample size B) Decreasing α C) Increasing variance D) Using a two-tailed test
Decreasing α
43
What is the alternative hypothesis in a hypothesis test? A) The hypothesis that predicts a difference B) The hypothesis that predicts no difference C) The hypothesis that the sample is biased D) The hypothesis that the population mean is zero
The hypothesis that predicts a difference
44
The Central Limit Theorem states that: A) All sample distributions become normal B) The distribution of sample means will be approximately normal, regardless of the population distribution, as sample size increases C) The sample mean equals the population mean D) Variance decreases with larger samples
The distribution of sample means will be approximately normal, regardless of the population distribution, as sample size increases
45
In a hypothesis test, the test statistic measures: A) The likelihood of a Type I error B) The effect size C) The amount of evidence against the null hypothesis D) The variability of the sample
The amount of evidence against the null hypothesis
46
Which distribution is used in a Z-test? A) Normal distribution B) t-distribution C) Chi-square distribution D) Exponential distribution
Normal distribution
47
In hypothesis testing, if the confidence interval includes zero, this suggests: A) The null hypothesis should be rejected B) There is no significant difference C) The sample mean is larger than the population mean D) The test is invalid
There is no significant difference
48
A two-tailed test is used when: A) The test only evaluates one direction B) The test evaluates two directions C) The test is less powerful D) The test has more Type I errors
The test evaluates two directions
49
What is an effect size in hypothesis testing? A) The probability of making a Type I error B) The size of the effect of interest C) The size of the sample D) The amount of Type II error
The size of the effect of interest
50
If the sample size increases, the p-value will: A) Increase B) Decrease C) Stay the same D) Be zero
Increase
51
The t-test is suitable for: A) Testing the difference between means B) Testing the relationship between variables C) Testing frequencies D) Testing variances
Testing the difference between means
52
When conducting a test for proportions, which condition must be met? A) np ≥ 10 and n(1 − p) ≥ 10 B) np < 5 C) n > 30 D) n(1 − p) < 5
np ≥ 10 and n(1 − p) ≥ 10
53
An increase in sample variance will: A) Increase power B) Increase Type I error C) Decrease power D) Decrease significance level
Decrease power
54
Which of the following tests compares observed frequencies to expected frequencies? A) Paired t-test B) Chi-square test C) Z-test D) Mann-Whitney U test
Chi-square test