Basic Statistics & Hypothesis Testing (2024) Flashcards by Andrea Leong

The mode of a dataset is defined as:

The most frequently occurring value

Which measure of central tendency is most affected by extreme values?

Mean
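
As a quick illustration of why the mean is the sensitive one, the sketch below adds a single extreme value to a small dataset and compares how far the mean and the median move. The numbers are made up for the example.

```python
import statistics

# Hypothetical values chosen only for illustration.
data = [10, 12, 13, 15, 16]
with_outlier = data + [200]  # one extreme value added

mean_before = statistics.mean(data)
mean_after = statistics.mean(with_outlier)
median_before = statistics.median(data)
median_after = statistics.median(with_outlier)

# The mean jumps by more than 30; the median moves by only 1.
print("mean:  ", mean_before, "->", mean_after)
print("median:", median_before, "->", median_after)
```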

What does the range of a dataset represent?

The difference between the highest and lowest values

Which of the following best describes standard deviation?

A measure of spread in the data

In statistics, what does a histogram represent?

The frequency of data within intervals

Which type of data is represented by categories?

Categorical

In a normal distribution, approximately what percentage of data falls within one standard deviation of the mean?

68%
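
The 68% figure can be checked directly from the normal cumulative distribution function. A minimal sketch using Python's standard library (the variable names are illustrative):

```python
from statistics import NormalDist

# Standard normal: mean 0, standard deviation 1.
standard_normal = NormalDist(mu=0.0, sigma=1.0)

# Probability of landing between -1 and +1 standard deviations.
within_one_sd = standard_normal.cdf(1.0) - standard_normal.cdf(-1.0)
print(round(within_one_sd, 4))  # 0.6827
```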

What is the primary purpose of a scatter plot?

To analyze the relationship between two variables

Which type of variable represents data that can take on any value within a range, like height?

Continuous

If the mean of a dataset is 10 and the median is 15, the data is likely (describe the shape of the curve):

Skewed to the left (negatively skewed), because the mean is pulled below the median

Which measure of spread is calculated as the difference between the highest and lowest values?

Range

What is the term for a sample that accurately represents the population from which it was drawn?

Representative sample

What type of graph is best for displaying the frequency of categorical data?

Bar chart

If two events cannot happen at the same time, they are called:

Mutually exclusive

The median is defined as:

The middle value when data is ordered

Which of the following data types is continuous?

A measurement such as height or weight (gender is categorical, not continuous)

What does a box plot display about a dataset?

The quartiles and median
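
The three cut points that a box plot draws can be computed with the standard library. This is only a sketch on made-up data; note that software packages use slightly different quartile conventions, and `statistics.quantiles` defaults to the "exclusive" method.

```python
import statistics

# Hypothetical data, already sorted for readability.
data = [1, 2, 3, 4, 5, 6, 7, 8, 9]

# n=4 splits the data into four equal-probability groups,
# returning the three cut points: Q1, Q2 (the median), Q3.
q1, q2, q3 = statistics.quantiles(data, n=4)
median = statistics.median(data)

print("Q1:", q1, "median:", q2, "Q3:", q3)
```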

What is an outlier in a dataset?

A value much higher or lower than the rest

Which of the following is a measure of central tendency?

Mean

What is the sum of probabilities for all possible outcomes in a probability distribution?

1 (that is, 100%)

The interquartile range (IQR) is the range of values between:

The first quartile (Q1) and the third quartile (Q3)

What is a bar graph typically used to display?

Categorical data

Which type of data has meaningful zero points and can be used in calculations?

Ratio

The probability of a single event is defined as:

The ratio of the number of favorable outcomes to the total outcomes

Which of the following best describes the null hypothesis? A) The hypothesis that there is a significant effect. B) The hypothesis that the results are due to chance. C) The hypothesis that there is no significant difference. D) The hypothesis that all sample means are equal.

The hypothesis that there is no significant difference.

In a hypothesis test, if the p-value is less than the significance level (α), you should: A) Fail to reject the null hypothesis. B) Reject the null hypothesis. C) Increase the sample size. D) Change the significance level.

Reject the null hypothesis.

A Type I error occurs when: A) The null hypothesis is rejected when it is true. B) The null hypothesis is not rejected when it is false. C) The alternative hypothesis is accepted when it is false. D) There is insufficient evidence to make a decision.

The null hypothesis is rejected when it is true.

Which of the following tests is appropriate for comparing means of two independent samples? A) Paired t-test B) Independent t-test C) Chi-square test D) ANOVA

Independent t-test

In hypothesis testing, the p-value represents: A) The probability of obtaining a test statistic at least as extreme as the one observed, assuming the null hypothesis is true. B) The probability that the null hypothesis is true. C) The probability that the alternative hypothesis is true. D) The probability of making a Type II error.

The probability of obtaining a test statistic at least as extreme as the one observed, assuming the null hypothesis is true.
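
To make the definition concrete, here is a minimal sketch of a two-sided p-value for a z statistic, followed by the usual comparison against α. The function names are hypothetical, and the example assumes the test statistic follows a standard normal distribution under the null hypothesis.

```python
from statistics import NormalDist


def two_sided_p_value(z: float) -> float:
    """Probability of a z statistic at least as extreme as |z|, assuming H0 is true."""
    return 2.0 * (1.0 - NormalDist().cdf(abs(z)))


def reject_null(p_value: float, alpha: float = 0.05) -> bool:
    """Decision rule: reject H0 when the p-value is below the significance level."""
    return p_value < alpha


for z in (1.0, 1.96, 2.5):
    p = two_sided_p_value(z)
    print(f"z = {z}: p = {p:.4f}, reject H0 at 0.05? {reject_null(p)}")
```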

When performing a hypothesis test for a population proportion, which distribution is generally used? A) Normal distribution B) Chi-square distribution C) t-distribution D) Exponential distribution

Normal distribution

The level of significance in hypothesis testing is typically denoted by: A) β B) α C) p D) μ

α

A p-value of 0.03 means: A) There is a 3% chance that the null hypothesis is true. B) There is a 3% chance that the results are due to random variation. C) There is a 97% chance that the null hypothesis is true. D) The probability of observing the sample results, given that the null hypothesis is true, is 3%.

The probability of observing the sample results, given that the null hypothesis is true, is 3%.

In an ANOVA test, the null hypothesis is: A) The variances of the populations are equal. B) The population means are equal. C) The sample means are different. D) The population variances are different.

The population means are equal.

What does a chi-square test measure? A) The mean difference between groups. B) The association between categorical variables. C) The probability of a continuous variable. D) The difference between paired sample means.

The association between categorical variables.

The power of a test is defined as: A) The probability of rejecting the null hypothesis. B) The probability of accepting the null hypothesis. C) The probability of correctly rejecting a false null hypothesis. D) The probability of making a Type I error.

The probability of correctly rejecting a false null hypothesis.

A Type II error occurs when: A) The null hypothesis is rejected when it is true. B) The null hypothesis is not rejected when it is false. C) The alternative hypothesis is rejected when it is false. D) There is insufficient evidence to make a decision.

The null hypothesis is not rejected when it is false.

Which of the following would increase the power of a test? A) Increasing sample size B) Increasing significance level C) Reducing variance D) All of the above

All of the above
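
The effect of each option can be seen in a small power calculation. The sketch below assumes a one-sided z-test with known standard deviation, which is a simplification chosen for illustration; the function name and the numbers are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def power_one_sided_z(effect: float, sigma: float, n: int, alpha: float = 0.05) -> float:
    """Power of a one-sided z-test: P(reject H0) when the true mean shift is `effect`."""
    z = NormalDist()
    critical = z.inv_cdf(1.0 - alpha)   # rejection threshold under H0
    shift = effect * sqrt(n) / sigma    # how far the true mean moves the statistic
    return 1.0 - z.cdf(critical - shift)


baseline = power_one_sided_z(effect=0.5, sigma=1.0, n=25, alpha=0.05)
larger_n = power_one_sided_z(effect=0.5, sigma=1.0, n=50, alpha=0.05)
larger_alpha = power_one_sided_z(effect=0.5, sigma=1.0, n=25, alpha=0.10)
smaller_sigma = power_one_sided_z(effect=0.5, sigma=0.8, n=25, alpha=0.05)

print(f"baseline:        {baseline:.3f}")
print(f"larger sample:   {larger_n:.3f}")
print(f"larger alpha:    {larger_alpha:.3f}")
print(f"smaller spread:  {smaller_sigma:.3f}")
```

Raising α buys power at the cost of a higher Type I error rate, so in practice it is the least attractive of the three levers.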

When should you use a paired t-test? A) Comparing two related groups B) Comparing two independent groups C) Comparing frequencies D) Comparing variances

Comparing two related groups

The F-distribution is used in which of the following tests? A) Chi-square test B) t-test C) ANOVA D) Z-test

ANOVA

What is the critical value in hypothesis testing? A) The probability of making a Type I error B) The threshold to reject the null hypothesis C) The value that must exceed the test statistic D) The effect size needed to reject the null hypothesis

The threshold to reject the null hypothesis

The null hypothesis is rejected when: A) The p-value is greater than α B) The test statistic is within the confidence interval C) The p-value is less than α D) The sample mean is equal to the population mean

The p-value is less than α

Which of the following would decrease the likelihood of a Type I error? A) Increasing sample size B) Decreasing α C) Increasing variance D) Using a two-tailed test

Decreasing α

What is the alternative hypothesis in a hypothesis test? A) The hypothesis that predicts a difference B) The hypothesis that predicts no difference C) The hypothesis that the sample is biased D) The hypothesis that the population mean is zero

The hypothesis that predicts a difference

The Central Limit Theorem states that: A) All sample distributions become normal B) The distribution of sample means will be approximately normal, regardless of the population distribution, as sample size increases C) The sample mean equals the population mean D) Variance decreases with larger samples

The distribution of sample means will be approximately normal, regardless of the population distribution, as sample size increases
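
A small simulation can illustrate the theorem. The sketch below draws repeated samples from a strongly skewed exponential distribution (mean 1, standard deviation 1) and looks at the sample means; the seed, sample size, and number of repetitions are arbitrary choices for the example.

```python
import random
import statistics
from math import sqrt

random.seed(0)      # fixed seed so the run is repeatable
SAMPLE_SIZE = 50    # observations per sample
NUM_SAMPLES = 2000  # number of repeated samples

# Each entry is the mean of one sample from a skewed (exponential) population.
sample_means = [
    statistics.fmean([random.expovariate(1.0) for _ in range(SAMPLE_SIZE)])
    for _ in range(NUM_SAMPLES)
]

center = statistics.fmean(sample_means)
spread = statistics.stdev(sample_means)
expected_spread = 1.0 / sqrt(SAMPLE_SIZE)  # sigma / sqrt(n)

print(f"mean of sample means: {center:.3f} (population mean is 1)")
print(f"sd of sample means:   {spread:.3f} (theory suggests about {expected_spread:.3f})")
```

Plotting `sample_means` as a histogram would show a roughly bell-shaped curve even though the underlying population is far from normal.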

In a hypothesis test, the test statistic measures: A) The likelihood of a Type I error B) The effect size C) The amount of evidence against the null hypothesis D) The variability of the sample

The amount of evidence against the null hypothesis

Which distribution is used in a Z-test? A) Normal distribution B) t-distribution C) Chi-square distribution D) Exponential distribution

Normal distribution

In hypothesis testing, if the confidence interval includes zero, this suggests: A) The null hypothesis should be rejected B) There is no significant difference C) The sample mean is larger than the population mean D) The test is invalid

There is no significant difference

A two-tailed test is used when: A) The test only evaluates one direction B) The test evaluates two directions C) The test is less powerful D) The test has more Type I errors

The test evaluates two directions

What is an effect size in hypothesis testing? A) The probability of making a Type I error B) The size of the effect of interest C) The size of the sample D) The amount of Type II error

The size of the effect of interest

If the sample size increases, the p-value will: A) Increase B) Decrease C) Stay the same D) Be zero

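Decrease (for a real effect of fixed size, a larger sample shrinks the standard error, so the p-value tends to fall)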

The t-test is suitable for: A) Testing the difference between means B) Testing the relationship between variables C) Testing frequencies D) Testing variances

Testing the difference between means

When conducting a test for proportions, which condition must be met? A) np ≥ 10 and n(1 − p) ≥ 10 B) np < 5 C) n > 30 D) n(1 − p) < 5

np ≥ 10 and n(1 − p) ≥ 10
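
The condition is easy to check before running the test. The sketch below pairs that check with a one-proportion z statistic; the function names and the example counts are hypothetical.

```python
from math import sqrt


def normal_approx_ok(n: int, p0: float) -> bool:
    """Check np >= 10 and n(1 - p) >= 10 using the hypothesized proportion p0."""
    return n * p0 >= 10 and n * (1 - p0) >= 10


def one_proportion_z(successes: int, n: int, p0: float) -> float:
    """z statistic for H0: p = p0, with the standard error computed under H0."""
    p_hat = successes / n
    standard_error = sqrt(p0 * (1 - p0) / n)
    return (p_hat - p0) / standard_error


print(normal_approx_ok(100, 0.5))  # True: 50 expected successes and 50 failures
print(normal_approx_ok(50, 0.1))   # False: only 5 expected successes
print(round(one_proportion_z(60, 100, 0.5), 2))  # 2.0
```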

An increase in sample variance will: A) Increase power B) Increase Type I error C) Decrease power D) Decrease significance level

Decrease power

Which of the following tests compares observed frequencies to expected frequencies? A) Paired t-test B) Chi-square test C) Z-test D) Mann-Whitney U test

Chi-square test
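
The comparison of observed and expected frequencies boils down to one sum. A minimal sketch of the chi-square statistic on made-up counts (converting the statistic to a p-value would additionally need the chi-square distribution, which is not shown here):

```python
def chi_square_statistic(observed, expected):
    """Sum of (observed - expected)^2 / expected across all categories."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Hypothetical counts for three categories.
observed = [30, 20, 50]
expected = [25, 25, 50]

statistic = chi_square_statistic(observed, expected)
print(statistic)  # 2.0
```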