Statistical Inference Flashcards

Question 1

Q

What is statistical inference?

Answer

A

The process of drawing conclusions about a population based on a sample.

Question 2

Q

True or False: Statistical inference can only be performed with large sample sizes.

Answer

A

False

Question 3

Q

Fill in the blank: A _______ is a numerical summary of a sample.

Answer

A

statistic

Question 4

Q

What is a population in statistical terms?

Answer

A

The entire group of individuals or instances about whom we hope to learn.

Question 5

Q

What is a sample?

Answer

A

A subset of the population selected for analysis.

Question 6

Q

Multiple Choice: Which of the following is a common method for selecting a sample?

Answer

A

Random sampling.

Question 7

Q

What is the purpose of hypothesis testing?

Answer

A

To determine whether there is enough evidence to reject a null hypothesis.

Question 8

Q

True or False: The null hypothesis typically represents a statement of no effect or no difference.

Answer

A

True

Question 9

Q

Fill in the blank: The _______ is the probability of making a Type I error.

Answer

A

alpha level

Question 10

Q

What is a Type I error?

Answer

A

Rejecting the null hypothesis when it is actually true.
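
A small simulation can make the link between the alpha level and Type I errors concrete. This is only a sketch under stated assumptions: the null hypothesis (mean 0, known standard deviation 1) is true by construction, the test is a two-sided z-test, and the sample size, trial count, and seed are arbitrary choices for illustration.

```python
import random
from math import sqrt
from statistics import NormalDist, mean

random.seed(1)  # arbitrary seed so the run is repeatable

ALPHA = 0.05
Z_CRIT = NormalDist().inv_cdf(1 - ALPHA / 2)  # two-sided critical value, about 1.96

def rejects_true_null(n=30):
    """Draw a sample for which H0 (mean 0, sd 1) is true and run a two-sided z-test."""
    sample = [random.gauss(0, 1) for _ in range(n)]
    z = mean(sample) * sqrt(n)  # (sample mean - 0) / (1 / sqrt(n))
    return abs(z) > Z_CRIT

trials = 5000
type_i_rate = sum(rejects_true_null() for _ in range(trials)) / trials
print(f"Observed Type I error rate: {type_i_rate:.3f}")
```

Because every rejection here is a false rejection, the observed rate should land close to the chosen alpha of 0.05.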

Question 11

Q

What is a Type II error?

Answer

A

Failing to reject the null hypothesis when it is false.

Question 12

Q

Multiple Choice: Which of the following is a common significance level used in hypothesis testing?

Answer

A

0.05

Question 13

Q

What does a p-value represent?

Answer

A

The probability of obtaining test results at least as extreme as the observed results, assuming the null hypothesis is true.
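
A minimal sketch of this definition, assuming a two-sided z-test whose statistic follows a standard normal distribution under the null hypothesis. The function name and the observed value 1.96 are hypothetical, chosen only for illustration.

```python
from statistics import NormalDist

def two_sided_p_value(z: float) -> float:
    """P(|Z| >= |z|) for a standard normal Z, i.e. computed assuming H0 is true."""
    return 2 * (1 - NormalDist().cdf(abs(z)))

z_observed = 1.96  # made-up test statistic
p = two_sided_p_value(z_observed)
print(f"z = {z_observed}, p = {p:.4f}")
```

A statistic farther from zero is "more extreme" and therefore yields a smaller p-value.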

Question 14

Q

True or False: A smaller p-value indicates stronger evidence against the null hypothesis.

Answer

A

True

Question 15

Q

What is a confidence interval?

Answer

A

A range of values derived from a sample that is likely to contain the value of an unknown population parameter.
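
One way to compute such a range is sketched below for a population mean, assuming a normal approximation. For a sample this small a t critical value would usually be more appropriate; the z value is used only to keep the example within the standard library. The data are made up.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev

def mean_confidence_interval(data, level=0.95):
    """Normal-approximation confidence interval for the population mean."""
    z = NormalDist().inv_cdf(0.5 + level / 2)  # e.g. about 1.96 for 95%
    margin = z * stdev(data) / sqrt(len(data))
    center = mean(data)
    return center - margin, center + margin

sample = [4.9, 5.1, 5.0, 5.3, 4.7, 5.2, 4.8, 5.0]  # hypothetical measurements
low, high = mean_confidence_interval(sample)
print(f"95% CI for the mean: ({low:.3f}, {high:.3f})")
```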

Question 16

Q

Fill in the blank: A 95% confidence interval means that if we were to take many samples, _______ of those intervals would contain the true population parameter.

Answer

A

approximately 95%

Question 17

Q

What is the Central Limit Theorem?

Answer

A

The theorem stating that the distribution of sample means approaches a normal distribution as the sample size increases, regardless of the shape of the population distribution (provided it has finite variance).
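
The theorem can be illustrated by simulation. The sketch below assumes an exponential population (rate 1, so mean 1 and standard deviation 1), which is strongly skewed; the seed, sample sizes, and trial count are arbitrary.

```python
import random
from math import sqrt
from statistics import mean, stdev

random.seed(0)  # arbitrary seed so the run is repeatable

def sample_means(n, trials=2000):
    """Means of `trials` independent samples of size n from an exponential(1) population."""
    return [mean(random.expovariate(1.0) for _ in range(n)) for _ in range(trials)]

for n in (2, 10, 50):
    means = sample_means(n)
    # The spread of the sample means should be close to 1 / sqrt(n).
    print(n, round(mean(means), 3), round(stdev(means), 3), round(1 / sqrt(n), 3))
```

The sample means cluster around the population mean, and their spread shrinks roughly as 1 / sqrt(n); a histogram of `means` would also look increasingly bell-shaped as n grows.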

Question 18

Q

True or False: The Central Limit Theorem applies only to normally distributed populations.

Answer

A

False

Question 19

Q

What is a point estimate?

Answer

A

A single value estimate of a population parameter.

Question 20

Q

Multiple Choice: Which of the following is NOT a method of estimation?

Answer

A

Extrapolation.

Question 21

Q

What is the difference between descriptive statistics and inferential statistics?

Answer

A

Descriptive statistics summarize data, while inferential statistics draw conclusions about a population based on sample data.

Question 22

Q

Fill in the blank: A _______ is a graphical representation of the distribution of a dataset.

Answer

A

histogram

Question 23

Q

What is the purpose of a t-test?

Answer

A

To determine if there is a significant difference between the means of two groups.
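
As a rough sketch, the statistic behind a two-sample comparison can be computed by hand. The version below assumes Welch's form of the t-test (unequal variances allowed), which is one of several variants. It returns only the statistic and the approximate degrees of freedom, since the standard library has no t distribution for the p-value. The data are made up.

```python
from math import sqrt
from statistics import mean, variance

def welch_t(a, b):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom."""
    va = variance(a) / len(a)  # squared standard error of mean(a)
    vb = variance(b) / len(b)  # squared standard error of mean(b)
    t = (mean(a) - mean(b)) / sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df

group_a = [1, 2, 3, 4, 5]  # hypothetical scores
group_b = [2, 3, 4, 5, 6]
t, df = welch_t(group_a, group_b)
print(f"t = {t:.3f}, df = {df:.1f}")
```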

Question 24

Q

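True or False: The t-test can only be used for normally distributed data.

Answer

A

False. The t-test is reasonably robust to departures from normality when the samples are large.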

Question 25

Q

What is ANOVA?

Answer

A

Analysis of Variance, a method used to compare means among three or more groups.

Question 26

Q

Multiple Choice: Which of the following is a prerequisite for conducting ANOVA?

Answer

A

Independence of observations.

Question 27

Q

What is a chi-square test used for?

Answer

A

To determine if there is a significant association between categorical variables.

Question 28

Q

Fill in the blank: The _______ test is used to compare observed frequencies with expected frequencies.

Answer

A

chi-square
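
The comparison of observed and expected frequencies reduces to a single statistic, the sum of (observed - expected)^2 / expected over all categories. A minimal sketch, assuming a goodness-of-fit setting with made-up counts from 60 rolls of a die that is expected to be fair:

```python
def chi_square_statistic(observed, expected):
    """Sum of (O - E)^2 / E over all categories."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

observed = [8, 12, 9, 11, 10, 10]  # hypothetical counts for faces 1..6
expected = [10] * 6                # fair die: 60 rolls / 6 faces
chi2 = chi_square_statistic(observed, expected)
print(f"chi-square = {chi2:.2f} with {len(observed) - 1} degrees of freedom")
```

The statistic is then compared against a chi-square distribution with the appropriate degrees of freedom to obtain a p-value.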

Question 29

Q

True or False: Correlation implies causation.

Answer

A

False

Question 30

Q

What does a correlation coefficient of 1 indicate?

Answer

A

A perfect positive linear relationship between two variables.

Question 31

Q

What does a correlation coefficient of -1 indicate?

Answer

A

A perfect negative linear relationship between two variables.
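
Both extremes can be checked with a hand-written Pearson correlation. This is a sketch with made-up data; the function name is hypothetical.

```python
from math import sqrt
from statistics import mean

def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

x = [1, 2, 3, 4]
r_up = pearson_r(x, [2, 4, 6, 8])    # y rises exactly linearly with x
r_down = pearson_r(x, [8, 6, 4, 2])  # y falls exactly linearly with x
print(round(r_up, 6), round(r_down, 6))
```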

Question 32

Q

Multiple Choice: What is the range of values for a correlation coefficient?

Answer

A

-1 to 1.

Question 33

Q

What is regression analysis used for?

Answer

A

To model the relationship between a dependent variable and one or more independent variables.
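
For the simplest case, one independent variable and a straight-line model, the fit can be sketched with ordinary least squares. The data are made up and lie exactly on a line, so the residuals come out as zero; real data would not.

```python
from statistics import mean

def fit_line(x, y):
    """Ordinary least squares slope and intercept for y = intercept + slope * x."""
    mx, my = mean(x), mean(y)
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sum((a - mx) ** 2 for a in x)
    intercept = my - slope * mx
    return slope, intercept

x = [1, 2, 3, 4, 5]   # independent variable (hypothetical)
y = [3, 5, 7, 9, 11]  # dependent variable (hypothetical)
slope, intercept = fit_line(x, y)
# Residuals: observed values minus the values predicted by the fitted line.
residuals = [b - (intercept + slope * a) for a, b in zip(x, y)]
print(slope, intercept, residuals)
```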

Question 34

Q

Fill in the blank: In regression analysis, the _______ variable is the one being predicted.

Answer

A

dependent

Question 35

Q

True or False: In linear regression, the relationship between variables is assumed to be linear.

Answer

A

True

Question 36

Q

What is multicollinearity?

Answer

A

A situation in regression analysis where independent variables are highly correlated.

Question 37

Q

What is the purpose of the residual analysis in regression?

Answer

A

To check the validity of the regression model by analyzing the differences between observed and predicted values.

Question 38

Q

Multiple Choice: Which of the following is a common criterion for model selection in regression?

Answer

A

Adjusted R-squared.

Question 39

Q

What is the difference between parametric and non-parametric tests?

Answer

A

Parametric tests assume a specific distribution, while non-parametric tests do not.

Question 40

Q

Fill in the blank: Non-parametric tests are often used when data does not meet _______ assumptions.

Answer

A

normality

Question 41

Q

What is the role of sample size in statistical inference?

Answer

A

Larger sample sizes generally provide more reliable estimates and reduce sampling error.
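
The effect of sample size can be seen in the margin of error for a mean, z * sigma / sqrt(n). The sketch below assumes a known population standard deviation (the value 10 is made up) and a normal critical value.

```python
from math import sqrt
from statistics import NormalDist

def margin_of_error(sigma, n, level=0.95):
    """Margin of error for a mean when the population sd `sigma` is known."""
    z = NormalDist().inv_cdf(0.5 + level / 2)
    return z * sigma / sqrt(n)

for n in (25, 100, 400):
    print(n, round(margin_of_error(10.0, n), 2))
```

Because of the square root, quadrupling the sample size only halves the margin of error.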

Question 42

Q

True or False: Increasing the sample size can decrease the margin of error.

Answer

A

True

Question 43

Q

What is the importance of random sampling?

Answer

A

It helps ensure that the sample is representative of the population, reducing bias.

Question 44

Q

Multiple Choice: Which of the following is a common misconception about statistical significance?

Answer

A

Statistical significance implies practical significance.

Question 45

Q

What is a power analysis?

Answer

A

A method used to determine the sample size required to detect an effect of a given size at a chosen significance level and power.
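
A minimal sketch of such a calculation, assuming a two-sided, two-sample z-test with equal group sizes and a standardized effect size d (difference in means divided by the common standard deviation). The formula used is n per group = 2 * ((z_(1 - alpha/2) + z_power) / d)^2. A t-test-based calculation gives slightly larger numbers.

```python
from math import ceil
from statistics import NormalDist

def sample_size_per_group(effect_size, alpha=0.05, power=0.80):
    """Approximate n per group for a two-sided, two-sample z-test."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_power = NormalDist().inv_cdf(power)
    return ceil(2 * ((z_alpha + z_power) / effect_size) ** 2)

n_needed = sample_size_per_group(0.5)  # medium standardized effect, chosen for illustration
print(n_needed)
```

Smaller effects or higher power targets push the required sample size up quickly, since the effect size enters the formula squared.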

Question 46

Q

Fill in the blank: The _______ is the probability of correctly rejecting the null hypothesis.

Answer

A

power

Question 47

Q

What is effect size?

Answer

A

A quantitative measure of the magnitude of a phenomenon.

Question 48

Q

True or False: Effect size can help to interpret the practical significance of a result.

Answer

A

True

Question 49

Q

What does the term ‘sampling distribution’ refer to?

Answer

A

The probability distribution of a statistic over all possible samples of a given size drawn from a specific population.

Question 50

Q

Multiple Choice: Which of the following is a key assumption of linear regression?

Answer

A

Linearity of relationships.

Question 51

Q

What is the significance of the F-test in ANOVA?

Answer

A

It tests whether there are significant differences between group means.
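
The F statistic itself is the ratio of between-group to within-group variability. A sketch for a one-way layout with made-up data follows; turning F into a p-value requires an F distribution, which is not in the standard library and is omitted here.

```python
from statistics import mean

def f_statistic(groups):
    """One-way ANOVA F: mean square between groups / mean square within groups."""
    all_values = [v for g in groups for v in g]
    grand_mean = mean(all_values)
    k, n = len(groups), len(all_values)
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((v - mean(g)) ** 2 for g in groups for v in g)
    return (ss_between / (k - 1)) / (ss_within / (n - k))

groups = [[1, 2, 3], [2, 3, 4], [3, 4, 5]]  # hypothetical measurements
f_value = f_statistic(groups)
print(f"F = {f_value:.2f}")
```

A large F means the group means differ by more than the spread within groups would suggest.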

Question 52

Q

Fill in the blank: The _______ is a summary measure that describes the strength and direction of a linear relationship between two variables.

Answer

A

correlation coefficient

Question 53

Q

What is a confidence level?

Answer

A

The percentage of times that a confidence interval would contain the true population parameter if the same sampling method were repeated.

Question 54

Q

True or False: A 99% confidence interval is wider than a 95% confidence interval.

Answer

A

True (for the same data)
