Statistical Inference Flashcards

1
Q

What is statistical inference?

A

The process of drawing conclusions about a population based on a sample.

2
Q

True or False: Statistical inference can only be performed with large sample sizes.

A

False.

3
Q

Fill in the blank: A _______ is a numerical summary of a sample.

A

statistic

4
Q

What is a population in statistical terms?

A

The entire group of individuals or instances about whom we hope to learn.

5
Q

What is a sample?

A

A subset of the population selected for analysis.

6
Q

Multiple Choice: Which of the following is a common method for selecting a sample?

A

Random sampling.

7
Q

What is the purpose of hypothesis testing?

A

To determine whether there is enough evidence to reject a null hypothesis.

8
Q

True or False: The null hypothesis typically represents a statement of no effect or no difference.

A

True.

9
Q

Fill in the blank: The _______ is the probability of making a Type I error.

A

alpha level

10
Q

What is a Type I error?

A

Rejecting the null hypothesis when it is actually true.

11
Q

What is a Type II error?

A

Failing to reject the null hypothesis when it is false.

12
Q

Multiple Choice: Which of the following is a common significance level used in hypothesis testing?

A

0.05.

13
Q

What does a p-value represent?

A

The probability of obtaining test results at least as extreme as the observed results, assuming the null hypothesis is true.
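
As a rough illustration of this definition, the sketch below computes an exact two-sided p-value for a coin-flip experiment. The function name `binomial_two_sided_p` and the data (9 heads in 10 flips) are hypothetical, and "at least as extreme" is taken to mean "no more probable than the observed outcome under the null", which is one common convention rather than the only one.

```python
from math import comb

def binomial_two_sided_p(k, n, p0=0.5):
    """Exact two-sided p-value for k successes in n trials under H0: p = p0."""
    # Probability of every possible outcome, assuming the null hypothesis is true.
    pmf = [comb(n, i) * p0**i * (1 - p0)**(n - i) for i in range(n + 1)]
    observed = pmf[k]
    # Sum the outcomes that are no more likely than the observed one.
    # The tiny tolerance guards against floating-point ties.
    return sum(p for p in pmf if p <= observed * (1 + 1e-12))

p_value = binomial_two_sided_p(k=9, n=10)
print(round(p_value, 4))  # 0.0215, i.e. 22/1024
```

Here the outcomes 0, 1, 9, and 10 heads are all at least as extreme as 9 heads, so the p-value is (1 + 10 + 10 + 1) / 1024.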

14
Q

True or False: A smaller p-value indicates stronger evidence against the null hypothesis.

A

True.

15
Q

What is a confidence interval?

A

A range of values, computed from a sample, that is constructed to contain the unknown population parameter with a stated level of confidence.

16
Q

Fill in the blank: A 95% confidence interval means that if we were to take many samples, _______ of those intervals would contain the true population parameter.

A

95%
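
The repeated-sampling reading of a confidence interval can be checked with a small simulation. This is only a sketch under stated assumptions: a normal population with known standard deviation, a z-based interval, and hypothetical parameter values (`mu`, `sigma`, `n`, `trials`) chosen for illustration.

```python
import random
from statistics import NormalDist

def coverage(trials=2000, n=25, mu=10.0, sigma=2.0, level=0.95, seed=0):
    """Fraction of simulated z-intervals that contain the true mean mu."""
    rng = random.Random(seed)
    z = NormalDist().inv_cdf(0.5 + level / 2)   # about 1.96 for 95%
    half_width = z * sigma / n ** 0.5           # sigma is assumed known
    hits = 0
    for _ in range(trials):
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        sample_mean = sum(sample) / n
        # The interval is sample_mean +/- half_width; check whether it covers mu.
        if abs(sample_mean - mu) <= half_width:
            hits += 1
    return hits / trials

print(coverage())  # expected to land close to 0.95
```

Any single interval either contains the parameter or it does not; the 95% describes the long-run behaviour of the procedure.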

17
Q

What is the Central Limit Theorem?

A

The theorem stating that the distribution of sample means approaches a normal distribution as the sample size increases, regardless of the shape of the population distribution (provided it has finite variance).
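
One way to see the theorem at work is to draw sample means from a clearly non-normal population and watch their skewness shrink as the sample size grows. The sketch below assumes an exponential population (skewness 2); the helper names and the choice of sample sizes are illustrative only.

```python
import random

def skewness(values):
    """Moment-based skewness: third central moment over variance^1.5."""
    n = len(values)
    m = sum(values) / n
    m2 = sum((v - m) ** 2 for v in values) / n
    m3 = sum((v - m) ** 3 for v in values) / n
    return m3 / m2 ** 1.5

def sample_mean_skewness(n, reps=4000, seed=1):
    """Skewness of `reps` sample means, each from n exponential draws."""
    rng = random.Random(seed)
    means = [sum(rng.expovariate(1.0) for _ in range(n)) / n
             for _ in range(reps)]
    return skewness(means)

for n in (1, 5, 50):
    print(n, round(sample_mean_skewness(n), 2))
```

A normal distribution has skewness 0, so values moving toward 0 as `n` grows are consistent with the sample means becoming more nearly normal.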

18
Q

True or False: The Central Limit Theorem applies only to normally distributed populations.

A

False.

19
Q

What is a point estimate?

A

A single value estimate of a population parameter.

20
Q

Multiple Choice: Which of the following is NOT a method of estimation?

A

Extrapolation.

21
Q

What is the difference between descriptive statistics and inferential statistics?

A

Descriptive statistics summarize data, while inferential statistics draw conclusions about a population based on sample data.

22
Q

Fill in the blank: A _______ is a graphical representation of the distribution of a dataset.

A

histogram

23
Q

What is the purpose of a t-test?

A

To determine if there is a significant difference between the means of two groups.
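
A minimal sketch of the two-sample case, using Welch's version of the t-test (which does not assume equal variances). The function name `welch_t` and the two small data sets are hypothetical. The code stops at the test statistic and degrees of freedom; turning them into a p-value requires the t distribution, which is not in the Python standard library.

```python
from math import sqrt
from statistics import mean, variance

def welch_t(a, b):
    """Return (t statistic, Welch-Satterthwaite degrees of freedom)."""
    va, vb = variance(a) / len(a), variance(b) / len(b)  # squared standard errors
    t = (mean(a) - mean(b)) / sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df

group_a = [1, 2, 3, 4, 5]   # illustrative data only
group_b = [2, 3, 4, 5, 6]
t, df = welch_t(group_a, group_b)
print(round(t, 3), round(df, 1))  # -1.0 8.0
```

The statistic is the difference in means divided by its estimated standard error, so a value near 0 gives little evidence of a difference.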

24
Q

True or False: The t-test can only be used for normally distributed data.

A

False.

25
Q

What is ANOVA?

A

Analysis of Variance, a method used to compare means among three or more groups.

26
Q

Multiple Choice: Which of the following is a prerequisite for conducting ANOVA?

A

Independence of observations.

27
Q

What is a chi-square test used for?

A

To determine if there is a significant association between categorical variables.

28
Q

Fill in the blank: The _______ test is used to compare observed frequencies with expected frequencies.

A

chi-square
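
The comparison of observed and expected frequencies reduces to a single sum. The sketch below computes the goodness-of-fit statistic for a hypothetical die rolled 120 times; the counts are made up for illustration.

```python
def chi_square_statistic(observed, expected):
    """Sum of (observed - expected)^2 / expected over all categories."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

observed = [18, 22, 20, 25, 15, 20]   # hypothetical counts for faces 1..6
expected = [20] * 6                   # a fair die: 120 rolls / 6 faces
stat = chi_square_statistic(observed, expected)
print(round(stat, 2))  # 2.9

# With 6 - 1 = 5 degrees of freedom, the 0.05 critical value is about 11.07,
# so a statistic of 2.9 would not lead to rejecting the fair-die hypothesis.
```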

29
Q

True or False: Correlation implies causation.

A

False.

30
Q

What does a correlation coefficient of 1 indicate?

A

A perfect positive linear relationship between two variables.

31
Q

What does a correlation coefficient of -1 indicate?

A

A perfect negative linear relationship between two variables.
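
The two extreme cases can be reproduced with a hand-written Pearson correlation. This is a sketch with made-up data; `pearson_r` is an illustrative helper, not a library function.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation: covariance scaled by both standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

x = [1, 2, 3, 4, 5]
print(round(pearson_r(x, [3, 5, 7, 9, 11]), 2))   # 1.0  (y = 2x + 1)
print(round(pearson_r(x, [10, 8, 6, 4, 2]), 2))   # -1.0 (y = 12 - 2x)
print(round(pearson_r(x, [2, 1, 4, 3, 5]), 2))    # 0.8  (positive but imperfect)
```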

32
Q

Multiple Choice: What is the range of values for a correlation coefficient?

A

-1 to 1.

33
Q

What is regression analysis used for?

A

To model the relationship between a dependent variable and one or more independent variables.

34
Q

Fill in the blank: In regression analysis, the _______ variable is the one being predicted.

A

dependent

35
Q

True or False: In linear regression, the relationship between variables is assumed to be linear.

A

True.

36
Q

What is multicollinearity?

A

A situation in regression analysis where independent variables are highly correlated.

37
Q

What is the purpose of residual analysis in regression?

A

To check the validity of the regression model by analyzing the differences between observed and predicted values.
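
To make "differences between observed and predicted values" concrete, the sketch below fits a one-predictor least-squares line and lists the residuals. The data and helper name `fit_line` are hypothetical; a real analysis would usually also plot the residuals against the fitted values.

```python
def fit_line(x, y):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    slope = (sum((a - mx) * (b - my) for a, b in zip(x, y))
             / sum((a - mx) ** 2 for a in x))
    intercept = my - slope * mx
    return intercept, slope

x = [1, 2, 3, 4, 5]                   # independent variable (illustrative)
y = [2.1, 3.9, 6.2, 7.8, 10.1]        # dependent variable (illustrative)
intercept, slope = fit_line(x, y)
print(round(intercept, 2), round(slope, 2))  # 0.05 1.99

# Residual = observed value minus the value predicted by the fitted line.
residuals = [b - (intercept + slope * a) for a, b in zip(x, y)]
print([round(r, 2) for r in residuals])
```

With an intercept in the model, least-squares residuals sum to zero, so the useful diagnostics are their pattern and spread rather than their average.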

38
Q

Multiple Choice: Which of the following is a common criterion for model selection in regression?

A

Adjusted R-squared.

39
Q

What is the difference between parametric and non-parametric tests?

A

Parametric tests assume the data follow a specific distribution (often the normal distribution), while non-parametric tests do not.

40
Q

Fill in the blank: Non-parametric tests are often used when data does not meet _______ assumptions.

A

parametric

41
Q

What is the role of sample size in statistical inference?

A

Larger sample sizes generally provide more reliable estimates and reduce sampling error.
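
The effect of sample size on precision can be sketched with the margin of error for a mean. This assumes a known population standard deviation and a z-based interval; `margin_of_error` and the value `sigma=10.0` are illustrative.

```python
from statistics import NormalDist

def margin_of_error(sigma, n, level=0.95):
    """Half-width of a z-interval for the mean: z * sigma / sqrt(n)."""
    z = NormalDist().inv_cdf(0.5 + level / 2)
    return z * sigma / n ** 0.5

for n in (25, 100, 400):
    print(n, round(margin_of_error(sigma=10.0, n=n), 2))
# 25 3.92
# 100 1.96
# 400 0.98
```

Because the margin shrinks with the square root of `n`, quadrupling the sample size only halves it.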

42
Q

True or False: Increasing the sample size can decrease the margin of error.

A

True.

43
Q

What is the importance of random sampling?

A

It helps ensure that the sample is representative of the population, reducing bias.

44
Q

Multiple Choice: Which of the following is a common misconception about statistical significance?

A

Statistical significance implies practical significance.

45
Q

What is a power analysis?

A

A method used to determine the sample size required to detect an effect of a given size with a specified power and significance level.
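
A common textbook approximation for comparing two group means gives the sample size per group as n = 2 * ((z_(1 - alpha/2) + z_power) / d)^2, where d is the standardized effect size. The sketch below applies it; the function name `n_per_group` is hypothetical, and the normal approximation slightly understates what a t-based calculation would give.

```python
from math import ceil
from statistics import NormalDist

def n_per_group(effect_size, alpha=0.05, power=0.80):
    """Approximate per-group sample size for a two-sided, two-sample test."""
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2)   # critical value for the test
    z_power = z.inv_cdf(power)           # quantile matching the target power
    return ceil(2 * ((z_alpha + z_power) / effect_size) ** 2)

print(n_per_group(0.5))  # 63
print(n_per_group(0.2))  # 393
```

Smaller effects need far larger samples: halving the effect size roughly quadruples the required n.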

46
Q

Fill in the blank: The _______ is the probability of correctly rejecting the null hypothesis.

A

power

47
Q

What is effect size?

A

A quantitative measure of the magnitude of a phenomenon.
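
One widely used effect size for a difference in means is Cohen's d, the mean difference divided by the pooled standard deviation. The sketch below is a minimal version with made-up data; it is one of several effect-size measures, not the only one.

```python
from math import sqrt
from statistics import mean, variance

def cohens_d(a, b):
    """Standardized mean difference using the pooled sample variance."""
    na, nb = len(a), len(b)
    pooled_var = ((na - 1) * variance(a) + (nb - 1) * variance(b)) / (na + nb - 2)
    return (mean(a) - mean(b)) / sqrt(pooled_var)

treated = [2, 3, 4, 5, 6]   # illustrative data only
control = [1, 2, 3, 4, 5]
print(round(cohens_d(treated, control), 3))  # 0.632
```

Unlike a p-value, d does not grow just because the sample gets larger, which is why it is useful for judging magnitude.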

48
Q

True or False: Effect size can help to interpret the practical significance of a result.

A

True.

49
Q

What does the term ‘sampling distribution’ refer to?

A

The probability distribution of a statistic across all possible samples of a given size drawn from a specific population.

50
Q

Multiple Choice: Which of the following is a key assumption of linear regression?

A

Linearity of relationships.

51
Q

What is the significance of the F-test in ANOVA?

A

It tests whether there are significant differences between group means.
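
The F statistic compares variation between group means with variation within groups. The sketch below computes it for a one-way layout with three small, made-up groups; converting F to a p-value needs the F distribution, which is outside the Python standard library.

```python
def one_way_f(groups):
    """F = (between-group mean square) / (within-group mean square)."""
    all_values = [v for g in groups for v in g]
    grand_mean = sum(all_values) / len(all_values)
    # Between groups: how far each group mean sits from the grand mean.
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    # Within groups: how far each value sits from its own group mean.
    ss_within = sum((v - sum(g) / len(g)) ** 2 for g in groups for v in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)

groups = [[1, 2, 3], [2, 3, 4], [3, 4, 5]]   # illustrative data only
print(one_way_f(groups))  # 3.0, on (2, 6) degrees of freedom
```

An F near 1 suggests the group means differ no more than chance variation within groups would produce.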

52
Q

Fill in the blank: The _______ is a summary measure that describes the strength and direction of a linear relationship between two variables.

A

correlation coefficient

53
Q

What is a confidence level?

A

The percentage of times that a confidence interval would contain the true population parameter if the same sampling method were repeated.

54
Q

True or False: A 99% confidence interval is wider than a 95% confidence interval.

A

True.