STA441 Flashcards

1
Q

Quantitative variable

A

Representing amount of something

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Categorical

A

Codes represent category membership

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Explanatory Variable

A

Predictor or cause (Contributing factor)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Response Variable

A

Predicted or effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Statistic

A

Numbers that can be calculated from sample data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Parameters

A

Numbers that could be calculated if we knew the whole population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Distribution

A

Population histogram

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Conditional distribution

A

For each value x of the explanatory variable X, there is a separate distribution of the response variable Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

When are response and explanatory variables unrelated

A

If the conditional distribution of the response variable is identical for each value of the explanatory variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

When are response and explanatory variables related

A

If the distribution of the response variable does depend on the value of the explanatory variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Null hypothesis

A

Explanatory and response variable are unrelated in the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

P-value

A

The probability of getting our results (or better) just by chance.

i.e. the minimum significance level at which the null hypothesis can be rejected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Type 1 error

A

Null hypothesis is true, but we reject it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Type 2 error

A

Null hypothesis is false, but we fail to reject it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Power

A

Probability of correctly rejecting the null hypothesis.

i.e. 1-P(type 2 error)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Confidence interval

A

Pair of numbers chosen so that the probability they will enclose the parameter is large (e.g. 0.95)

17
Q

What to say if results are not statistically significant

A

The data does not provide enough evidence to conclude that the variables are related

18
Q

Independent observations

A

Simple random sampling.
Cases are not linked.

19
Q

Assumption and usage of Independent T-test

A

Random sampling, independently from 2 normal populations.
Possible different population means.
Same population variance.
Compares 2 means.

20
Q

One or two tailed tests? How to draw directional conclusion?

A

Always two-tailed. Can draw directional conclusion based on estimates of the parameters.

21
Q

Assumption and usage of two-sample t-test

A

If both samples are large, normality and equal variance does not matter much.
Observations are independent.
Random sampling of pairs.
Differences are normally distributed.
Compare difference of 2 explanatory variables.

22
Q

Between cases

A

A case contributes exactly one explanatory and one response variable value.

23
Q

Within cases

A

A case contributes several pairs (explanatory and response), usually one pair for each value of the explanatory variable.

24
Q

ANOVA

A

Extension of independent t-test: More than two values of the explanatory variable.

25
Q

Simple regression and correlation

A

One explanatory variable.
Random sampling of (X,Y) pairs.
Variance all equal.
Response variable is quantitative.
Explanatory variable usually quantitative.
r = 0 indicates no linear relationship (slope of least squares line is 0)
r^2 is proportion of variation explained.

26
Q

Chi-squared test of independence

A

Both variables are categorical.
Large random sample.
The variable consisting of combinations of explanatory variable, response variable, has multinomial distribution.
Lowest expected frequency no more than 5.
Independent observations are important.

27
Q

Confounding variable

A

A variable that contributes to both explanatory and response variable, causing a misleading relationship between them.

28
Q

Observational study

A

Explanatory variable and response variable just observed and recorded

29
Q

Experimental study

A

Cases randomly assigned to values of explanatory variable

30
Q

How to write conclusion to pure observational studies

A

There is enough evidence to suggest that X is related to Y