Week 6 - Introduction to statistics Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

statistics

A

tools for describing and measuring the properties of populations from samples.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

goal of statistics

A

To estimate the values of important parameters, including: means, proportions, variances, effects
To test hypotheses about those parameters - are estimates more extreme than we expect simply by chance? If yes, results are significant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

null hypothesis

A

a specific statement about a population parameter made for the purposes of argument. A good null hypothesis is a statement that would be interesting to reject.
Tested
Rejected in support of the alternative hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

alternative hypothesis

A

includes all other feasible values for the population parameter besides the value stated in the null hypothesis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

hypothesis testing steps

A

State the hypotheses
Compute the test statistic with the data
Determine the p-value
Draw appropriate conclusions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

stating the hypothesis

A

Alternative hypothesis is two-sided (includes parameter values on both sides of the parameter value specified by the null hypothesis)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

test statistic

A

A number calculated from the data that is used to evaluate how compatible the data are with the result expected under the null hypothesis
Evaluates how well the observed result matches the result claimed by the null hypothesis
Some mismatch is expected by chance so is our result too extreme for chance to explain?
To decide we need a suitable test statistic
Then we need its p-value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

test statistic = signal/noise

A

Signal - value of interest
Noise - uncertainty of value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

the null distribution

A

The sampling distribution of outcomes for a test statistic under the assumption that the null hypothesis is true
Distribution of test statistic when the null is true
Shape varies for different test statistics and sample size but:
All possible values is on x-axis
The densities of values are on the y axis
Densities sum to 1, giving probabilities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

quantifying uncertainty: the p value

A

The probability of obtaining the data (or data showing as great or greater difference from the null hypothesis) if the null hypothesis were true
The refers to the probability of a specific event when sampling data under the null hypothesis: it is the probability of obtaining a result as extreme as or more extreme than the observed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

draw the appropriate conclusion

A

The significance level is a probability used as a criterion for rejecting the null hypothesis. If the p value is less than or equal to a then the null hypothesis is rejected. If the p-value is greater than a the null hypothesis is not rejected
The rejection threshold is called the significance level (a)
Usually a = 0.05

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

reporting the results

A

Summary of statistical test:
The value of the test statistic
The sample size
The p-value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

type I error

A

rejecting the true null hypothesis. The significance level sets the probability of committing a Type I error. False positive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

type II error

A

failing to reject a false null hypothesis. False negative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

the power

A

of a test is the probability that a random sample will lead to rejection of a false null hypothesis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

one sided test

A

the alternative hypothesis includes parameter values only one side of the value specified by the null hypothesis. Rejected only if the data depart from it in the direction stated by p-value.

17
Q

paired design

A

both treatments are applied to every sampled unit.
Similar conditions in each block - can control for some noise
Usually more powerful

18
Q

two sample design

A

each treatment group is composed of an independent, random sample of units.
Every plot is independent

19
Q

paired comparisons of means

A

Paired measurements are converted to a single measurement by taking the difference between them

20
Q

paired comparisons of means assumptions

A

The sampling units are randomly sampled from the population
The paired differences have a normal distribution in the population

21
Q

pooled sample variance

A

the average of the variances of the samples weighted by their degrees of freedom.

22
Q

two sided and two sample assumptions

A

Each of the two samples is a random sample from its population
The numerical value is normally distributed in each population
The standard deviation of the numerical variable is the same in both populations

23
Q

welch’s

A

compares the means of two groups and can be used even when the standard deviations of the two groups are not equal.

24
Q

contingency analysis

A

estimates and tests for an association between two or more categorical variables.

25
Q

Chi squared test H0 and Ha

A

H0: variables are independent

Ha: variables are not independent

26
Q

Chi squared test

A

If two events are independent, then, the probability of both occurring is equal to the probability of one event occurring times the probability of the other event occurring
Use degrees of freedom
Same assumptions as goodness-of-fit tests: samples are independent and no more than 20% of cells have an expected frequency of less than 5 and no cells have an expected frequency of less than 1