Final Exam Flashcards
null hypothesis (H0)
statement that is the skeptical viewpoint of your research question
- no difference
4 steps of a hypothesis tests
- define null and alternative hypothesis
- establish null distribution
- conduct statistical test
- draw scientific conculsions
null distribution
sampling distribution we expect from sampling a statistical population when the null hypothesis is trues
alternative hypothesis (HA)
statement that is the positive viewpoint viewpoint of your research question
- everything not in null (mutually exclusive)
- there is a difference
3 factors to the hypotheses
- mutually exclusive
- they describe all possible outcomes = exhaustive
- null always includes the equality statement
non-directional hypothesis
state that there should be a difference in alternative hypothesis
directional hypothesis
state that the difference should be in a specific direction (smaller vs. larger)
statistical inference
conclusion that a set of data are unlikely to come from the null hypothesis
statistical decision
whether we believe our data came from the null distribution or not
- if its likely data came from null distribution = “fail to reject”
- if it is unlikely data came from null distribution = “reject null”
2 probabilities for null distribution
- type 1 error rate
- p-value
type 1 error rate (alpha)
probability of rejecting the null hypothesis when it is true
- set by researcher without any inference to data
p-value (p)
probability of seeing your data, or something more extreme, under the null hypothesis
- are under curve from data to more extreme values
rules of making statistical decision
- if p-value is less than type 1 error rate, then we “reject null hypothesis”
- if p-value is greater than or equal to type 1 error rate, then we “fail to reject null hypothesis”
what the scientific conclusions consider
- strength of inference: how strong evidence is
- effect size: only consider it when we reject null hypothesis (small = low impact)
error rates
probability of making a mistakes
- type I and II have an inverse relationship (when one increases the other decreases)
type II error rates
probability of failing to reject null hypothesis when it is false
- area under alternative distribution from data point to something more extreme
types of t-tests
- single-sample t-tests
- paired-sample t-tests
- two-sample t-tests
single-sample t-tests
evaluate whether mean of your sample is different from some reference value
ex. is mean test score from a sample of high school students different than national standards
paired-sample t-tests
evaluate whether mean of paired data is different from some reference value
- looks at changes in a SU
ex. does tutoring improve grade for a student
two-sample t-tests
evaluate whether mean of two groups are difference from each other (compare two groups)
ex. do dogs sleep more than cats
mean
= m
reference value
= mew - u
- it is given
the reporting of a single-sample t-test should include…
- sample mean and standard deviation
- observed t-score
- degrees of freedom
- p-value
observed t-score
calculated using sample mean, standard deviation, size and reference value
reference value for paired t-tests
typically 0
null and alternative hypotheses for paired t-tests
the statements about how difference between the paired measurements is related to reference value
scientific conclusions for paired t-tests
- if we reject null = the sample data provide strong evidence that the difference between the paired measurements is different from reference value
- if we fail to reject null = the sample data do not provide strong evidence that the difference between paired measurements is different from reference value
the reporting of a paired t-test should include…
- mean difference between paired measurements, and standard deviation of the differences
- observed t-score
- degrees of freedom
- p-value
sample means for two-sample t-test
m1=sample mean of first group
m2=sample mean of second group
- can change
scientific conclusions for two-sample t-tests
- if we reject the null = the sample data provide strong evidence that the means of the two groups are different
- if we fail to reject the null = the sample data do not provide strong evidence that the means of the two groups are different
the reporting of a two-samples t-test should include…
- mean, satndard deviation, and sample size for each group
- observed t-score
- degrees of freedom
- p-value
expected contingency table
expected frequencies under null hypothesis
1-way contingency table
one categorical variable
- is there a difference in counts among the levels of that variable?
- all counts are distributed equally
key features of 1-way contingency table
- ECT is always given as counts
- sum of all expected counts must be same as sum of all counts in observed contingency table
- ECT has fractional values
2-way expected contingency table
two categorical variables
- looking for an interaction between the variables
- counts are distributed independently among cells
ex. is age independent of year
calculating 1 way
calculate marginal distributions as proportions
- row and column sums/table total
calculating 2 way
product of row and column proportions for each cell x table total
chi-squared distributions
measure of the distance between the observed and expected contingency tables
steps to calculating chi-squared distributions
- take difference between each observed and expected cell
- square the difference
- divide by the expected value
- sum over all cells in the table
chi-squared distribution
when you sample an imaginary statistical population where the null hypothesis is true, you would get the distribution of chi-squared scores
key features of chi-squared distribution
- area under curve sums to one
- degrees of freedom determines shape of distribution - different for 1 and 2 way tables
- only positive values
chi-squared test
used with only categorical data and contains the variation we expect from sampling error rate
- always directional
statistical decision of chi-squared test
- reject the null if observed score is greater than critical score or if p-value is less than type 1 error rate (a)
- fail to reject the null is observed score is less than or equal to critical score or if p-value is greater than or equal to a
what side do p-value and type 1 error always go on in chi-squared tests
the right side
scientific conclusions for 1-way tables
- reject null and conclude theres evidence to support that the counts are not equal among cels
- fail to reject null conclude that there is not evidence to support that the counts are not equal among cells
scientific conclusions for 2-way tables
- reject null and conclude there is evidence to support that the variables are not independent of each other
- fail to reject null and conclude there is no evidence to support that the variables are not independent of each other
the reporting of a chi-squared test should include…
- short name of the test (X2)
- degrees of freedom
- total count in the observed table yes
- observed chi-squared value
- p-value
factors of a correlation test
- no implied causation between variables (one variable doesn’t cause another)
- both variables are assumed to have variation
- is not used for prediction
pearsons correlation coefficient
measures the strength of association between two numerical variables
r=measuring from sample
p=population parameter - about stats pop
correlation coefficients
p=-1 indicates a perfect negative correlation
p=0 indicates no association
p=1 indicates a perfect positive correlation
assumptions behind a correlation tests
- each pair of numerical values is measured on same sampling unit
- numerical values come from continuous numerical distributions with non-0 variation
- association = straight lines
null and alternative hypothesis for correlation tests
H0: correlation coefficient is 0
HA: correlation coefficient is not 0
directional = if there is a positive or negative association
null distribution for correlation tests
sampling distribution of correlation coefficients from a statistical population with no association between variables (p=0)
statistical decision for correlation tests
same as chi-squared tests