Samples and Sampling Flashcards
why are statistics important
- to analyze data and draw conclusions
- quantify uncertainty
- making predictions
- assessing evidence
- sampling populations
what are two goals of statistics
estimation and hypothesis testing
parameters
quantities describing populations being studied
estimates relate to
samples
how are estimates and parameters linked
inferring a parameter is done through the use of estimates
examples of parameters
- averages
- numbers (size of pop)
- variants (spread of data)
- proportions (precent something is true)
how is a null hypothesis used in hypothesis testing
start with a null hypothesis stating/assuming there is no difference or effect regarding the testable quantity of a population and through the tests either support or reject the relationship
how are estimates and hypothesis testing related
your estimate is what is used for the hypothesis testing
what are reliable population estimates dependent on
a good sampling practice
what kind of samples are most desirable for science/ stats
random samples
why are random samples wanted for stats
limits possibility of bias
what is a population
the entire group/individual units being studied that are too large to measure individually
examples of populations
- all cats falling from buildings in a city
- all fish in a lake
- all genes in a genome
what is a sample
selection of the subset of a population used to draw conclusions that ideally apply to the whole population
are samples smaller or larger than the population
smaller
examples of samples
- cats taken to the vet (after falling from buildings)
- random selection of fish in a lake
are sampling errors mistakes
NO - just differences between the estimate and the true value seen in the population
how will estimates differ from population characteristics
by random chance
is sampling error related to precision or accuracy
precision
high vs low sampling error
high
- estimates are more spread out = imprecise = high error
low
- estimates are close together = precise = low error
what defines an unbiased sample
when the average of estimates MATCHES the true population value
what is bias a symptom of
sampling problem
is bias related to precision or accuracy
accuracy
high vs low bias
high
- estimates may be close together but FAR from the true value = inaccurate = biased
low
- estimates may be close or far apart, but are average or even on the true value in the pop = accurate = unbiased