Module 1 & 2 Flashcards
Introduction to Statistics and Experimental Design
What is a sample?
A subset of individuals from a population of interest
What is a population?
Set of all subjects relevant to the scientific hypothesis under examination
What is a statistic?
A value calculated from a sample, used to estimate population parameters
What is a parameter?
A true measurement that describes a population
What is a statistical hypothesis?
A claim regarding a population parameter
What is sampling error?
The deviation from estimates and a true population parameter, purely based on chance
What are the characteristics of a good sample?
1) It is a random sample
2) It is precise
3) It is unbiased
What does precision refer to?
The spread of values for an estimate due to sampling error
What is the relationship between sample size and precision?
Higher sample size = higher precision and lower sampling error
What does bias refer to?
Systematic discrepancy between estimates from multiple samples and the true population parameter
What are 2 types of non-random samples?
1) Sample of convenience
2) Volunteer sample
What are 2 types of studies?
1) Experimental, where treatments are assigned
2) Observational, where treatments are not assigned by the researcher
What can be a problem for observation studies?
Confounding variables - variables that influence the outcome, which is not accounted for
What are the 2 types of variables
1) Qualitative/Categorical (membership)
2) Quantitative (magnitude)
What are the measurement scales for qualitative data?
Nominal and ordinal
What are the 2 types of quantitative data?
Continuous and discrete
What are the measurement scales for quantitative data?
Ratio and interval
What is descriptive statistics?
Quantities that describe the population
What are the components of descriptive statistics?
1) Shape
2) Spread
3) Location
4) Frequency distribution
What is frequency distribution?
Describes the number of times a particular value of a variable occurs in a sample (can be absolute or relative)
What are 2 ways we can depict frequency distributions?
Bar graphs or histograms
When is it best to use a histogram?
When we are looking at the frequency distribution of a numerical data set
When is it best to use a bar graph?
When we are looking at categorical data sets
What are the 3 types of distribution?
1) Frequency distribution
2) Probability distribution
3) Sampling distribution