Biostatistics Flashcards
What is Biostatistics defined as?
-the application of statistical theory in medicine, public health, or biology
Population
- large group of all subjects interest in a study.
- very large almost impossible to collect data from whole population
- have to select a subset of subjects to get samples from a population, then generalize findings from samples to the target population
conceptual population
-population of persons people/ identity
population in clinical setting?
-usually talk about it as population of measurements such as weight, height, and blood pressure
Total population
- target population
- all of the subjects of interest in a study, about which the study wants to generalize the conclusion
- ex: all 12 year olds in US
Defined population
- a subpopulation confined by certain characteristic(s) such as demographics and geographical areas in the total population
- ex: 12 year olds in RUSD
Study population
The group of individuals in a study
- In a clinical trial, all participants who followed criteria of inclusions and exclusions make up the study population
- ex 12 year olds in certain school in RUSD
Sample
- a procedure to select samples from the study population, they are representative of the total population
- validity based on how random and well rounded this sample group is
How do we estimate the required sample size?
1) pre-determined power (80/90%)
2) specific significance level
3) mean & variance of the primary outcome; can be approximated
4) the design of the study
what is a statistic?
- calculated from a sample for a specific characteristic of the sample
- will be used to estimate the corresponding parameter of the study population & further generalized to the target population to find mean & standard deviation
Why do we sampling to get samples?
Money, Time, Practicality, and Accuracy
Probability sampling
- each subject has a known probability of being selected
1) simple random sampling
2) stratified random sampling
3) Systematic random sampling
4) Clustered random sampling
simple random sampling?
based on probability to take the samples
-each member of the subset has an equal probability of being chosen
stratified random sampling?
- stratify the study population into subgroups, then take random samples from such subgroups
- ex: names of 25 employees being chosen out of a hat from a company of 250
Systematic random sampling
- type of probability sampling
- members from a larger population are selected according to a random starting point and a fixed, periodic interval
- interval, is calculated by dividing the population size by the desired sample size
Clustered random sampling
- the researcher divides population into separate groups (clusters)
- a random sample of clusters is selected from the pop
- researcher conducts his analysis on data from the sampled clusters
non-probability sampling
- each subject doesnβt know probability of being selected, βstimations are biased
1) voluntary samples
2) convenience samples
voluntary samples
-whoever is self selected into the samples
convenience samples
-whoever is convenient to be selected and/or investigated. -ex: staff members in med school recruited for some trials
Sampling errors
- random errors
- are unavoidable
- the differences between the sample & population, due to sampling randomness
sampling errors effect on the data?
- Random error does not have consistent effects across the entire sample
- the sum would be zero if the sample size is large enough.
what does sampling error add? what does it not affect?
-random error adds variability to the data but doesnβt affect average performance of the samples
Non-sampling errors
- more serious due to mistakes made in the acquisition of data, inappropriate sample selection, or response biases
- can bias estimation