WEEK 9: STATISTICS Flashcards
What is biostatistics
is the science of analyzing data and interpreting the results so that they can be applied to solving problems related to biology, health, or related fields
what is univariate analysis
describes one variable in a data set using simple statistics like counts (frequencies), proportions, and averages
what is bivariable analysis
uses rate ratios, odds ratios, and other comparative statistical tests to examine the associations between two variables (mostly exposure and outcome)
what is multivariable analysis
analysis encompasses statistical tests such as multiple regression models that examine the relationships among three or more variables
what is a variable
Any quantity that varies from one entity to another (sometime within an entity over time)
- any attribute, phenomenon or event that can have different values
what are the 2 types of variables
quantitative and qualitative
nominal variables (qualitative)
- no intristic or logical order or value
- ex. university programs
- you can assign numbers to a different categories
- do not have any other numeric properties
ordinal variables (qualitative)
Intrinsic value but with no clear or equal differences between levels (a set of ordered categories)
- ex. mild vs. moderate vs. severe pain
- rating scales
3 ways to display qualitative data (nominal, ordinal)
pie chart, bar chart, frequency tables
numeric variable (quantitative)
- any positive real number, depends on the nature of the variable can be expressed in decimals
- meaningful numeric scales
- age, blood pressure, # of friends, temperature
- assigned numbers have total mathematical meaning
continuous variable
- can take any value within a range
- ex. a persons height. can be 60 inches
- blood pressure, temp.
- plotted as. a line
discrete variable
- can take a finite or limited number of values
- not continious
- a family can not own 10 1/2 cars
- age in year, number of drinks
- can be plotted as dots
quantitative variables: interval vs ratio
interval:
- difference is meaningful
- no natural zero
ratio:
- ratio is meaningful
- zero means absense of attribute (is natural)
Mean
is calculated by adding up all the values for a particular variable and dividing that sum by the total number of individuals with a value for the variable=arithmetic average
median
is the value in the middle when you rank the data in ascending or descending order
- Divides the data into 2 equal parts
Mode
the most frequently occurring value for a particular variable in a data set
histogram
a graph that shows the frequency of numerical data using rectangles
- important to manage the intervals
range
range for a variable is the difference between the minimum (lowest) and the maximum (highest) values in the data set
what are quartiles
mark the three values that divide a data set into four equal parts
what is the interquartile range
captures the middle 50% of values for a numeric variable
standard error of the mean
adjusts for the number of observations in the data set by dividing the variance by the total number of observations and then taking the square root of that number
confidence intervals
Provide information about the expected value of a measure in a source population based on the measured value in a study population
- a larger sample size will yield a narrower confidence interval
what does a 95% confidence interval mean
interval is usually reported for statistical estimates, which means that 5% of the time the confidence interval is expected to miss capturing the true value of a measure in the source population
inferential statistics
Techniques that use statistics from a random sample of a population to make evidence-based assumptions (inference) about the values of parameters in the population as a whole