Week 9: Descriptive and Comparative Statistics Flashcards
Research Process
Descriptive Statistics
Biostatistics
is the science of analyzing data and interpreting the results so that they can be applied to solving problems related to BIOLOGY, HEALTH, or related fields
Univariate analysis
describes ONE variable in a data set using simple statistics like counts (frequencies), proportions, and averages
Bivaraible analysis
uses rate ratios, odds ratios, and other comparative statistical tests to examine the associations between two variables (mostly exposure and outcome)
Multivariable analysis
encompasses statistical tests such as multiple regression models that examine the relationships among three or more variables
Advanced statistics should be used only when…
they are appropriate for the study question and the analyst knows how to use and interpret them correctly
Link to Study Design
What is a Variable?
-Any quantity that varies from one entity to another (sometime within an entity over time)
- Any attribute, phenomenon, or event that can have different values
ØTo describes characteristic of a person, place, thing, or idea
We measure variables when an experiment is carried out or an observation is made (week 5-2)
What is a Variable?
-Any quantity that varies from one entity to another (sometime within an entity over time)
- Any attribute, phenomenon, or event that can have different values
ØTo describes characteristic of a person, place, thing, or idea
We measure variables when an experiment is carried out or an observation is made (week 5-2)
What is The Big Picture?
Types of Variables
Quant.dis.con
Qual.no.or
(Qualitative)
Types of Variables
Nominal
ØNo intrinsic or logical order or value
-University programs
-Countries
ØTypes of fruits
ØYou can assign numbers to different categories
Ø1=apple
Ø2=peach
Ø3=pair
ØBut they do not have any other numeric properties
Ø
ØDoing arithmetic (e.g., 1 + 2 = 3) is nonsensical
(Qualitative)
Types of Variables
Ordinal
-Intrinsic value but with no clear or equal differences between levels (a set of ordered categories)
-Primary vs. secondary vs. university education
-Mild vs. moderate vs. severe pain
-Rating scales (assigning numbers)
-Legitimate to say: 1 ≠ 2; 5 > 4 > 3 > 2 > 1
-But in terms of the attribute being measured, we cannot say
Ø (4-3) = (3-2) = (2-1)
Ø 4 is not two times larger than 2
How can you display qualitative(nominal, ordinal) data ?
Who in the absolute fuck is Florence Nightingale
the lady with the pie chart
How else can we display qualitative (nominal, ordinal) data?
-Frequency tables
Classification of Quantitative Variables
What is the difference between Continuous and Discrete
Quantitative Variables: Interval vs Ratio
What is the difference?
Numeric Variables; Measures of Central Tendency:
Mean
A sample mean is calculated by adding up all the values for a particular variable and dividing that sum by the total number of individuals with a value for the variable=arithmetic average
*Find the mean for the set of measurements:
*2, 9, 11, 6, 6, 26
*Solution: x̅=(2+9+11+6+6+26)/6=10
Median
The median is the value in the middle when you rank the data in ascending or descending order
Divides the data into 2 equal parts
Find the median for the set of measurements: 9, 5, 11, 6, 6, 26
Solution:
1.Rank the measurements from smallest to largest: 5, 6, 6, 9, 11, 26
2.Find the middle observation(s)
Choose a value between the two middle observations
Median = (6+9) = 7.5
2
Mode
The most frequently occurring value for a particular variable in a data set
–Find the mode for the set of measurements:
2 9 11 6 6 26
Displaying Distributions: H
Histograms
-important to manage the intervals
Shape of the histogram: Normal Distribution