Exam 1: Biostatistics Flashcards
Descriptive Statistics
- Involves
- Purpose
Involves: Collecting, Presenting, and Characterizign Data
Purpose: Describe Data
Inferential Statistics
- Involves
- Purpose
Involves: Estimation and Hypothesis Testing
Purpose: Make decisions about population characteristics
***Allows us to describe a population based on a sample***
Variable
A symbol of an event, act, characteristic, trait, or attribute that can be measured and to which we can assign some values
Categorical Variable
Consists of some numeric or character codes that represent either:
- The presence or absence of something that is of research interest
- The relative weight or rank of the thing that is of research interest
Quantitative Variable
Variable that holds the numerical result of some measurement
Process
Series of actions or operations that transforms inputs to outputs; generates output over time
Characteristics of Variables: Nominal Scale
Simplest level of measurement - categories without order
Characteristics of Variables: Ordinal Scale
Nominal variables with an inherent order among the categories
Characteristics of Variables: Interval Scale
Measruable difference or interval or distance between observations
Characteristics of Variables: Ratio
Same as interval but with an absolute reference point (such as “0”)
Data Presentation: Qualitative Data
Summary Table –> Either a Bar Graph or Pie Chart
Data Presentation: Quantitative Data
Dot Plot, Stem and Leaf Display,
or
Frequency Distribution –> Histogram
Class
One of the categories into which qualitative data can be classified
Class Frequency
Number of observations in the data set falling into a particular class
Class Relative Frequency
Class frequency divided by the total numbers of observations in the data set
Class Percentage
The class relative frequency multipled by 100
Bar Graph
Classes (Bars) have heights equivalent to class frequency, class relative frequency, or class percentage
(Unlike Histogram –> just class frequency and class relative frequency, bars are touching)
Pie Chart
Classes are in slices proportional to the class relative frequency
Central Tendency
Tendency to cluster/center about certain numerical values
Variability
Spread of the data
What symbols represents Sample/Population Mean and Size?
X bar should be lower case x
Which is used for both quantiative and qualitative data, mean, median, or mode?
Which is not effected by extreme values?
Mode
Median and Mode
Summary of Mean, Median, and Mode
Variance and Standard Deviation
- Measures of dispersion ***More reliable than Range***
- Most common measures
- Consider how data are distributed (unlike Range)
- Show variation about mean
What does Normal Distribution mean?
Mean = Median = Mode
What does the mean equal in the standard normal curve and what is the first standard deviation?
Mean = 0
First SD is +/- 1
Standard Notation (Sample vs. Population)
- Mean
- Standard Deviation
- Variance
- Size
When do you use n-1 vs. n in the denominator of the Variance Formula?
n-1 = Sample Variance
n = Population Variance
Shape of Curve: Mean vs. Median
1. Left-Skewed
Left Skewed
Mean < Median
Shape of Curve: Mean vs. Median
- Right-Skewed
Right-Skewed
Mean > Median
The Empirical Rule
- Applies to
- What percentage of the measurements lie within 1, 2, and 3 SDs of the mean? What are their Z-scores?
Applies to: Data sets that are mound-shaped and symmetric (i.e. Normal Distributions)
68% of measurements lie within one SD of the mean (x-s to x+s) z-score = b/w -1 and 1
95% of measurements lie within two SDs of the mean (x-2s to x+2s) z-score = b/w -2 and 2
99.7% of measurements lie within three SDs of the mean (x-3s to x+3s) z-score = b/w -3 and 3
If you scored in the 58th percentile, what percentage of test takers scored lower/higher than you?
Lower: 58%
Higher: 42%
Numerical Measures of Relative Standing: Z-Scores
- Describes…
- Measures…
Describes the relative location of a measurement compared to the rest of the data
Measures the number of standard deviations away from the mean a data value is located
What is the Frequentist definition of Probability?
If an experiment is repeated n times under identical conditions and if the event A occurs m times, then as n grows, the ratio of m/n approaches a fixed limit called the probability of A
P(A) = m/n
“Law of Large Numbers”
Probability Equation
Frequency of times an outcome occurs divided by the total number of possible outcomes (symbolized as p)
Random Event
Any event where the outcomes observed in that event involves uncertainty or the outcome can vary
(predicted by Probability)
When is probability unnecessary to calculate?
For a fixed event
An Event (Two Definitions)
- An occurrence due to nature
- A collection of one or more outcomes of an experiment
Simple vs Compound Probabilities
Simple = Single occurrence
Compound = Result of operations
-Define relationships between or combination of event occurrences