Lectures 1-4 Flashcards
Frequent Statistics
What is the probability of a wrong decision about the treatment effect?
►What should we conclude from the observed data given a specified null hypothesis?
Bayesian Statistics
What should we believe about the treatment effect given the data that are observed?
Likelihood Inference
What is the evidence about the treatment effect given the data that are observed?
Biostatistics
The science of learning from biomedical data involving appreciable variability or uncertainty
The application of statistical reasoning and methods to the solution of biological, medical, and public health problems
►The scientific use of quantitative information to describe or draw inferences about natural phenomena
►Scientific—accepted theory (ideas) and practice; ethical standards
►Quantitative information—data reflecting variation in populations
►Inference—to conclude or surmise from evidence
14
Generate hypotheses
Ask questions
►Falsifiable
Design and conduct studies to generate evidence
Collect data
Descriptive statistics
Describe the distributions of observations
Statistical inference
Assess strength of evidence in favor of competing hypotheses
►Use data to update beliefs and make decisions
Also known as confirmatory data analysis (CDA)
►Draw conclusions about a population (whole group; true mechanism) from a sample (representative part of a group; “trials”)
►Assess strength of evidence in support of competing hypotheses
►Make comparisons
►Make decisions
►Make predictions
Design of a Study
Ask a precise, testable, and appropriatequestion ►Choose a research approach and design ►Define outcome of interest ►Define comparison groups ►Choose a population to study ►Implementation—collect data
Descriptive Statistics / Exploratory Data Analysis (EDA)
Organization and summarization of data
►Graphical display to visualize important patterns and variation
►Hypothesis generating
Explanations
hypotheses about mechanisms
Variable
a characteristic taking on different values
Simple
scientists prefer simple, rather than complex, explanations
►Occam’s razor
►Principle of parsimony
Interrelationships
associations; causal connections
Variable
a characteristic taking on different values
Random variable
a variable for which the values obtained are usually thought of as arising partly as a result of chance factors
Response variable (𝒀)
the outcome measure; that which may be affected or caused; often a health measure
Explanatory variables (𝑿)—
those that affect or cause the response:
►Treatment (intervention)—explanatory variable that can be controlled by the scientist
►Risk factors—explanatory variables that influence the risk of the outcome; of scientific interest (e.g., smoking, salt intake, environment) and usually cannot be controlled
Quantitative
concept of amount; numerical
Discrete variables
gaps in values; e.g., number of births, number of drinks per week
Continuous variables
no gaps in values; e.g., blood pressure, age, height, time to seroconversion
Special case
time-to-event data in which we need to deal with “censoring”
4
Qualitative
concept of attribute; categorical
Nominal scale
Binary or dichotomous—e.g., disease status (diseased or not diseased), vital status (alive or dead)
●Polychotomous or polytomous—e.g., occupation, marital status
Ordinal or ordered scale
e.g., ratings, preferences
Variation
refers to the differences among a set of measurements
Natural variation
differences among persons (experimental units) in the “true” values of the variable of interest
Measurement variation (or error)
differences between the measured and true values
Bias
difference between the average (expected) value of a measurement (variable) and the true value that it targets
Variance
variation among measurements about their average or mean value, even if that mean differs from the true targeted value
Mean Squared Error
MSE= variance + bias^2