midterm chapters 1-3 Flashcards
CH 1-3 STATS
2 Types of Evidence
Observational
Evidential
CH 1-3 STATS
Anecdotal evidence
Evidence based on short stories or examples of interesting events
CH 1-3 STATS
Confounding variable
A variable that correlates with both the independent and dependent variables making it look like the two variables are directly related
CH 1-3 STATS
Placebo
An inert substance; a substance that looks like something else but is not composed of the same materials or ingredients
(ex. medicine vs. a sugar pill)
CH 1-3 STATS
Placebo effect
The tendency of people to respond favorably to any treatment, even if it is only inert
(Nocebo effect: the tendency of people to respond negatively to any treatment, even if it is only inert)
CH 1-3 STATS
Blinding
A technique used to keep the treatment assignment secret from the experimental subject
CH 1-3 STATS
Double-blinding
A technique used to keep the treatment assignments from both the experimental subjects and the persons making evaluations of the response
CH 1-3 STATS
Panel bias
Bias attributable to the study having influenced the behavior of the subjects
CH 1-3 STATS
Historical Controls
Controls are previous patients with the same illness who were treated with a different therapy than the one in question
CH 1-3 STATS
Problem with historical controls
Often there is a tendency for later patients to show a better response (even to the same therapy) than the previous patients with the same diagnosis
CH 1-3 STATS
Population
All subjects/animals/specimens/plants, and so on, of interest
CH 1-3 STATS
Sample (size n)
Subset of the population
CH 1-3 STATS
Effect of a biased sample
A biased sample systematically overestimates or systematically underestimates a characteristic of the population
CH 1-3 STATS
Qualifications of a simple random sample
a) Every member of the population has the same chance of being included in the sample
b) The members of the sample are chosen independently of each other
CH 1-3 STATS
Qualifications of a nonsimple random cluster sample
a) IDs are assigned to entire groups of individuals
b) Entire groups of individuals are selected for the sample
CH 1-3 STATS
Qualifications of a nonsimple stratified random sample
a) Population is divided into strata
b) Many random samples are taken-one within each stratum-and combined to comprise the sample
CH 1-3 STATS
Chance error due to sampling (sampling error)
A discrepancy between the sample and the population
CH 1-3 STATS
Sampling bias
Systematic tendency for some individuals of the population to be selected more readily than others
CH 1-3 STATS
Nonsampling error
Error that is not caused by the sampling method; An error that would have arisen even if the researcher had a census of the entire population
CH 1-3 STATS
Nonresponse bias
A bias caused by persons not responding to some of the questions in a survey or not returning a written survey
CH 1-3 STATS
Missing data
Observations that were planned but could not be made
CH 1-3 STATS
Variable
A characteristic of a person or a thing that can be assigned a number or a category
CH 1-3 STATS
Categorical variable
A variable that records which of several categories a person or thing is in
CH 1-3 STATS
Ordinal variable
A categorical variable in which the categories can be arrayed in a meaningful rank order
CH 1-3 STATS
Numeric variable
A variable that records the amount of something
CH 1-3 STATS
Continuous variable
A numeric variable that is measured on a continuous scale (includes decimals, fractions, etc.)
CH 1-3 STATS
Discrete variable
A numeric variable for which we can list the possible variables (integers)
CH 1-3 STATS
Observational unit
The persons or things on which we are measuring variables
CH 1-3 STATS
Frequency distribution
A display of the frequency (number of occurrences) of each value in the data set
CH 1-3 STATS
Bar chart
A simple graphic showing the categories that a categorical variable takes on and the number of observations in each category for the data in the sample
CH 1-3 STATS
Dotplot
A simple graph that can be used to show the distribution of a numeric variable when the sample size is small
CH 1-3 STATS
Histogram
Like a bar chart, except that a histogram displays a numeric variable, which means that there is a natural order and scale for the variable
CH 1-3 STATS
Relative frequency
= Frequency / n
CH 1-3 STATS
Mode
The central peak in a frequency distribution; the value with the most occurences
CH 1-3 STATS
Skewed to the right
The right tail is more stretched out than the left
CH 1-3 STATS
Skewed to the left
The left tail is more stretched out than the right
CH 1-3 STATS
Unimodal
The graph has only one mode
CH 1-3 STATS
Bimodal
The graph has two modes: can indicate the existence of two distinct subgroups of observational units
CH 1-3 STATS
Median ( ỹ)
The value that most nearly lies in the middle of the sample–the value that splits the ordered data into two equal halves
1/2 (n+1) denotes the rank position of the median within an ordered data set
CH 1-3 STATS
Mean ( ȳ)
Sum of the observations divided by the number of observations (average)
CH 1-3 STATS
Robust statistic
A statistic that is relatively unaffected by changes in a small portion of the data (even if the changes are dramatic ones)
CH 1-3 STATS
Boxplot
A visual representation of the five number summary
CH 1-3 STATS
Five number summary
Minimum, Quartile 1, Median, Quartile 3, Maximum
CH 1-3 STATS
First quartile (Q1)
Median of the data values in the lower half of the data set (between the minimum and median)
CH 1-3 STATS
Third quartile (Q3)
Median of the data values in the upper half of the data set (between the median and maximum)
CH 1-3 STATS
Interquartile Range (IQR)
The difference between the first and third quartiles
CH 1-3 STATS
Outlier
A data point that differs so much from the rest of the data that it doesn’t seem to belong with the other data
upper fence = Q1 - 1.5 * IQR
lower fence = Q3 - 1.5 * IQR
CH 1-3 STATS
Modified boxplot
A boxplot in which the outliers are graphed as separate points
CH 1-3 STATS
Univariate summary
A graphical or numeric summary of a single variable
CH 1-3 STATS
Bivariate summary
A graphical summary used to examine the relationship between pairs of variables
CH 1-3 STATS
Categorical-categorical relationship graphs
Bivariate frequency table
Stacked bar chart
Stacked relative frequency bar chart
CH 1-3 STATS
Numeric-categorical relationship graphs
side-by-side boxplots
side-by-side dotplots
CH 1-3 STATS
Numeric-numeric relationship graphs
Scatterplot
CH 1-3 STATS
Scatterplot
A graph that examines the relationships between two numeric variables, X and Y
CH 1-3 STATS
Lowless smooth line
A line on a scatterplot that is helpful in visualizing curved or nonlinear relationships in data
CH 1-3 STATS
Regression line
A line in scatterplots that is helpful in visualizing linear trends in data
CH 1-3 STATS
Range
The difference between the largest (maximum) and smallest (minimum) observations in a sample
CH 1-3 STATS
Standard deviation (s)
- find the deviations for all data points
- square all
- add all together
- divide the sum by n-1
- take the square root
CH 1-3 STATS
Coefficient of variation
The standard deviation expressed as a percentage of the mean
(standard deviation / mean) * 100%
CH 1-3 STATS
The empirical rule
68% within 1 standard deviation of the mean
95% within 2 standard deviations of the mean
> 99% within 3 standard deviations of the mean
CH 1-3 STATS
Population mean (μ)
= population average value of Y
Y=quantitative variable
CH 1-3 STATS
Population Standard Deviation (σ)
=square root (population average value of (Y - μ)^2)
Y=quantitative variable
CH 1-3 STATS
Sample Variance (s^2)
Square of the standard deviation (s)
CH 1-3 STATS
Effect of addition on the transformation of variables
- Linear transformation
- Add constant, c, to the variables
- Has no effect on standard deviation
CH 1-3 STATS
Effect of multiplication on the transformation of variables
- Linear transformation
- Multiply values by constant, c
- Multiply standard deviation by absolute value of c