Exam 1 Flashcards
Quantitative characteristics
- N = population size
- correlation doesn’t equal causation, correlation is an association
Quantitative strengths
- good at finding correlation
- yields many responses (more representative)
- easier to chart
- gives a general outlook on a social situation
Quantitative weaknesses
- not good at finding causation
Qualitative characteristics
- n = sample size
Qualitative strengths
- easier to establish causation
- in-depth
Qualitative weaknessess
- generalization more difficult to establish
- not applicable to the general population
Population
- total set of subjects of interest in a study
Sample
- subset of the population on which the study collects data
Parameter
- numerical summary of the population
- the value you are trying to uncover, cannot often do it precisely
- we do not always have access to the entire population
Statistic
- numerical summary of the sample data
- to get a better sense of what the perimeter value might be
Descriptive statistics
- statistics summarizing (outlining) sample or population data
Inferential statistics
- statistics making predictions about population parameters based on sample data
Qualitative variable
- variable that is placed on a measurement scale that has numerical values
Quantitative variable
- variable that is placed on a measurement scale that has a set of categories
Discrete variable
- variable taking the form of a set of separate numbers, such as 0, 1, 2, 3
Continuous variable
- variable that can take an infinite continuum of real number values
Measurement scales: interval
- quantitative, scale with specific numerical distances between levels
- Nominal: qualitative, scale with categories that are in no specific order
- Ordinal: qualitative, scale with categories that are in a specific order
Sampling methods
- Random sampling: drawing a sample of n subjects who each have the same probability of being drawn, ability to better make inferences / draw conclusions about the population
Sampling errors
- Sample bias: sample is not representative
- Response bias: under and over reporting
- Non-response bias: large sample, few participants
Distribution shapes
- normal (bell-shaped)
- U-shaped
- skewed to the right
- skewed to the left
Frequency
- relative frequency: proportion of observations falling into a category
Frequency distributions
- bar graph (typically categorical data)
- pie chart (typically categorical data)
- histogram (typically quantitative data)
Central tendency: mean
- an average
- sum of the observations divided by total number of the observations
Central tendency: median
- observation that falls in the middle of the ordered sample
- what is the most typical observation you can come across
- mean and median are usually close in a normal distribution
Central tendency: mode
- value that occurs most frequently in the distribution
- unimodal: for one mode
- bimodal: for two modes
- multimodal: for more than two modes
Variability: range
- difference between the largest and smallest observations
Variability: deviation
- difference between an observation and the mean (i.e. how far an observation ‘falls’ from the mean of the population or the sample)
Variability: standard deviation
- typical (average) deviation from the mean for an observation in the set
- will always have a positive value, but can go either way
Variance
- approximate average of squared deviations in a distribution
Percentile
- measure of data dispersion breaking down the distribution in percentage points.
- an observation’s percentile indicates the percentage of observations that are of equal or lesser value in the distribution.
- conversely, an observation’s percentile also allows us to calculate the percentage of observations that fall above it in the distribution.
- impossible to have a 100 percentile (implies that 100% of the population has the same value or be below your value)
Quartile
- measure of data dispersion breaking down the distribution in four ordered segments.
- when ordered in ascendance, the first 25% of the data distribution comprise the lower quartile, whereas the first 75% of the distribution comprise the upper quartile
Quartile order
- Min: 0% of the data
- Q1: First 25% of the data
- Med: First 50% of the data
- Q3: First 75% of the data
- Max: Fully 100% of the data
Empirical Rule
- ~ 68% of observations fall between the mean and one standard deviation on either side
- about 2/3 will fall on both side of the middle
~ 95% of observations fall between the mean and two standard deviations on either side - two standard deviations (20-7-7 — 13, 6)(95% will be greater than 6 and lower than 13)
- over 99% of observations fall between the mean and three standard deviations on either side.
- data will be cluster around the middle
- rare to find observations that fall off of the distribution
Z-score
- the number of standard deviations that any given observation in a distribution falls away from the mean of that distribution
- tells you the right tail probability associated with that z-score, can also use it to find the LTP
- RTP is the probability of encountering another observation that is further away removed from the mean than the observation in question
Sampling distribution of a statistic
- probability distribution that specifies probabilities for the possible values the statistic can take
- every sample a mean; can draw a distribution form the mean
Sampling distribution of sample means
- the probabilities of specific values the mean of a sample would take if we repeatedly drew random samples from the population
- the sample distribution of y-bars (distribution of the means of the samples that we have collected)
- whereas a ‘regular’ probability distribution has a standard deviation (σ), a sampling distribution of sample means has a standard error (σȳ).
- same concept of a standard deviation
Central Limit Theorem
- for random sampling with a large sample size n, the sampling distribution of the sample mean ȳ is approximately a normal distribution (n=30 is sufficient)
Measurement scales: nominal
- qualitative, scale with categories that are in no specific order
Measurement scales: ordinal
- qualitative, scale with categories that are in a specific order