Session 1 Flashcards
what is a Population
All entities or individuals of interest
what is the point of a population
We often want to learn something about the population
what are Parameters
A value that describes the population
what is the symbol for population mean
Population mean, μ
what is the symbol for population variance
Population variance, σ2
what is Sample
A subset of individuals from the population aka Data that we will examine
what does N refer to
usually refers to sample size
what is an Estimate
A value that describes the sample
what is the symbol for sample mean
Sample mean, X (bar on top)
what is the symbol for sample variance
Sample variance, s2
what are the 2 types of stats
Descriptive Statistics and Inferential Statistics
what is Descriptive Statistics
Summarize/describe properties of the sample (or the population if we gather data from the entire population)
what is inferential stats
Draw conclusions/inferences regarding the properties of the population, but based only on sample data
what is a Variable
A characteristic that varies across observations (people, location, time, etc.)
aka Often a single column in our dataset
wha are Variables also known as
levels of measurements
what are the two types of variables
Quantitative and Categorical
what are types ofRatio
Interval Quantitative variables
Ratio
Interval
what are the types of Categorical variables
Ordinal
Nominal
what is Nominal variable
Classifies objects
aka Are two observations the same or different on some attribute?
Not quantitative, though we can use numbers to index the categories
what kind of variables are being used here: Which restaurant do you prefer? Tim Horton’s Burger King Thai Express
nominal
Gender
Country of Birth Native Language
these are all examples of what kind of variables
nominal
what does Dichotomous (binary) mean
Two categories/levels
how many levels in nominal variables
2 (Dichotomous (binary))
e.g Treatment/control Correct/incorrect
ranks the variables from highest level of measurement to lowest level of measurement
Ratio
Interval
Ordinal
Nominal
what is Ordinal
Rank data
aka Does one observation have more or less of an attribute than a second
observation?
Relative standing of two observations on the attribute Does not say by how much the observations differ
Birth order (1st, 2nd, 3rd) Students’ standing in class relative to others Importance of personal values
these are examples of what kind of variables
ordinal
what is Interval
Rating data (equal distances)
aka Assigned numbers have meaningful units, and these unit sizes remain constant
Temperature (when measured using Celsius or Fahrenheit)
Calendar year
these are examples of what kind of variables
intervals
what is Ratio
Interval, with an absolute 0 point or meaningful origin
aka 0 means lack of the attribute
Comparisons such as “2 times as much” of something or “half as much” make sense
Height (feet, inches, meters), weight, elapsed time, pulse rate, car speed, price ($)
these are examples of what kind of data
ratio
what are they 2 types of variable
Independent Variable (IV) Dependent Variable (DV)
what is Independent Variable (IV)
PredictOR (or covariate)
Factors in an experimental design
what is Dependent Variable (DV)
Outcome/Response
PredictED variables
what are the Types of Research
Correlational vs. Experimental
Between-subjects vs. within-subjects
what is the Correlational IV measured by
the researcher
what is Correlational good for
Good for ecological validity - generalizing research findings to the real world
what is Correlational not good for
Not good for inferring causality; IV and DV may have a relationship due to a third (confounding) variable or common cause
what is done to the IV in Experimental
IV is manipulated by the researcher
what is experimental good for
Good for inferring causality
what is experiment not good for
Manipulating IV in lab settings may sometimes feel detached from the real world
Statistical methods to analyze data may be ________ for correlational and experimental designs
Some data examples used in this course may be correlational
the same/similar
what is Between-subjects design (research)
Each participant in only one experimental condition (e.g., control or treatment)
with Between-subjects design , If random assignment is used what should happen to the groups
groups should be approximately equal on any confounding variables
what is Within-subjects design
Each participant does more than one experimental conditions (e.g., control and treatment)
aka DV measured multiple times
what must researchers be careful about with Within-subjects design
Vulerable to practice effects and fatigue/boredom effects as alternative explanations for differences betwen conditions
what is Counterbalancing
used to help rule out alternative explanations
Counterbalancing is usually thought of as a method for controlling order effects in a repeated measures design (see the notes on variance and experimental design). In a counterbalanced design to control for order effects, we use separate groups of subjects, each group receiving treatments in a different order.
what type of research is Counterbalancing used for
Within-subjects design
In this course, We focus on the relationships between how many IV and DV
one/multiple independent variables (IV) and one dependent variable (DV)
what is consistent about the DV
continuous (usually normally distributed)
what is consistent about the IV
Categorical/continuous
what is the point of Models in statistics
Summarize/describe a large amount of data with just a few numbers
what is the equation for the stat model according to field
Outcomei = (Model) + Errori
explain the parts to the model in statistics according to field
i subscript used to indicate a participant i
Outcome - the DV
Model - systematic part explained by IV1
Error - unsystematic part that is not explained by IV
what are Descriptive Statistics
Describe properties of samples (or populations, if completely known)
what kind of properties are described with descriptive stats
How are the data distributed?
Where is the centre?
How much variability is there in scores? What shape is the distribution?
what is used to describe ‘centre
central tendency
what is in central tendency
Mean Median Mode
what is used to describe variability
variation
what is in variation
Range
Variance
Standard Deviation
what is used to describe shape of distribution
shape
what is in shape
Skewness Kurtosis
what is Mean
the average
The mean is vulernable to what
extreme values (outliers)
what is Median
the value in the middle
is the median vulnerable to outliers
The median is less vulnerable to extreme values (outliers)
what is Mode
The value that occurs most frequently
is the mode impacted by outliers
Not affected by extreme values
what is mode used for
Used for either numerical or categorical data
how many modes might there be
There may be no mode
There may be several modes
Mean is most commonly used when
always unless extreme values (outliers) exist
when is the Median used
often used if extreme outliers are present
when is the Mode used
is often used with categorical (nominal) data
what are Measures of variation
Measures of variation give information on the spread or variability of data values
what are the ways to measure variation
Range
Variance
Standard deviation
what is Range
Simplest measure of dispersion
Difference between the largest and the smallest observations:
what is Variance
(Approximate) average of ‘squared’ deviations of values from the mean
what is the equation for sample variance
s^2 = SS / N-1 = (Xi −X)^2/N-1
explain the parts to the variance equation
X = mean
N = sample size
Xi = ith value of the variable X
SS = Xi − X)2 = Sum of Squares (sum of squared deviations)
what is Standard deviation used for
Most commonly used measure of variation
what does standard variation show
variation about the mean
what units does SD have
Has same units as the original data
how to get the SD
Square root of variance
what does the Shape of a Distribution explain
Also describes how data are distributed Symmetric or skewed
what is skewed
Which way does the “tail” point?
what are the types of skewness
left
symmetric
right
what is another name for Normal Distribution
Gaussian or bell-shaped
In many statistical techniques for experimental designs, the dependent variable is assumed to be what
continous and Normally distributed
if normally distributed, what is true about the mean, median and mode
Mean = Median = Mode
if normally distributed, what is true about the mean and SD
Mean (μ) and Standard deviation (σ) are sufficient to describe a normal distribution
If the data distribution is normal, then the interval is what
μ ± σ contains about 68% about of the values in the population (or sample
what does μ ± 2σ mean
contains about 95% of the values in the population (or sample)
what does μ ± 3σ mean
contains about 99.7% of the values in the population (or sample)