Midterm revision 3 Flashcards
What is data analysis?
Non inferential description of an empirical distribution
What is an empirical distribution?
A set of scores on a variable or set of variables, where one score is for one variable or set of variables
Which has scientific priority - data analysis or inferential stats?
data analysis
What are quantitative representations?
Succinct and accurate descriptions of an empirical distribution’s: location, dispersion and shape
ARE NOT: references to normality, hypothesis speculations, claims regarding population
What is location?
In a metaphorical sense, it is where the empirical distribution sits on the X axis.
Includes the mean, median and mode
What is the median?
The point at/below which 50% of the scores lie
What is the mode?
The number of peaks
poor information
What is dispersion?
how spread out the scores are
Includes: variance, SD, and range
What is variance?
Average squared distance that each score is from the mean.
Natural dispersion counterpart of the mean
Has to be non-negative, not bounded above, bounded below
What is standard deviation?
Square root of variance
What is shape?
Shape of the empirical distribution
Includes skewness and kurtosis
What is skewness?
Captures degree of symmetry (NOT normality)
Symmetrical: 0
Positively skewed: hill falls down to positive end
Negatively skewed: hill falls down to negative end
What is kurtosis?
Captures the degree of peakedness or flatness
What are transformations?
Involve taking a function of a variable
What are the reasons for transforming?
- To re-express distribution for taste, preference, and convenience (e.g., transforming to a proportion for comparison across different scales)
- To bring a distribution into alignment with a theoretical distribution (involves nonlinear transformations that change the shape of a distribution, e.g., normality)
What are the outcomes of linear transformations?
Linear transformations do not change the shape, but change the location and dispersion in predictable ways
What happens to the mean when a constant is added/deducted to/from it?
+/- by the same
What happens to the mean when a constant is multiplied/divided to all scores?
x/div by the same
What happens to the SD when a constant is added/deducted to/from it?
It stays the same
What happens to the SD when a constant is x/div to all scores?
x/div by the same
What is the outcome of a nonlinear transformation?
It will alter the shape in unpredictable ways
What is a sampling distribution?
A frequency distribution of a statistic (repeated with an infinite number of samples)
*theoretical
What is standard error?
Standard deviation of a sampling distribution
*theoretical
What are confidence intervals?
Upper and lower bounds for the population. Shows where the data from the actual population is likely to fall.
e.g., 95% confidence interval = 95% all samples will contain that value; 95% chance that a sample from the population will have that obtained value.
What are the 10 steps in hypothesis testing?
- Data analysis
- Specify hypothesis pair
- Specify population distribution
- Deduce assumptions
- Formulate if/then link
- Type 1 Error control
- Type 2 Error control
- Decision Rule
- Employ a procedure to make a decision about hypothesis pair
- If H0 rejected, estimate effect size
What is the purpose of inferential testing?
To render a decision as to which of the two (H0 or H1) is in fact that state of nature at the moment the procedure is employed. i.e., to make a correct binary decision regarding the extant state of nature.
What are parametric approaches?
Begin with modeling the distribution of X in P by a theoretical density function
Makes an assertion regarding the parent population distribution
Makes assumptions regarding the parent population distribution
What are the advantages of parametric approaches?
you know what is going on regarding the distribution, errors, powers, etc.