Statistical Measurements Flashcards
BELL CURVE
34% (68%) - 13.5% (95%) - 2% (99%)
The normal, or bell-shaped, curve distributes scores into six equal parts. Three of these parts are below the mean and three of these parts are above the mean. Sixty-eight percent (34% and 34%) comprise one standard deviation; 95% (13.5% and 13.5%) comprise two standard deviations; and 99% (2% and 2%) comprise three standard deviations.
KUDER-RICHARDSON FORMULA
DICHOTOMOUS RELIABILITY (KR-20)
Measures internal consistency reliabilty. Can be used when the test contains dichotomous items, such as true-false questions
ALPHA ERROR
TYPE I ERROR
A Type I error, also known as an alpha error, refers to the researchers’ rejection of the null hypothesis when it is correct. If the significance level is changed, such as .05 to .01, the probability of a Type I error changes as well.
SPEARMAN-BROWN FORMULA
RELIABILITY OF SPLIT TEST
Sometimes it is useful to measure the internal consistency of a study by splitting the test into two halves. This reduces the test’s measured reliability, so researchers may use the Spearman-Brown formula to calculate the reliability of the test if it had not been split in two.
MULTIVARIATE ANOVA (MANOVA)
GROUP DIFFERENCES ON DVs
A statistical procedure for assessing possible group differences on a set of dependent variables. For example, a researcher could conduct a MANOVA to assess whether a group of participants who receive a new educational method differ significantly from another group of participants who are taught with a traditional method on a set of achievement variables, such as quiz scores, homework scores, exam scores, and project scores.
CONVERGENT VALIDITY
CORRELATION
Refers to two separate tests that measure the same attributes that are correlated. Convergent validation refers to times when there is a high correlation between the concept the test is meant to study and other constructs.
T-TEST
COMPARE MEAN SCORES
When there are two groups, and therefore two mean scores, researchers can use the t-test. This test compares the t value from the first calculation to the t value in the second calculation to find whether the mean scores of the two groups are significantly different from each other.
FACTORIAL ANOVA
CATEGORICAL IVs ON QUANTITATIVE DV
A statistical procedure to understand the effect on a dependent variable that is quantitative in nature of two or more independent variables that are categorical in nature. For example, a health researcher may use a factorial analysis of variance to examine the effects of diet (e.g., high vs. low carbohydrates) and exercise (e.g., 3 hours per week vs. 1 hour per week) on weight.
PEARSON PRODUCT-MOMENT CORRELATION COEFFICIENT
PEARSON’S r
An index of the degree of linear relationship between two variables. Devised by Karl Pearson, it is often known as the Pearson product-moment correlation coefficient (Pearson’s r) and is one of the most commonly used sample correlation coefficients.
PARAMETRIC STATISTICS
T-TEST & ANOVA
Statistical procedures that are based on assumptions about the distribution of the attributes in the population being tested (e.g., that there is a normal distribution of values). Parametric statistics, such as the t-test and analysis of variance, can be used when samples are randomly drawn from the population and results are distributed along a normal curve.
SCHEFFÉ TEST
IV DIFFERENCES
A post hoc test used after a researcher obtains a significant F ratio in an analysis of variance that has more than two levels (i.e., more than two conditions of an independent variable that are being examined for differences among their mean values).
NONPARAMETRIC STATISTICS
CHI-SQUARE & MANN-WHITNEY U
Statistical procedures in which the nature of the data being analyzed is such that certain common assumptions about the distribution of the attribute(s) in the population being tested (e.g., normality, homogeneity of variance) are not necessary or applicable. Nonparametric statistics, such as chi-square and the Mann-Whitney U test, are used when data is not normally distributed and variances are inconsistent. Nonparametric statistical measures, which are often used with descriptive data, should be used with nominal data.
COEFFICIENT OF DETERMINATION
COMMON VARIANCE (r2)
Obtained by multiplying the value of the correlation coefficient (r) by itself, the coefficient of determination ranges in value from 0 to 1. Low values indicate the outcome is relatively unrelated to the predictor, whereas values closer to 1 indicate that the two variables are highly related. For example, if r = .30, then the squared correlation coefficient is .302 = .09 and interpreted to mean 9% of the variance between the two variables is common or overlapping.
FACE VALIDITY
SURFACE
In which the test looks to be valid
TYPES OF DATA MEASUREMENT
NOMINAL, ORDINAL, INTERVAL, RATIO
- NOMINAL - Refers to numbers that represent categories or qualities of the variable, such as race, gender, and age. Nonparametric statistical measures, which are often used with descriptive data, should be used with nominal data.
- ORDINAL - Pertaining to rank, order, or position in a series
- INTERVAL - With interval data, the numbers on a scale have the same amount of the variable throughout the scale; for instance, degrees on the Fahrenheit temperature scale. Interval scales provide a constant and consistent unit of measurement.
- RATIO - Numerical values that indicate magnitude and have a true, meaningful zero point
TRUE VARIANCE
NATURAL
Naturally occurring variability within or among research participants. This variance is inherent in the nature of individual participants and is not due to measurement error, imprecision of the model used to describe the variable of interest, or other extrinsic factors. It represents the variance of the true scores among the participants taking the measure.
CONCURRENT VALIDITY
SIMULTANEOUS
In which test results are compared with other results around the same time
CONSEQUENTIAL VALIDITY
SOCIETAL CONSEQUENCES
Refers to the consequences of a study on society. Some researchers believe a test must benefit society in order to be considered valid, though not all researchers agree on this point.