Statistical Measurements Flashcards

1
Q

BELL CURVE

A

34% (68%) - 13.5% (95%) - 2% (99%)

The normal, or bell-shaped, curve distributes scores into six equal parts. Three of these parts are below the mean and three of these parts are above the mean. Sixty-eight percent (34% and 34%) comprise one standard deviation; 95% (13.5% and 13.5%) comprise two standard deviations; and 99% (2% and 2%) comprise three standard deviations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

KUDER-RICHARDSON FORMULA

A

DICHOTOMOUS RELIABILITY (KR-20)

Measures internal consistency reliabilty. Can be used when the test contains dichotomous items, such as true-false questions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

ALPHA ERROR

A

TYPE I ERROR

A Type I error, also known as an alpha error, refers to the researchers’ rejection of the null hypothesis when it is correct. If the significance level is changed, such as .05 to .01, the probability of a Type I error changes as well.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

SPEARMAN-BROWN FORMULA

A

RELIABILITY OF SPLIT TEST

Sometimes it is useful to measure the internal consistency of a study by splitting the test into two halves. This reduces the test’s measured reliability, so researchers may use the Spearman-Brown formula to calculate the reliability of the test if it had not been split in two.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

MULTIVARIATE ANOVA (MANOVA)

A

GROUP DIFFERENCES ON DVs

A statistical procedure for assessing possible group differences on a set of dependent variables. For example, a researcher could conduct a MANOVA to assess whether a group of participants who receive a new educational method differ significantly from another group of participants who are taught with a traditional method on a set of achievement variables, such as quiz scores, homework scores, exam scores, and project scores.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

CONVERGENT VALIDITY

A

CORRELATION

Refers to two separate tests that measure the same attributes that are correlated. Convergent validation refers to times when there is a high correlation between the concept the test is meant to study and other constructs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

T-TEST

A

COMPARE MEAN SCORES

When there are two groups, and therefore two mean scores, researchers can use the t-test. This test compares the t value from the first calculation to the t value in the second calculation to find whether the mean scores of the two groups are significantly different from each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

FACTORIAL ANOVA

A

CATEGORICAL IVs ON QUANTITATIVE DV

A statistical procedure to understand the effect on a dependent variable that is quantitative in nature of two or more independent variables that are categorical in nature. For example, a health researcher may use a factorial analysis of variance to examine the effects of diet (e.g., high vs. low carbohydrates) and exercise (e.g., 3 hours per week vs. 1 hour per week) on weight.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

PEARSON PRODUCT-MOMENT CORRELATION COEFFICIENT

A

PEARSON’S r

An index of the degree of linear relationship between two variables. Devised by Karl Pearson, it is often known as the Pearson product-moment correlation coefficient (Pearson’s r) and is one of the most commonly used sample correlation coefficients.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

PARAMETRIC STATISTICS

A

T-TEST & ANOVA

Statistical procedures that are based on assumptions about the distribution of the attributes in the population being tested (e.g., that there is a normal distribution of values). Parametric statistics, such as the t-test and analysis of variance, can be used when samples are randomly drawn from the population and results are distributed along a normal curve.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

SCHEFFÉ TEST

A

IV DIFFERENCES

A post hoc test used after a researcher obtains a significant F ratio in an analysis of variance that has more than two levels (i.e., more than two conditions of an independent variable that are being examined for differences among their mean values).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

NONPARAMETRIC STATISTICS

A

CHI-SQUARE & MANN-WHITNEY U

Statistical procedures in which the nature of the data being analyzed is such that certain common assumptions about the distribution of the attribute(s) in the population being tested (e.g., normality, homogeneity of variance) are not necessary or applicable. Nonparametric statistics, such as chi-square and the Mann-Whitney U test, are used when data is not normally distributed and variances are inconsistent. Nonparametric statistical measures, which are often used with descriptive data, should be used with nominal data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

COEFFICIENT OF DETERMINATION

A

COMMON VARIANCE (r2)

Obtained by multiplying the value of the correlation coefficient (r) by itself, the coefficient of determination ranges in value from 0 to 1. Low values indicate the outcome is relatively unrelated to the predictor, whereas values closer to 1 indicate that the two variables are highly related. For example, if r = .30, then the squared correlation coefficient is .302 = .09 and interpreted to mean 9% of the variance between the two variables is common or overlapping.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

FACE VALIDITY

A

SURFACE

In which the test looks to be valid

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

TYPES OF DATA MEASUREMENT

A

NOMINAL, ORDINAL, INTERVAL, RATIO

  • NOMINAL - Refers to numbers that represent categories or qualities of the variable, such as race, gender, and age. Nonparametric statistical measures, which are often used with descriptive data, should be used with nominal data.
  • ORDINAL - Pertaining to rank, order, or position in a series
  • INTERVAL - With interval data, the numbers on a scale have the same amount of the variable throughout the scale; for instance, degrees on the Fahrenheit temperature scale. Interval scales provide a constant and consistent unit of measurement.
  • RATIO - Numerical values that indicate magnitude and have a true, meaningful zero point
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

TRUE VARIANCE

A

NATURAL

Naturally occurring variability within or among research participants. This variance is inherent in the nature of individual participants and is not due to measurement error, imprecision of the model used to describe the variable of interest, or other extrinsic factors. It represents the variance of the true scores among the participants taking the measure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

CONCURRENT VALIDITY

A

SIMULTANEOUS

In which test results are compared with other results around the same time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

CONSEQUENTIAL VALIDITY

A

SOCIETAL CONSEQUENCES

Refers to the consequences of a study on society. Some researchers believe a test must benefit society in order to be considered valid, though not all researchers agree on this point.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Z-SCORE

A

DISTANCE FROM POPULAR MEAN SCORE

Z-Score, aka Standard Score, describes distance from the population mean of a distribution of scores. It is measured in Standard Deviations. A normal distribution ranges from -3 SDs to +3 SDs.

20
Q

CORELATION COEFFICIENT

A

LINEAR RELATIONSHIP (-1 - +1)

A numerical index reflecting the degree of linear relationship between two variables. It is scaled so that the value of +1 indicates a perfect positive relationship (such that high scores on variable x are associated with high scores on variable y), –1 indicates a perfect negative relationship (such that high scores on variable x are associated with low scores on variable y, or vice versa), and 0 indicates no relationship. The most commonly used type of correlation coefficient is the Pearson

21
Q

F RATIO

A

EXPLAINED VARIANCE / ERROR VARIANCE

In an analysis of variance or a multivariate analysis of variance, the amount of explained variance divided by the amount of the error variance; that is, the ratio of between-groups variance to within-group variance. Its value determines whether to accept the null hypothesis stating that there is no difference between the treatment and control conditions, with a large value indicating the presence of a significant effect. Ideally, a researcher prefers to have rather small variation within each group and maximal variation between the groups in order to demonstrate significant group differences.

22
Q

PREDICTIVE VALIDITY

A

FUTURE

Refers to the degree to which a measure predicts future performance

23
Q

VARIABILITY

A

SD RANGE

When measuring variability, researchers may use standard deviation (SD) to describe the variability within a distribution of scores.

24
Q

SPEARMAN’S RHO

A

CORRELATION COEFFICIENT

Spearman’s rank correlation coefficient, or Spearman’s rho, the first statistical measure for intelligence tests. It determines how well the relationship between two variables can be described.

25
Q

ANALYSIS OF COVARIANCE (ANCOVA)

A

ADJUSTS FOR NON-INVESTIGATED IV INFLUENCE

It is a statistical method of studying the responses of different groups to a dependent variable that adjusts for the influence of a variable that is not being investigated but nonetheless is related to the dependent variable and thus may influence the study results.

For example, suppose a researcher analyzes whether there is a difference in learning among three types of instruction—in-class lecture, online lecture, and textbook only. He or she divides a random selection of adult students into three groups, implements the different instruction types, and administers the same test to all participants to determine how much they learned. If the researcher knows each participant’s educational background, he or she could use an analysis of covariance to adjust the treatment effect (test score) according to educational level, which would reduce the observed variation between the three groups caused by variation in education levels rather than by the instruction itself.

26
Q

CURRICULAR VALIDITY

A

CURRICULUM

Refers to how well a test measures the curriculum being tested and is evaluated by experts.

27
Q

AD HOC TESTS

A

REPS NON-RANDOM SUB-GROUPS

Ad hoc sampling refers to non-probability sampling that is gathered with the goal of obtaining any participant that meets specific characteristics. This allows for more inclusion of individuals who may otherwise be excluded, and is often used when a group within the larger population needs representation. One drawback of this technique is that because the sample is not random, study results will not be representative of the greater population.

28
Q

INTERNAL v EXTERNAL VALIDITY

A

INTEGRITY v APPLICABILITY

While internal validity relates to how well a study is conducted (its structure), external validity relates to how applicable the findings are to the real world.

29
Q

STANDARD DEVIATION

A

MEAN OF ALL DEVIATIONS FROM THE MEAN

Measure of variability. Standard deviation (SD) is a measure of variability and describes the variability within a distribution of scores. It is the mean of all the deviations from the mean, and is a popular measure of the dispersion of scores

30
Q

ERROR VARIANCE

A

MEASUREMENT-INDUCED

The element of variability in a score that is produced by extraneous factors, such as measurement imprecision, and is not attributable to the independent variable or other controlled experimental manipulations.

31
Q

CONTENT VALIDITY

A

DOMAIN-SPECIFIC

in which the test material comes from a certain domain

32
Q

CHI-SQUARE TEST

A

RELATIONSHIP OF CATEGORIES

Used to determine whether there is a relationship between two variables whose values are categories. For example, it may be used to test whether sex (male vs. female) is unrelated to having a household pet (yes vs. no).

33
Q

VARIANCE

A

SD2

A measure of variability. Also of the spread, or dispersion, of scores within a sample or population, whereby a small variance indicates highly similar scores, all close to the sample mean, and a large variance indicates more scores at a greater distance from the mean and possibly spread over a larger range. When measuring variability, researchers may use standard deviation (SD) to describe the variability within a distribution of scores. Variance is the square of the standard deviation and is used when conducting statistical analyses.

34
Q

EX POST FACTO RESEARCH DESIGN

A

AFTER-THE-FACT VARIABLES

Also known as a causal-comparative design, is a non-experimental quantitative design that examines variables after the fact (ex post facto). The researcher can then draw several conclusions about why these relationships occur. The analysis of variance and the t-test are frequently used in ex post facto designs.

35
Q

COEFFICIENT OF NONDETERMINATION

A

UNEXPLAINED VARIANCE

The Coefficient of Non-Determination explains the amount of unexplained, or unaccounted for, variance between two variables, or between a set of variables (predictors) in an outcome variable.

36
Q

CONSTRUCT VALIDITY

A

DO YOUR JOB!

Which refers to the degree to which a test actually measures what it is meant to measure

37
Q

DISCRIMINANT VALIDITY

A

NON-CORRELATION

Refers to two separate tests that don’t correlate

38
Q

ONE-WAY ANOVA

A

MULTI-LEVEL IVs ON SINGLE DV

An analysis of variance that evaluates the influence of different levels or conditions of a single independent variable upon a dependent variable. The mean values of two or more samples are examined in order to determine the probability that they have been drawn from the same population.

39
Q

BETA ERROR

A

TYPE II ERROR

Type II error, also known as beta error, refers to researchers’ failure to reject the null hypothesis when there is a difference between groups. Researchers can change the significance level to change the probability of Type I and Type II errors occurring. If the significance level goes down, Type I error decreases, though Type II error increases.

40
Q

BIVARIATE TABULAR ANALYSIS

A

IV PREDICTS DV

Bivariate tabular analysis can be used when the value of an independent variable can predict the value of a dependent variable. When this occurs, researchers typically plot scores along an XY plot graph to illustrate the relationship between variables.

41
Q

POST HOC TESTS

A

AFTER ANOVA

Researchers might apply a test after the analysis of variance is calculated (post hoc) if it is unclear as to which mean scores are significantly different from each other. Post hoc tests that may be able to clarify this problem include Scheffe’s method, Tukey’s HSD, Newman-Keuls, and Duncan’s new multiple range test.

42
Q

IV v DV

A

MANIPULATED v UNCHANGING

The independent variable (aka stimulus variable) is the one that is manipulated in order to observe the effects on the dependent variable, which is unchanging. The independent variable is sometimes called the stimulus variable, predictor variable, or experimental variable. Other terms for the dependent variable include the response variable, the outcome variable, and the criterion variable.

43
Q

EXPLAINED VARIANCE

A

PREDICTED - ACTUAL DATA

The discrepancy between a model and actual data.

44
Q

INCLUSIVE RANGE

A

RANGE + 1

45
Q

STANDARD ERROR OF MEASUREMENT (SEM)

A

RELIABILITY RANGE

CONFIDENCE LIMIT

The standard error of measurement (SEM) is a measure of reliability and is sometimes referred to as the confidence limit. The SEM helps researchers know that a person’s test score likely falls within a certain range of scores. The SEM is determined by assessment developers and can be found on the test’s profile. If a certain assessment has an SEM of 4.0, you first add and subtract 4 to the individual score to find the range. Therefore, because the score in this question is 82, there is a 2 out of 3 chance that the individual’s score falls between 78 and 86.

46
Q

MULTIPLE REGRESSION

A

MULTIPLE IVs ON A DV

Multiple regression is able to add together the predictive power of many independent variables. It is often used to predict a single outcome variable (DV) from a set of predictor variables (IVs).

47
Q

DUNCAN’S MULTIPLE RANGE TEST

A

DUNCAN’S MULTIPLE RANGE TEST

Duncan’s MRT. A multiple comparisons test used to follow up on an analysis of variance that yields significant results. After obtaining a significant F ratio, a researcher could conclude that at least one pair of samples is significantly different on the dependent variable and then conduct a Duncan multiple range test on pairs of samples to identify that specific pair. The overall Type I error rate would be protected, which would not be the case if separate t tests were conducted on all possible pairs.