Statistics for People Who (Think They) Hate Statistics Flashcards

Question

**Degrees of freedom**

Answer 1

A value, which is different for different statistical tests, that approximates the sample size or number of individual cells in an experimental design.

Answer 2

The outcome variable or the predicted variable in a regression equation. See ***Criterion***

Answer 3

Values that organize and describe the characteristics of a collection of data, sometimes called a *data set.*

Answer 4

A positive correlation where the values of both variables change in the same direction. See ***Positive correlation***

Answer 5

A research hypothesis that posits a difference between groups in one direction. See ***Nondirectional research hypothesis***

Answer 6

A measure of the magnitude of difference between two groups, usually calculated as Cohen's *d*.
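
The definition above does not say how Cohen's *d* is computed. As a hedged sketch, the snippet below assumes the common pooled-standard-deviation form (difference between group means divided by the pooled sample standard deviation). The function name `cohens_d` and the data are hypothetical, and other variants of the formula exist.

```python
import math
import statistics

def cohens_d(group_a, group_b):
    """Cohen's d using a pooled sample standard deviation (an assumed variant)."""
    n_a, n_b = len(group_a), len(group_b)
    var_a = statistics.variance(group_a)  # sample variance (n - 1)
    var_b = statistics.variance(group_b)
    pooled_sd = math.sqrt(((n_a - 1) * var_a + (n_b - 1) * var_b) / (n_a + n_b - 2))
    return (statistics.mean(group_a) - statistics.mean(group_b)) / pooled_sd

# Hypothetical scores for two groups.
print(cohens_d([2, 4, 6], [1, 3, 5]))
```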

Answer 7

The difference between the observed score (*Y*) and the predicted score. See ***Standard error of estimate***
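
A minimal sketch of this definition: subtract the predicted score from the observed score. The function name `prediction_error` and the score pairs are illustrative assumptions, not taken from the source.

```python
def prediction_error(observed, predicted):
    """Error in prediction: observed score (Y) minus the predicted score."""
    return observed - predicted

# Hypothetical (observed, predicted) pairs.
pairs = [(10, 8), (7, 9), (5, 5)]
errors = [prediction_error(y, y_hat) for y, y_hat in pairs]
print(errors)
```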

Answer 8

The part of a test score that is random and contributes to the unreliability of a test.

Answer 9

1,152,921,504,606,846,976 bytes of data - lots and lots of data, and the amount of data in the world grew just as you read this. Wow.

Answer 10

An analysis of variance with more than one factor or independent variable.

Answer 11

A research design used to explore more than one treatment variable.

Answer 12

A method for illustrating how often scores occur in groups called class intervals.

Answer 13

A graphical representation of a frequency distribution that uses a continuous line to show the number of values that fall within a *class interval*.

Answer 14

A chi-square test on one dimension, which examines whether the distribution of frequencies is different from what one would expect by chance.

Answer 15

A graphical representation of a frequency distribution that uses bars of different heights to show the number of values that fall within each *class interval*.

Answer 16

An if-then statement of conjecture that relates variables to one another and is used to reflect the general problem statement or question that is the motivation for asking a research question.

Answer 17

The treatment variable that is manipulated or the predictor variable in a regression equation. See ***Predictor***

Answer 18

A negative correlation where the values of variables move in opposite directions. See ***Negative correlation***

Answer 19

Tools that are used to infer characteristics of a population based on data from a sample of that population.

Answer 20

The outcome where the effect of one factor is differentiated across another factor.

Answer 21

A type of reliability that examines whether items on a test measure only one dimension, construct, or area of interest.

Answer 22

A type of reliability that examines whether observers are consistent with one another.

Answer 23

A level of measurement that places a variable's values into categories that are equidistant from each other, as when points are evenly spaced along a scale.

Answer 24

The quality of a distribution that defines how flat or peaked it is.

Answer 25

The quality of a normal curve that is relatively peaked compared with a normal distribution.

Answer 26

The regression line that best fits the observed scores and minimizes the error in prediction.

Answer 27

A correlation that is best expressed visually as a straight line.

Answer 28

In analysis of variance, when a factor or an independent variable has a significant effect upon the outcome variable.

Answer 29

A type of average calculated by summing values and dividing that sum by the number of values. Also known as ***Arithmetic mean***.

Answer 30

The average deviation for all scores from the mean of a distribution, calculated as the sum of the absolute value of the scores' deviations from the mean divided by the number of scores.
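
The calculation described above can be sketched as follows. The function name `mean_deviation` and the scores are hypothetical, chosen only to illustrate the arithmetic.

```python
def mean_deviation(scores):
    """Sum of absolute deviations from the mean, divided by the number of scores."""
    n = len(scores)
    mean = sum(scores) / n
    return sum(abs(x - mean) for x in scores) / n

scores = [2, 4, 6, 8]  # hypothetical data; the mean is 5
print(mean_deviation(scores))  # absolute deviations 3, 1, 1, 3 average to 2.0
```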

Answer 31

The mean, the median, and the mode.
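
As a small illustration, Python's standard `statistics` module computes all three measures. The data set is hypothetical.

```python
import statistics

scores = [1, 2, 2, 3, 7]  # hypothetical data

mean = statistics.mean(scores)      # sum of values divided by their count
median = statistics.median(scores)  # middle value of the sorted scores
mode = statistics.mode(scores)      # most frequently occurring score

print(mean, median, mode)
```

Here the single high score of 7 pulls the mean above the median, which is one reason the three measures are usually reported together.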

Answer 32

The midpoint in a set of values, such that 50% of the cases in a distribution fall below the median and 50% fall above it.

Answer 33

The central point in a class interval.

Answer 34

The most frequently occurring score in a distribution.

Answer 35

A statistical technique whereby several variables are used to predict one.

Answer 36

A negative correlation where the values of variables move in opposite directions. See ***Indirect correlation***

Answer 37

The most gross level of measurement by which a variable's value can be placed in one and only one category.

Answer 38

A research hypothesis that posits a difference between groups but not in either direction. See ***Directional research hypothesis***

Answer 39

Distribution-free statistics that do not require the same assumptions as do parametric statistics. See ***Parametric statistics***

Answer 40

A distribution of scores that is symmetrical about the mean, the median, and the mode and has asymptotic tails. Often called the ***Bell-shaped curve.***

Answer 41

A statement of equality between sets of variables. See ***Research hypothesis***

Answer 42

The score that is recorded or observed. See ***True score***

Answer 43

The value that results from the application of a statistical test. See ***Test statistic value***

Answer 44

A visual representation of a cumulative frequency distribution.

Answer 45

Used to compare a sample mean to a population mean.

Answer 46

A directional test, reflecting a directional hypothesis.

Answer 47

A test for the difference between two or more means. A simple analysis of variance (or ANOVA) has only one independent variable, whereas a factorial analysis of variance tests the means of more than one independent variable. One-way analysis of variance looks for differences between the means of more than two groups. See ***Analysis of variance***

Answer 48

A level of measurement that places a variable's value into a category and assigns that category an order with respect to other categories.

Answer 49

Those scores in a distribution that are noticeably much more extreme than the majority of scores. Whether a score is an outlier or not is usually an arbitrary decision made by the researcher.

Answer 50

A type of reliability that examines consistency across different forms of the same test.

Answer 51

Statistics used for the inference from a sample to a population that assume the variances of each group are similar and that the sample is large enough to represent the population. See ***Nonparametric statistics***

Answer 52

A numerical index that reflects the relationship between two variables with the removal of the influence of a third variable (called a mediating or confounding variable).

Answer 53

A numerical index that reflects the relationship between two variables, specifically how the value of one variable changes when the value of the other variable changes. See ***Correlation coefficient***

Answer 54

The percentage of cases equal to and below a particular score in a distribution or set of scores.
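
Read literally, this definition is a count of cases at or below a score, expressed as a percentage. The sketch below assumes that literal reading; the function name `percentile_rank` and the scores are hypothetical.

```python
def percentile_rank(scores, score):
    """Percentage of cases equal to or below the given score."""
    at_or_below = sum(1 for x in scores if x <= score)
    return 100 * at_or_below / len(scores)

scores = [50, 60, 70, 80, 90]  # hypothetical data
print(percentile_rank(scores, 70))  # 3 of 5 cases are at or below 70
```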

Answer 55

A tool in statistical software, such as SPSS or Excel, that allows the user to easily manipulate the rows, columns, and frequencies included in cross-tabulation tables.

Answer 56

The quality of a normal curve that is relatively flat compared with a normal distribution.

Answer 57

All the possible subjects or cases of interest. See ***Sample***

Answer 58

A positive correlation where the values of both variables change in the same direction. See ***Direct correlation***

Answer 59

After the fact, referring to tests done to determine the true source of a difference among three or more groups.

Answer 60

The treatment variable that is manipulated or the predictor variable in a regression equation. (See ***Independent variable***)

Answer 61

The positive difference between the highest and lowest score in a distribution. It is a gross measure of variability. Exclusive range is the highest score minus the lowest score. Inclusive range is the highest score minus the lowest score plus 1.
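
The two versions of the range can be sketched as follows. The function names and the scores are hypothetical.

```python
def exclusive_range(scores):
    """Highest score minus lowest score."""
    return max(scores) - min(scores)

def inclusive_range(scores):
    """Highest score minus lowest score, plus 1."""
    return max(scores) - min(scores) + 1

scores = [3, 7, 10, 15]  # hypothetical data
print(exclusive_range(scores), inclusive_range(scores))
```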

Answer 62

A level of measurement defined as having an absolute zero.

Answer 63

The equation that defines the points and the line that are closest to the observed scores.

Answer 64

The line drawn based on values in a regression equation. Also known as a ***trend line***.

Answer 65

The consistency of a test.

Answer 66

A statement of inequality between two variables. See ***Null hypothesis***

Answer 67

A subset of a population. See ***Population***

Answer 68

The difference between sample and population values.

Answer 69

Different ways of categorizing measurement outcomes: nominal, ordinal, interval, and ratio.

Answer 70

A plot of paired data points on an x-axis and y-axis, used to visually represent a correlation.

Answer 71

The risk set by the researcher for rejecting a null hypothesis when it is true. See ***Statistical significance***

Answer 72

A test for the difference between two or more means. A simple analysis of variance (or ANOVA) has only one independent variable, whereas a factorial analysis of variance tests the means of more than one independent variable. One-way analysis of variance looks for differences between the means of more than two groups. See ***Analysis of variance*** and ***One-way analysis of variance***

Answer 73

The quality of a distribution that defines the disproportionate frequency of certain scores. A longer right tail than left corresponds to a smaller number of occurrences at the high end of the distribution; this is a ***positively skewed distribution***. A shorter right tail than left corresponds to a larger number of occurrences at the high end of the distribution; this is a ***negatively skewed distribution***.

Answer 74

An analysis of variance summary table that lists sources of variance.

Answer 75

The average amount of variability in a set of scores or the scores' average deviation from the mean.

Answer 76

A measure of accuracy in prediction that reflects variability about the regression line. See ***Error in prediction***

Answer 77

A raw score that is adjusted for the mean and standard deviation of the distribution from which the raw score comes. (See ***z score***)
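
The adjustment is conventionally the raw score minus the mean, divided by the standard deviation. A minimal sketch, with a hypothetical function name and values:

```python
def z_score(raw, mean, sd):
    """Standard score: distance of a raw score from the mean, in standard deviation units."""
    return (raw - mean) / sd

# Hypothetical distribution with mean 100 and standard deviation 10.
print(z_score(110, 100, 10))  # one standard deviation above the mean
print(z_score(85, 100, 10))   # one and a half standard deviations below the mean
```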

Answer 78

The risk set by the researcher for rejecting a null hypothesis when it is true. (See ***Significance level***)

Answer 79

A set of tools and techniques used to describe, organize, and interpret information or data.

Answer 80

A chi-square test of two dimensions or more that examines whether the distribution of frequencies on a variable is independent of other variables.

Answer 81

A type of reliability that examines a test's consistency over time.

Answer 82

The value that results from the application of a statistical test. (See ***Obtained value***)

Answer 83

The line drawn based on values in a regression equation. Also known as a ***regression line***.

Answer 84

The score that, if it could be observed, would reflect the actual ability or behavior being measured. See ***Observed score***

Answer 85

A nondirectional test, reflecting a nondirectional hypothesis.

Answer 86

The probability of rejecting a null hypothesis when it is true.

Answer 87

The probability of accepting a null hypothesis when it is false.

Answer 88

A conservative estimate of a population parameter.

Answer 89

How well a test measures what it says it does.

Answer 90

How much scores differ from one another or, put another way, the amount of spread or dispersion in a set of scores.

Answer 91

The square of the standard deviation and another measure of a distribution's spread or dispersion.
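
The relationship can be checked with Python's standard `statistics` module. This sketch assumes the sample (n - 1) versions of both measures, and the scores are hypothetical.

```python
import math
import statistics

scores = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical data

sd = statistics.stdev(scores)           # sample standard deviation (n - 1)
variance = statistics.variance(scores)  # sample variance (n - 1)

# Squaring the standard deviation recovers the variance, up to rounding error.
print(math.isclose(sd ** 2, variance))
```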

Answer 92

The predicted Y value in a regression equation.

Answer 93

A raw score that is adjusted for the mean and standard deviation of the distribution from which the raw score comes. See ***Standard score***