Statistics Review Flashcards

1
Q

Descriptive statistics

A

A set of statistics used to organize and summarize the properties of a set of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Data matrix

A

A grid presenting collected data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Frequency distribution

A

A table that gives a visual picture of the observations on a particular variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Frequency histogram

A

A data visualization technique showing how many of the cases in a batch of data scored each possible value, or range of values, on the variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Dot plot

A

A data visualization technique in which every data point for a given variable is represented.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Central tendency

A

A measure of what value the individual scores tend to center on.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Mode

A

The value of the most common score; the score that was received by more members of the group than any other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Bimodal

A

A distribution with more than one mode or score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Multimodal

A

Having more than two modes or scores.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Median

A

The value at the middlemost score of a distribution of scores; the score that divides a frequency distribution into halves.
Most appropriate measure of central tendency when a set of data has outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Mean

A

An arithmetic average; a measure of central tendency computed from the sum of all scores in a set of data, divided by the total number of scores.
Most common measure of central tendency (but not always the most appropriate way to measure central tendency).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Variance

A

A computation that quantifies how spread out the scores of a sample are around their mean; it is the square of the standard deviation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Standard deviation

A

A computation that measures how far, on average, each score is in a data set from the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Box plot

A

A data visualization technique that depicts a sample’s median, interquartile range (25th and 75th percentiles), and outliers. AKA box and whiskers plot.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Outlier

A

A score that stands out as either much higher or much lower than most of the other scores in a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

z score

A

A computation that describes how far an individual score is above or below the mean, in standard deviation units. Also called standardized score.

17
Q

Z score formula

A

z = (X - M)/SD

18
Q

Variance formula (compute sample SD)

A

SD^2 = Σ(X - M)^2/N

19
Q

Standard deviation formula

A

SD = square root of (SD)^2

20
Q

Variance formula (estimate pop SD)

A

SD^2 = Σ(X - M)^2/(N - 1)

21
Q

r formula

A

r = Σ(z[x]z[y])/(N - 1)

22
Q

Cohen’s d

A

A measure of effect size indicating how far apart two group means are, in standard deviation units. Tells us how much overlap there is between the two sets of scores.

23
Q

Cohen’s d formula

A

d = (M1 - M2)/SD[pooled]

24
Q

Inferential statistics

A

A set of techniques that use data from a sample to estimate what is happening in the population.

25
Q

Point estimate

A

A single estimate based on our sample data of the true value in the population. May be a percentage, difference between means, or relationship between two variables.

26
Q

Confidence interval (CI)

A

A given range indicated by a lower and upper value that is designed to capture the population value for some point estimate (e.g., percentage, difference, or correlation); a high proportion of CIs will capture the true population value.

27
Q

Population Estimation

A
  1. Our research question is about the whole population.
  2. The quality of the sample data matters.
  3. The population value is unknown.
  4. Larger samples give more certain estimates.
  5. To get a better estimate, we should do more than one poll.
28
Q

The steps of estimation and precision

A
  1. State a research question using terms such as “how much” or “to what extent”.
  2. Design a study: operationalize the question in terms of variables that are either manipulated or measured.
  3. Collect the data and compute the point estimate and confidence interval.
  4. Interpret the results in the context of your research question.
  5. If possible, conduct the study again and meta-analyze the results.
29
Q

Margin of error of a percent estimate formula

A

SD * sqare root of (1/N) * 1.96

30
Q

Standard error

A

The typical, or average, error researchers make when estimating a population value.

31
Q

The constant associated with 95% confidence

A

1.96. This is a Z score from a normal distribution.

32
Q

Dependent samples design

A

A design in which each person has two scores because they were tested under two conditions, and we are interested in the difference between them. Also called a paired design.

33
Q

Null hypothesis significance testing (NHST)

A

An inferential statistical technique in which a result is compared to a hypothetical solution in which there is no relationship or no difference.

34
Q

p value

A

In NHST, the probability of getting the result in a sample or one more extreme, by chance, if there is no relationship or difference in the population.

35
Q

The NHST procedure

A
  1. Assume that there is no effect (null hypothesis)
  2. Collect data and calculate your result
  3. Calculate the probability of getting a result of that magnitude, or one even more extreme, if the null hypothesis is true
  4. Decide whether to reject or retain the null hypothesis
36
Q

Statistically significant

A

In NHST, the conclusion assigned when p < .05; that is, when it is unlikely the result came from the null-hypothesis population. There is a less than 5% chance that the null hypothesis is true.

37
Q

Not statistically significant

A

In NHST< the conclusion assigned when p > .05; that is, when it is likely the result came from the null-hypothesis population.

38
Q

Alpha level

A

The value, determined in advance, at which researchers decide whether the p value obtained from a sample statistic is low enough to reject the null hypothesis or too high, and thus retain the null hypothesis.