Test #1 Flashcards

0
Q

Variable

A

Any characteristic of a case

1
Q

Cases

A

The subjects of a data set (objects or people)

2
Q

Histogram

(connected)

A

Shows how the values of a quantitative variable are distributed; adjacent bars touch

3
Q
Bar chart
(not connected)
A

Compares the values of different items, uses categorical variables

4
Q

Ways to describe a bar chart/histogram

A
  1. Shape
  2. Center
  3. Spread
5
Q

Outlier

A

Any value that falls outside the overall pattern; it can strongly affect the mean and standard deviation

6
Q

Mode

A

The most common value in a data set; shows up as a major peak of a bar chart/histogram

7
Q

Symmetric

A

The left and right sides of the distribution are approximate mirror images

8
Q

Skewed

A

Distribution is concentrated on one side, with a longer tail stretching to the other (skewed right = long right tail, skewed left = long left tail)

9
Q

Shape

A

Symmetric vs. skewed

10
Q

Center

A

Mean vs. median

11
Q

Spread

A

Standard deviation vs. IQR

12
Q

Categorical variables

A

Data is words, places cases into categories

13
Q

Quantitative variables

A

Data is numbers, measures the values of each case

14
Q

Median

A

The middle value or midpoint of a distribution

15
Q

Mean

A

The average value of a distribution

16
Q

Best ways to describe a distribution

A

Measure of center and measure of spread

17
Q

Q1

A

The median of the data which fall to the left of the overall median

18
Q

Q3

A

The median of the data which fall to the right of the overall median

19
Q

Five-number summary

A
Min
Q1
Median
Q3
Max
20
Q

Boxplot

A

A graph of the five-number summary

21
Q

IQR

A

Q3 - Q1 (the distance between the quartiles)
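
As a rough illustration of the Q1, Q3, five-number summary, and IQR cards, here is a minimal Python sketch. It assumes the median-of-halves rule described on the Q1 and Q3 cards (software packages often use other quartile rules), and the function name and sample scores are made up for the example.

```python
from statistics import median

def five_number_summary(values):
    """Return (min, Q1, median, Q3, max) using the median-of-halves rule.

    Hypothetical helper for illustration; needs at least two values.
    """
    data = sorted(values)
    n = len(data)
    half = n // 2
    lower = data[:half]           # values to the left of the overall median
    upper = data[half + n % 2:]   # values to the right (middle value skipped when n is odd)
    return data[0], median(lower), median(data), median(upper), data[-1]

scores = [2, 4, 4, 5, 7, 8, 9, 10, 12]   # made-up sample data
low, q1, med, q3, high = five_number_summary(scores)
iqr = q3 - q1                            # distance between the quartiles
print(low, q1, med, q3, high, iqr)
```

A boxplot of these scores would draw the box from Q1 to Q3 with a line at the median.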

22
Q

Standard deviation formula

A

$s = \sqrt{\frac{1}{n-1}\sum_{i=1}^{n}(x_i - \bar{x})^2}$, where $n$ is the number of cases and $\bar{x}$ is the mean
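
The formula on this card can be sketched step by step in Python. This is only an illustration: the function name and the data are invented, and the result is compared against the standard library's `statistics.stdev`, which also divides by n - 1.

```python
from math import sqrt
from statistics import stdev

def sample_sd(values):
    """Standard deviation with the (number of cases - 1) divisor."""
    n = len(values)
    mean = sum(values) / n
    # Sum of squared distances from the mean
    squared_deviations = sum((x - mean) ** 2 for x in values)
    return sqrt(squared_deviations / (n - 1))

data = [2, 4, 4, 4, 5, 5, 7, 9]   # made-up sample data
s = sample_sd(data)
print(s, stdev(data))             # the two values should agree
```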

23
Q

Standard deviation

A

Measures how far the observations typically fall from the mean. Never negative; equals 0 only when all values are the same.

24
Q

Normal distribution

A

Bell curve, symmetric, unimodal

N(mean, standard deviation)

25
Q

Unimodal

A

A distribution that contains one single peak

26
Q

68-95-99.7 rule

A

68% of observations fall within 1 standard deviation of the mean
95% fall within 2 SD of the mean
99.7% fall within 3 SD of the mean
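
The three percentages are rounded values for a normal distribution. One way to check them, assuming Python 3.8+ where `statistics.NormalDist` is available, is to compute the area within k standard deviations of the mean (the helper name `share_within` is made up):

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)

def share_within(k):
    """Proportion of a normal distribution within k standard deviations of the mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)

for k in (1, 2, 3):
    print(k, round(share_within(k) * 100, 1))   # percentages close to 68, 95, 99.7
```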

27
Q

Z-score

A

Standardized value of x: how many standard deviations x lies above or below the mean

28
Q

Z-score formula

A

z = (x - mean) / standard deviation
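
A small worked example, with invented numbers: a score of 85 in a distribution with mean 70 and standard deviation 10. The subtraction has to happen before the division, which the parentheses in the sketch make explicit.

```python
def z_score(x, mean, sd):
    """Number of standard deviations x lies from the mean."""
    return (x - mean) / sd   # subtract first, then divide

z = z_score(85, 70, 10)      # made-up values
print(z)                     # 1.5 standard deviations above the mean
```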

29
Q

Proportion

A

A fraction of the whole, written as a decimal between 0 and 1

30
Q

Response variable (y-axis)

A

Dependent variable, measures outcome

31
Q

Explanatory variable (x-axis)

A

Independent variable, explains or causes the change in the response variable

32
Q

Scatterplot

A

Shows the relationship between 2 quantitative variables measured on the same individuals

33
Q

Ways to describe a scatterplot

A
  1. Form
  2. Direction
  3. Strength
34
Q

Form

A

Linear vs. nonlinear (curved)

35
Q

Direction

A

Positive vs. negative vs. none

36
Q

Strength

A

Strong vs. weak

37
Q

Correlation r formula

A

$r = \frac{1}{n-1}\sum_{i=1}^{n}\left(\frac{x_i - \bar{x}}{s_x}\right)\left(\frac{y_i - \bar{y}}{s_y}\right)$, where $s_x$ and $s_y$ are the standard deviations of x and y
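
Read as a recipe, the formula standardizes each x and y, multiplies the pairs, adds the products, and divides by n - 1. A minimal Python sketch under that reading (the function name and data are hypothetical):

```python
from statistics import mean, stdev

def correlation(xs, ys):
    """Correlation r as the average product of standardized x and y values."""
    n = len(xs)
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)   # sample standard deviations (n - 1 divisor)
    total = sum(((x - x_bar) / s_x) * ((y - y_bar) / s_y) for x, y in zip(xs, ys))
    return total / (n - 1)

xs = [1, 2, 3, 4, 5]                  # made-up data
r_up = correlation(xs, [2, 4, 6, 8, 10])     # points on a rising line
r_down = correlation(xs, [10, 8, 6, 4, 2])   # points on a falling line
print(r_up, r_down)
```

Points that fall exactly on a rising line give r close to 1, and points on a falling line give r close to -1.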

38
Q

Correlation r

A

Measures the direction and strength of a linear relationship. Always between -1 and 1: positive for a positive association, negative for a negative association.

39
Q

Regression line

A

A straight line that shows how the response variable changes as the explanatory variable changes. Used to predict the value of y for a given value of x.

40
Q

Formula for predicting y (regression line)

A

y = slope (x) + intercept

41
Q

Slope formula

A

Slope = r (standard deviation of y / standard deviation of x)

42
Q

Slope

A

A change of one standard deviation in x corresponds to a change of r standard deviations in y
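
The regression cards can be combined into a short Python sketch. The slope follows the card's formula; the intercept rule (mean of y minus slope times mean of x) is not on these cards and is added here as the usual least-squares convention. Function names and data are invented.

```python
from statistics import mean, stdev

def fit_line(xs, ys):
    """Return (slope, intercept) with slope = r * (s_y / s_x)."""
    n = len(xs)
    x_bar, y_bar = mean(xs), mean(ys)
    s_x, s_y = stdev(xs), stdev(ys)
    r = sum(((x - x_bar) / s_x) * ((y - y_bar) / s_y) for x, y in zip(xs, ys)) / (n - 1)
    slope = r * s_y / s_x
    # Assumption (not on the cards): the line passes through (mean of x, mean of y)
    intercept = y_bar - slope * x_bar
    return slope, intercept

hours = [1, 2, 3, 4, 5]          # explanatory variable (made-up data)
scores = [52, 55, 61, 64, 68]    # response variable (made-up data)
slope, intercept = fit_line(hours, scores)
predicted = slope * 6 + intercept   # predicted y for x = 6
print(slope, intercept, predicted)
```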

43
Q

Measure of center and spread for symmetric data

A

Mean and standard deviation

44
Q

Measure of center and spread for skewed data

A

Median and IQR
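
One reason for this pairing: the mean is pulled toward extreme values while the median barely moves. A quick Python illustration with made-up numbers, where a single outlier is added to a roughly symmetric data set:

```python
from statistics import mean, median

typical = [30, 32, 35, 36, 38, 40, 41]   # roughly symmetric, made-up data
with_outlier = typical + [250]           # same data plus one extreme value

print(mean(typical), median(typical))            # mean and median agree
print(mean(with_outlier), median(with_outlier))  # mean jumps, median barely changes
```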