stats other stuff Flashcards

1
Q

how do you calculate a marginal distribution

A

take every value in a column or row of a table and add them together, then divide by the table’s total

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

how do you calculate a conditional distribution

A

take one specific variable and divide it by its category (ex: girls with blue eyes divided by girls, when men with different colored eyes are also included in the table)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is a confounding variable

A

an outside variable that can cause disruption in a distribution’s results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what type of graph is good for categorical data

A

bar graphs and pie charts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what type of graph is good for quantitative data

A

dot plots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

why is it important to start the y axis of a graph on 0

A

starting the y axis on a higher number may cause exaggerated results

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what acronym should you use to describe a distribution’s properties

A

SOCS (shape, outliers, center, and spread)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

how do you interpret a stem and leaf plot

A

Digits in the stem represent the numbers first digit(s) and leafs represent the numbers second digit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what type of data uses histograms

A

quantitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is the difference in bar graphs and histograms

A

histograms are for quantitative data and their bars represent a range of data, but bar graphs are for categorical data and their bars represent specific values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is the difference in X bar and Mu (mean variables)

A

X bar represents the sample mean, while Mu represents the population mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

which type of distribution is described using mean and standard deviation

A

Normal distribution (symmetric)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what type of distribution is described using median and IQR

A

A skewed distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what percentile is Q1

A

25th

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what percentile is Q2

A

50th

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what percentile is Q3

A

75th (meaning 75% of the data values fall below the value of Q3)

17
Q

how do you calculate the high outliers in a data set

A

low outliers occur if points are higher than what you get when you calculate Q3 + (IQR x 1.5)

18
Q

how do you calculate the low outliers in a data set

A

low outliers occur if they are less than what you get when you calculate Q1- (IQR x 1.5)

19
Q

what elements does adding and subtracting affect in a distribution

A

mean and median but not spread (IQR or standard deviation)

20
Q

what acronym should you use to describe correlation and scatterplots

A

DUFS (direction, unusual points, form: whether its linear or not, and strength)

21
Q

what is the meaning of r2

A

how much of the data can be accounted for or explained by the independent variable in a distruibution

22
Q

what does correlation require

A

that both variables are quantitative

23
Q

how do you calculate residuals

A

Actual value - predicted value

24
Q

a census…

A

collects data from every individual in a population

25
Q

what is a simple random sample

A

a sample taken from a population where every individual has equal chance of being chosen

26
Q

blocking helps to

A

take away confounding variables by separating individuals in a data set into groups of shared characteristics

27
Q

undercoverage bias

A

some members can’t be chosen

28
Q

nonresponse bias

A

individual can’t be contacted or refuses to participate

29
Q

response bias

A

instances include when individuals lie

30
Q

wording of a question bias

A

when the question urges certain answers out

31
Q

what is a control in statistics

A

when other variables that may affect the response are kept the same for all groups

32
Q

replication

A

use enough groups to see more accuracy in affects

33
Q

statistically significant

A

when an observation has an affect so large it could not have happened by chance DOES IMPLY CAUSATION

34
Q
A