Test Flashcards

1
Q

Categorical

A

Most common: graphical display
Pie chart
Bar chart

Pictograms
Frequency tables

Numerical summaries: category counts and percentages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Quantatative

A

Histogram [=including, (=not including
Stem plot
Box plot (5# summary)
- longer/shorter quartile means spread of data not more data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Mean

A

Average

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Median

A

Middle

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

More

A

Most often occurring

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Standard Deviation

A

Use w/ symmetry and mean

68% fall w/in 1 SD of the mean
95% fall w/in 2 SD of the mean
99% fall w/in 3 SD of the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

IQR

A

Inner quartile range

Gives us the middle 50%

Used w/skewed data and median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

1.5 IQR

Used to detect outliers

A

Q1-1.5(IQR)

Q3-1.5(IQR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Explanatory Variable

X

A

Variable that claims to explain, predict or affect the response

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Response variable (Y)

A

Outcome of the study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

C — Q

A

Box plots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

C — C

A

Two way tables / contingency table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Q — C

A

Conditional percentile tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Q — Q

A

Scatter plot

Increase in X = increase in Y
Decrease in X = decrease in Y

U shape = not positive or negative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

r= linear correlation coefficient

A

( -1 to 1 )

0 to -1 = neg relationship
0 to +1 = pos relationship

Measures strength of linear relationship

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Simpsons Paradox

A

When a lurking variable causes us to think the direction of an association

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Population

A

Group chosen for sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Sample Frame

A

List of individuals to be sampled

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Sample

A

Actual individuals chosen for sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Simple Random Sample

A

Individuals sampled at random without replacement.

Selecting names out of a hat

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Cluster sample

A

Used when population is naturally divided into groups

Students in university divided into majors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Stratified sample

A

Used when population naturally divided into subpopulations

Students in certain college divided by gender or year in college

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Systematic sample

A

Obtain contact information and sample every so many people (I.e. Every 50th person)

24
Q

Observational Study

A

Values of variables are recorded as they naturally occur

25
Q

Experiment

A

Researcher defines the explanatory variable

26
Q

Prospective

A

Values of the variables recorded forward in time

27
Q

Retrospective

A

Values of variables recorded backward in time

28
Q

Blind experiment

A

Subjects unaware of which treatment they are receiving

29
Q

Double Blind Experiment

A

Testing procedure designed to eliminate biased results

Where identity of those receiving a test treatment is concealed from both administrators and subjects until study is completed.

30
Q

Hawthorne Effect

A

People in an experiment behave differently from how they would normally behave.

31
Q

Lack of realism

A

Subjects/treatments/setting of an experiment may not realistically duplicate the conditions we want to study.

32
Q

Noncompliance

A

Failure to conform to roles / standards

33
Q

Blocking

A

Divide subjects into groups of individuals who are similar with respect to an outside variable.

34
Q

Matched pairs

A

Special case of randomized block design. Used when experiment has two treatment conditions and subjects can be grouped into pairs. Then within each pair subjects are randomly assigned to different treatments.

35
Q

Randomized response

A

Survey technique for eliminating evasive answers.

36
Q

Leading question

A

Questions that influence the response

37
Q

Sensitive questions

A

Questions that may make someone answer dishonestly because of how they feel. (I.e. Questions about lowest grade last year).

38
Q

Classical problems (theoretical/true problems)

A

Games of chance

Flipping coins, rolling dice, spinning spinners

39
Q

Empirical problems (relative frequency)

A

Run a simulation or use a random sample

Use a series of trials that produce outcomes that cannot be predicted in advance

40
Q

Law of large numbers

A

As the number of trials increases, the relative frequency becomes the actual probability

41
Q

Rule #1. Probabilities are between 0 and 1

A

A +B + C = 1

42
Q

Rule #2

Something must happen

A

As number increases should see a change

43
Q

Rule #3. Complement Rule

A

P(not A) = 1-P(A)

44
Q

Rule #4 addition rule for disjoint events

A

P(A or B) = P(A) + P(B)

P(A or B) = probability that event A occurs or event B occurs or both)

45
Q

Rule #5 multiplication rule for independent events

A

P(A and B) = P(A)*P(B)

P(A and B) = Probability that event A and event B occur.

46
Q

Disjoint events

A

Whether or not it is possible for the events to occur at the same time

47
Q

Independent events

A

If event A occurring does not effect the probability of event B will occur.

48
Q

Probability of at least

A

L = at least / or not

P(L) = 1-P(notL)

49
Q

Rule #6 general rule of addition

A

P(A or B) = larger number

P(A and B) = smaller number

50
Q

Distribution means…

A

What values the variables take and how often the variables take those values.

51
Q

Skewed right

A

Most data on left, minimal on right.
(Right tail (larger values) is much longer than left tail (smaller values))

Example: distribution of salary.

52
Q

Skewed Left

A

Left tail (smaller values) is much longer than the right tail (larger values).

(Example: age of death from natural causes).

53
Q

Stemplot

A

Retains actual data and organized it.

54
Q

Given that the student is male, what is the probability that he has one or both ears pierced?

A

P(E) = probability of having one or both ears pierced

P(M) = male student

P(E | M )

55
Q

Formal definition of probability

A

P(B | A) = P(A and B) / P(A)

56
Q

If service A has failed to deliver the document on time, what is the probability that it has arrived on time using service B.

A

P(B | not A) = P( B and not A) / P ( not A)

57
Q

Testing for independence

A

Compare the overall probability to the conditional probability.

Compare P(B | A) to P (B) as well as P(A | B) to P(A) or,
P(B | A) to P(B | not A)

If the two events are equal then they are independent.