Stats Final Exam Flashcards

1
Q

What type of data is collected through a count?

A

Discrete quantitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What type of data is collected through measurement

A

Continuous quantitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What type of data is categorical?

A

Qualitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the 3 principles of experimental design?

A
  1. Control
  2. Randomization
  3. Replication
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the difference between observation and experiment

A
  • o= observe + take measurements, data that already exists
  • e= impose treatments and controls// one variable is the cause of changes
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

This is an experimental condition that determine the levels of single/multifactors

A

experimental treatment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the sampling method used to eliminate bias?

A

Simple Random Samples (SRS)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are 3 biased sampling schemes?

A

convenience sampling, voluntary response sampling, nonresponse sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

A call-in radio show that solicit audience participation in surveys on controversial topics like abortion is an example of

A

voluntary response sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

This is a variable that is not the explanatory variable but is thought to affect the response variable

A

Confounding, lurking

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When can confounding variables be a problem?

A
  • The results show a false correlation btwn dependent and independent variables,
  • Null hypothesis.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why is control, randomization, + replication important?

A

differences in the results of an experiment are not attributed to chance but are caused by treatments.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do you look at a histogram and find the median

A
  1. write the values on the graph
  2. Add them up
  3. +1
  4. divided by 2
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Which measure of center is NOT resistant to outliers- Median or Mean?

A

The Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does the z score measure?

A

whether the data value is above or below average in standard deviations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the symbol for correlation

A

r

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

In a scatter plot- what does it mean when the data points are spread apart

A

r value further away from 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What types of variables must you use to compute the r value

A

quantitative values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What do the regression line and the correlation have in common?

A

both have the same sign (neg or pos)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is included in a 5 number summary

A
  1. Min
  2. IQ1
  3. Median
  4. IQ3
  5. Max
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

If the scatter plot is curved is the regression line and r value appropriate?

A

no

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What are the 3 percentages affiliated with the empirical rule? (if histogram is bell shaped)

A

68

95

99.7

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

How many standard deviations is 68% away from the mean?

A

1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

How many standard deviations is 95% away from the mean?

A

2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
How many standard deviations is 99.7% away from the mean?
3
26
What is the formula for the z score?
individual- pop mean / standard deviation
27
What is the value you get in the z-score mean
it is the number of standard deviations= correspons with the s.d. value
28
What are the steps to determine an outlier?
1. IQR x 1.5 2. Add that value to Q3 3. Add value to Q1 4. Evaluate new range and compare
29
What must you do before graphing a box plot?
determine if there are any outliers
30
In terms of scatter plot, what does bivariate mean?
2 *quantitative* variables being compared
31
Which value is the **slope** b0 or b1?
b1
32
How do you compute the y-intercept?
b0= yavg- ( b1 \* xavg)
33
What tables do you need to get the sum for b1?
x, y, x - xavg, y - yavg, x - xavg2, y - yavg2
34
_Sxy_/Sxx
35
In the error sum of squares what new value are you computing?
the comparison of y actual and y with the equation
36
What does r2 measure
the percent of variation explained by observed values in response to the regression
37
the total sum of squares
is the total sum of variation in the *response* variable.
38
SSR or regression sum of squares
the variation explained by the regression
39
What indicates a strong linear relationship of r
value close to -1 or 1
40
# Define a random experiment
an action whose outcome cannot be predicted with certainity. Each subject equally likely.
41
# Define a sample space
the collection of all possible outcomes for an experiment
42
# Define an Event
a collection of outcomes. A subset of a sample space.
43
# Define mutually exclusive
2 or more events, no 2 having anything in common
44
Define independent events
may still have something in common, but does not affect the probability of the former event.
45
# Define the complement of an event
1- the probability the evend does not occur. *Not A*
46
Descrime the **union** of 2 events
A can occur B can occur or Both can occur
47
What is the **intersection** of 2 events
A and B occur simultaneously
48
P(A) = 1 means..
the event is **certain** to happen
49
P(A) = 0 neabs
the event will **NOT** happen
50
What rule does this equation fall under: P(A **or** B) = P(A) + P(B)
Special Addition for _mutually exclusive_ events
51
The complementation rule formula
P(A) = 1 - P(not A)
52
What rule does this equation fall under: P(A or B) = P(A) + P(B) - **P(A + B)**
_General Addition Rule_. Because it will always get you the right answer
53
What is a contingency table?
* the distribution of one variable in rows another in columns * study the association between the two variables
54
the probability of one event occuring when it's known that another one has occurred.
Conditional Probability
55
Finish the equation P ( B | A)=
P(A & B) / P(A). (Pay attention to the sample size.) **General Conditional Rule.**
56
When referring to the intersection of 2 events, What is the general rule?
**P(A & B)** = P(A) \* P (B | A)
57
What is the special conditional rule-
When A & B are independent events. P(B | A) = P(B)
58
When referring to the union of 2 events. What is the special rule for **independent events**
P( A & B) = P(B) \* P(A)
59
What is a random variable?
a **numerical** value that's determined by chance
60
# Define the probability distribution
the probabilities with which X takes those values
61
How to remember greater than and less than....
the pointy end is facing the smaller one ex 9 \>6
62
When reading the equation, decide which formula to use before you plug in the equation
63
If two events are mutually exclusive, what is their probability?
0
64
If two events are **independent** what is the probabilty?
you must multiply them
65
Suppose E and F are 2 mutually exclusive event, with P(E)=0.4 and P(F)=0.2 then P(E or F) equals..
0.6
66
How would you write 5% of students who work full time are full time student
P(S|W) = 0.05
67
How do you explain in words when 2 events are dependent
P(S|W) does NOT equal P(S)
68
What would a venn diagram look like if Student A skips class and Student B does not skip class
69
70
What is the proportion formula for Z in terms of confidence interval
P(1 - P)/N, then SQUARE ROOT
71
Where does a density curve lie on a graph
the *x* axis
72
What is the total area under a curve?
1
73
What is a continuous probability distribution?
a density curve
74
How do you find the probability association with a normal distribution
* z = x - µ / σ * then use the Table II
75
What is a percentile?
Solve for X "unstandardize" x= µ + zσ
76
# Define statistic
the value of a statistic will vary from one random sample to the next
77
If the x variable follow a normal distribution with µ and σ, then which rule applies?
68 - 95 - 99.7 %
78
Parameter
the value of a parameter remains constant
79
# Define Sampling Error
* Because statistics are random variables * a slight error associated with the estimate * x - u
80
What is a sampling distribution
* a probability distribution for all possible samples
81
If the sample is from a normal poulation then what does x bar have?
a normal sampling distribution
82
What is σx?
= σ / square rt of *n*
83
How are probabilities found?
standardizing using the standard normal table
84
How are percentiles found?
by unstandardizing from the standard normal table
85
# Define the Central Limit Theorem
* when the sample size *n* is large is a normal distribution
86
# Define a point estimate?
* single number estimate * give no indication of the size of the sampling error
87
Will the point estimate equal the true value?
No, because of sampling error
88
What does the margine of error indicate
how big the sampling error might be
89
What is a confidence interval?
a range of estimate for the true population parameter. = Point Estimate +/- Margin of Error
90
What does level of confidence mean?
how certain we can be that the true value is contained in the interval (%)
91
What also increases as the level of confidence increases?
the width of the confidence interval
92
What happens to the width of the confidence interval when *n* increases?
the width decreases
93
If you are given the endpoints of a confidenct interval, how do you calculate the mean?
add and divide by 2
94
What weight sparates the value of the heaviest 10% al all books from the 90% (90th percentile) what type of question is this?
unstandardizing- look for Z in table II
95
If you are given *n * in a problem with mean and standard deviation, what is the first thing you need to do?
1. convert σ to σx 2. use z equation 3. look up in table II
96
which equation do you use to obtain the confidence interval?
t-interval ch 8
97
When looking for a proportion, where can you find the equation?
Chapter 12- "z-interval for p"
98
For a hypothesis test, if the question says the words "true mean" which equation do you use?
99
When the equation says **not equal** what do you need to do?
* multiply T x 2, if its comparing u * multiply P x 2, if it comparing 2 u values
100
If the P value is less than alpha what does that mean?
101
The greater the t value means
102
If the p value is greater than alpha, what does this mean?
103
if your using the t-interval for u. what do you do to finish the formula?
look up t value in table iv
104
when do you use the t-interval for u equation
when you do not know sigma (standard deviation)