Final Exam Flashcards

Question

How do you describe the direction of a scatterplot?

Answer 1

positive association: high values of the two variables tend to occur together, positive slope on the line of best fit (LOBF)
negative association: low values of one variable tend to occur when the other variable is high, negative slope

Answer 2

linear relationships: points show a straight-line pattern. Curved and clustered are also good ways to describe form!

Answer 3

determined by how close the points in the scatterplot lie to a simple form such as a line

Answer 4

the strength and direction of the linear association between two quantitative variables x and y. r only measures straight-line relationships. it is always between -1 and 1. indicates strength by how close it is to -1 or 1 (-1 for a perfect negative association, 1 for a perfect positive association). CORRELATION IS NOT RESISTANT: outliers can change r a lot.
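A minimal sketch of how r could be computed by hand, assuming the usual formula r = Sxy / sqrt(Sxx * Syy). The function name `correlation` and the data are made up for illustration.

```python
import math

def correlation(xs, ys):
    """Pearson correlation r of two equal-length lists of numbers."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # Sums of squared deviations and of cross-products
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    return sxy / math.sqrt(sxx * syy)

# Made-up data for illustration only
x = [1, 2, 3, 4, 5]
y = [2, 4, 6, 8, 10]            # perfectly linear, positive slope
r_line = correlation(x, y)

# Replacing one point with an outlier changes r a lot (r is not resistant)
y_outlier = [2, 4, 6, 8, -10]
r_outlier = correlation(x, y_outlier)
```

Here `r_line` comes out at 1 because the points fall exactly on a line, while the single outlier in `y_outlier` is enough to flip the sign of r.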

Answer 5

straight line that describes how a response variable y changes as an explanatory variable x changes. You can use it to PREDICT the value of y for a given value of x.

Answer 6

a straight line y = a+bx that minimizes sum of squares of vertical distances of observed points from line.
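A small sketch of fitting y = a + bx, assuming the standard least-squares formulas b = Sxy / Sxx and a = y_bar - b * x_bar. The helper names and the data are hypothetical. The last lines check that nudging the slope or intercept only makes the sum of squared vertical distances larger.

```python
def least_squares(xs, ys):
    """Return (a, b) for the least-squares line y = a + b*x."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    b = sxy / sxx
    a = y_bar - b * x_bar
    return a, b

def sum_squared_errors(a, b, xs, ys):
    """Sum of squared vertical distances from the points to y = a + b*x."""
    return sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))

# Made-up data for illustration only
x = [1, 2, 3, 4, 5]
y = [2.1, 3.9, 6.2, 7.8, 10.0]
a, b = least_squares(x, y)

best = sum_squared_errors(a, b, x, y)
# Any other line should do no better than the least-squares line
worse_slope = sum_squared_errors(a, b + 0.1, x, y)
worse_intercept = sum_squared_errors(a + 0.1, b, x, y)
```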

Answer 7

use of a regression line for prediction using values of x outside the range of data from which the line was calculated. YES, AVOID IT: such predictions are often inaccurate.

Answer 8

differences between the observed values of y and the values of y predicted by the regression line: residual = observed y - predicted y.

Answer 9

average size of the prediction errors when using regression line
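A rough sketch of residuals and the typical prediction error s, assuming the common textbook definition s = sqrt(sum of squared residuals / (n - 2)). The data and variable names are invented for illustration.

```python
import math

# Made-up data for illustration only
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
n = len(x)

# Least-squares line y = a + b*x
x_bar = sum(x) / n
y_bar = sum(y) / n
b = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / sum((xi - x_bar) ** 2 for xi in x)
a = y_bar - b * x_bar

# residual = observed y - predicted y
residuals = [yi - (a + b * xi) for xi, yi in zip(x, y)]

# Standard deviation of the residuals (assumed divisor: n - 2)
s = math.sqrt(sum(e ** 2 for e in residuals) / (n - 2))
```

The residuals from a least-squares line always sum to zero (up to rounding), so s is used instead of a plain average to describe their typical size.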

Answer 10

r^2. fraction of the variation in one variable that is accounted for by least-squares regression on the other variable. Example: (r^2*100)% of the variation in y can be explained by the least-squares regression of y on x!
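A short sketch showing two ways to get r^2 that should agree: squaring r, and computing 1 - SSE/SST (the share of the variation in y left after removing the residual variation). The data and names are made up for illustration.

```python
import math

# Made-up data for illustration only
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

sxx = sum((xi - x_bar) ** 2 for xi in x)
syy = sum((yi - y_bar) ** 2 for yi in y)      # total variation in y (SST)
sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))

# Way 1: square the correlation
r = sxy / math.sqrt(sxx * syy)
r_squared_from_r = r ** 2

# Way 2: 1 - (unexplained variation / total variation)
b = sxy / sxx
a = y_bar - b * x_bar
sse = sum((yi - (a + b * xi)) ** 2 for xi, yi in zip(x, y))
r_squared_from_sse = 1 - sse / syy

percent_explained = 100 * r_squared_from_sse
```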

Answer 11

always interpret with caution. look for outliers that could affect regression line. do not conclude cause and effect between two variables JUST because of a strong correlation.

Answer 12

selects a sample from the population of all individuals about which we want info.

Answer 13

uses chance to select a sample

Answer 14

gives every possible sample of a given size the same chance to be chosen (not the same thing as giving every individual the same chance). Choose an SRS by labeling members with numbers and using random digits to select the sample.
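A quick sketch of choosing an SRS in software instead of with a random digit table. The population labels are hypothetical, and the seed is fixed only so the example is repeatable.

```python
import random

# Hypothetical population of 30 labeled members
population = [f"student_{i:02d}" for i in range(1, 31)]

rng = random.Random(42)          # fixed seed, for a repeatable illustration
# sample() draws without replacement, so every set of 5 is equally likely
srs = rng.sample(population, 5)
```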

Answer 15

divide population into strata, groups of individuals that are similar in some way that might affect their responses. Choose a separate SRS from each stratum.
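A minimal sketch of a stratified random sample: one separate SRS inside each stratum. The function name `stratified_sample`, the strata, and the sample size per stratum are all assumptions for illustration.

```python
import random

def stratified_sample(strata, n_per_stratum, rng):
    """Take a separate SRS of size n_per_stratum from every stratum."""
    sample = {}
    for name, members in strata.items():
        sample[name] = rng.sample(members, n_per_stratum)
    return sample

# Hypothetical population split by class year (the stratifying variable)
strata = {
    "freshman": [f"F{i}" for i in range(1, 41)],
    "senior": [f"S{i}" for i in range(1, 21)],
}
rng = random.Random(1)
sample = stratified_sample(strata, 5, rng)
```

Every stratum is guaranteed to show up in the sample, which an ordinary SRS does not promise.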

Answer 16

divide population into groups or clusters. randomly select some of these clusters. All individuals in the chosen clusters are included in the sample.
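A small sketch of a cluster sample: randomly pick whole clusters, then keep everyone in them. The function name `cluster_sample` and the homeroom clusters are hypothetical.

```python
import random

def cluster_sample(clusters, n_clusters, rng):
    """Randomly choose n_clusters clusters and include all of their members."""
    chosen = rng.sample(sorted(clusters), n_clusters)
    sample = [person for name in chosen for person in clusters[name]]
    return chosen, sample

# Hypothetical clusters: students grouped by homeroom
clusters = {
    "homeroom_A": ["A1", "A2", "A3"],
    "homeroom_B": ["B1", "B2"],
    "homeroom_C": ["C1", "C2", "C3", "C4"],
    "homeroom_D": ["D1", "D2", "D3"],
}
rng = random.Random(7)
chosen, sample = cluster_sample(clusters, 2, rng)
```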

Answer 17

Use a Simple Random Sample (SRS) when you want every member of the population to have an equal chance of being selected. A stratified sample is best when you want to ensure representation from different subgroups within the population. A cluster sample is ideal when you need to study large, geographically dispersed populations by randomly selecting groups (clusters) to sample from.

Answer 18

systematic errors in the way the sample represents the population.
voluntary response samples: respondents choose themselves, can cause bias.
convenience samples: individuals are close by and included in sample, prone to large bias.

Answer 19

errors that come from the act of choosing your sample.
random sampling error: the sample result differs from the truth about the population because chance selects the sample.
undercoverage: some members of the population are left out of the sampling frame, the list from which the sample is chosen.

Answer 20

nonsampling errors. have nothing to do with choosing the sample. this happens with nonresponse, when people can't be contacted or choose not to answer. Incorrect answers can lead to response bias. also happens with wording of questions, which can influence answers.

Answer 21

gathers data on individuals as they are

Answer 22

actively do something to measure a response.

Answer 23

when the effects of two variables on a response can't be distinguished from each other. observational studies and uncontrolled experiments often fail to show that changes in an explanatory variable actually cause changes in a response variable, because the explanatory variable is confounded with lurking variables.

Answer 24

a combination of values of the explanatory variables.

Answer 25

the smallest unit a treatment of an experiment is applied to.

Answer 26

control keeps lurking variables from being confounded with the explanatory variable.
random assignment of treatments is just using chance to assign treatments to experimental units.
replication is using enough experimental units (doing it over and over) so that chance variation is reduced and results are consistent.
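A sketch of random assignment: shuffle the experimental units, then deal them out to the treatments in turn. The function name `randomly_assign`, the unit labels, and the treatment names are assumptions for illustration.

```python
import random

def randomly_assign(units, treatments, rng):
    """Shuffle the units, then deal them out to the treatments in turn."""
    shuffled = list(units)
    rng.shuffle(shuffled)
    groups = {t: [] for t in treatments}
    for i, unit in enumerate(shuffled):
        groups[treatments[i % len(treatments)]].append(unit)
    return groups

# Hypothetical experiment: 20 subjects, two treatments
units = [f"subject_{i}" for i in range(1, 21)]
treatments = ["new_drug", "placebo"]
rng = random.Random(3)
groups = randomly_assign(units, treatments, rng)
```

Having 10 units in each group is the replication part: one unit per treatment would leave chance variation too large to see a treatment effect.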

Answer 27

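DB (double-blind): when neither party (the subjects or the people who interact with them) knows who has which treatment in an experiment.
SB (single-blind): when one party knows who has which treatment and the other does not.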

Answer 28

individuals that are similar in some way important to experiment

Answer 29

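requires that the individuals taking part in the study be randomly selected from the larger population. Doing this allows inference about the population. Inference about cause and effect needs random assignment of treatments in a well-designed experiment.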

Answer 30

the proportion of times that a particular outcome occurs in many repetitions will approach a single number.
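A rough simulation of this idea with a fair coin, assuming heads has probability 0.5. The function name `running_proportion` and the number of flips are arbitrary choices for illustration.

```python
import random

def running_proportion(n_flips, rng):
    """Proportion of heads after each flip of a simulated fair coin."""
    heads = 0
    proportions = []
    for i in range(1, n_flips + 1):
        if rng.random() < 0.5:       # assumed P(heads) = 0.5
            heads += 1
        proportions.append(heads / i)
    return proportions

rng = random.Random(2024)
proportions = running_proportion(100_000, rng)
early = proportions[9]       # proportion after 10 flips, can be far from 0.5
late = proportions[-1]       # proportion after 100,000 flips, close to 0.5
```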

Answer 31

imitation of chance behavior. follows the 4-step process SPDC (State, Plan, Do, Conclude).

Answer 32

describes chance behavior by listing possible outcomes in the sample space S and giving the probability of each outcome.

Answer 33

a subset of possible outcomes.

Answer 34

P(A^c) = 1-P(A)

Answer 35

events A and B are mutually exclusive if they have no outcomes in common.

Answer 36

P(A or B) = P(A) + P(B)

Answer 37

P(A or B), P(A and B)

Answer 38

P(A ∪ B) = P(A) + P(B) - P(A ∩ B)
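A quick check of the general addition rule by listing every outcome of two fair dice. The events A and B are made up for illustration, and exact fractions are used so there is no rounding.

```python
from fractions import Fraction
from itertools import product

# Sample space: all 36 equally likely rolls of two fair dice
outcomes = list(product(range(1, 7), repeat=2))

def prob(event):
    """Probability of an event given as a true/false test on an outcome."""
    return Fraction(sum(1 for o in outcomes if event(o)), len(outcomes))

def in_a(o):
    return o[0] + o[1] == 7      # A: the sum is 7

def in_b(o):
    return o[0] == 3             # B: the first die shows 3

p_a = prob(in_a)
p_b = prob(in_b)
p_a_and_b = prob(lambda o: in_a(o) and in_b(o))
p_a_or_b = prob(lambda o: in_a(o) or in_b(o))

# Subtracting P(A and B) avoids counting the roll (3, 4) twice
rule_value = p_a + p_b - p_a_and_b
```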

Answer 39

if one event has happened, the chance another will happen is a conditional prob. Notation P(B|A) represents prob of B given A has happened

Answer 40

the chance that event B occurs is not affected by whether or not A has occurred: P(B|A) = P(B) and P(A|B) = P(A). if events are mutually exclusive (and each has nonzero probability), they cannot be independent.

Answer 41

P(A ∩ B) = P(A)*P(B|A)
for independent events: P(A ∩ B) = P(A)*P(B)

Answer 42

divide both sides of general multiplication rule by P(A) and we get P(B|A) = P(A ∩ B) / P(A)
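A small worked check of P(B|A) = P(A ∩ B) / P(A) using two fair dice. The events are hypothetical and chosen so that A and B turn out not to be independent.

```python
from fractions import Fraction
from itertools import product

# Sample space: all 36 equally likely rolls of two fair dice
outcomes = list(product(range(1, 7), repeat=2))

def prob(event):
    """Probability of an event given as a true/false test on an outcome."""
    return Fraction(sum(1 for o in outcomes if event(o)), len(outcomes))

def in_a(o):
    return o[0] % 2 == 0         # A: the first die is even

def in_b(o):
    return o[0] + o[1] == 8      # B: the sum is 8

p_a = prob(in_a)
p_b = prob(in_b)
p_a_and_b = prob(lambda o: in_a(o) and in_b(o))

# Conditional probability formula
p_b_given_a = p_a_and_b / p_a

# Independent only if knowing A leaves the chance of B unchanged
independent = (p_b_given_a == p_b)
```

Here P(B|A) is 1/6 but P(B) is 5/36, so knowing the first die is even changes the chance of a sum of 8.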

Answer 43

consists of n independent trials with the same chance process, each resulting in a success or failure, prob of success = p. The count X of successes is a binomial random variable. Its probability distribution is a binomial distribution.
BINS
B: Binary (2 outcomes, success or failure)
I: Independent trials
N: Number of trials fixed in advance
S: Success (same value of p for all trials)

Answer 44

P(X = k) = (n choose k) * p^k * (1-p)^(n-k)

Answer 45

μ_X = np
σ_X = sqrt(np(1-p))
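A minimal sketch that computes binomial probabilities from the formula and compares the mean and standard deviation shortcuts with values computed directly from the distribution. The values n = 10 and p = 0.5 and the function name `binomial_pmf` are assumptions for illustration.

```python
import math

def binomial_pmf(k, n, p):
    """P(X = k) for a binomial random variable with n trials and success prob p."""
    return math.comb(n, k) * p ** k * (1 - p) ** (n - k)

# Hypothetical setting: 10 flips of a fair coin, X = number of heads
n, p = 10, 0.5
pmf = [binomial_pmf(k, n, p) for k in range(n + 1)]

p_exactly_5 = pmf[5]

# Shortcut formulas
mean_shortcut = n * p
sd_shortcut = math.sqrt(n * p * (1 - p))

# The same quantities computed directly from the distribution
mean_direct = sum(k * pk for k, pk in enumerate(pmf))
sd_direct = math.sqrt(sum((k - mean_direct) ** 2 * pk for k, pk in enumerate(pmf)))
```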

Just for stats final, notecards from summaries of chapters (70 cards)