UNIT 3 and UNIT 4 COMBO Flashcards

Question

What is a factor?

Answer 1

A variable in an experiment that the experimenter manipulates.

Answer 2

Retrospective, and Prospective

Answer 3

1 (or 100%)

Answer 4

SPLIT UP POPULATION FIRST (by % with condition). then split the groups by outcomes of the test

Answer 5

the probability that it doesn't happen. 1-P(it happens). (together they add to 100%)

Answer 6

To blind the subjects being experimented on, so that knowing which treatment they received can't influence the response variable. When people think they're getting help, they often improve anyway.

Answer 7

A sample where every possible sample has the same chance of being selected. There are no impossible samples.

Answer 8

It's any systematic failure of a sampling method to represent the population. COMMON ERRORS: voluntary response, undercoverage of the population, nonresponse bias, and response bias. We use randomness and methods like stratifying to reduce these.

Answer 9

Words or phrases that impact your feelings tend to influence responses. Look for "devastating, horrific, wonderful, etc." Sometimes there is a background story like "Many Americans lose jobs to illegal aliens every year. How do you feel about the border wall?"

Answer 10

geocdf(p, 5)

Answer 11

NEED only 3: control, randomization, replication. MAKE SURE YOU COMPARE. Use blocking when appropriate.

Answer 12

OR probability. Use when not disjoint. (subtract overlap) P(this OR that) = P(this)+P(that) - P(this and that) (IT ALWAYS WORKS IN ALL SITUATIONS, when disjoint, P(this and that)= 0, so you end up with the simpler disjoint version)
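A quick check of the OR rule, as a minimal sketch. The deck of cards and the two events (hearts, face cards) are assumptions chosen for illustration, not part of the card. The rule is compared against a direct count.

```python
from fractions import Fraction

# Hypothetical example: a standard 52-card deck as (rank, suit) pairs.
RANKS = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
SUITS = ["hearts", "diamonds", "clubs", "spades"]
DECK = [(rank, suit) for rank in RANKS for suit in SUITS]


def prob(event):
    """Probability of an event = matching cards / total cards."""
    return Fraction(sum(1 for card in DECK if event(card)), len(DECK))


def is_heart(card):
    return card[1] == "hearts"


def is_face(card):
    return card[0] in {"J", "Q", "K"}


# General addition rule: P(this) + P(that) - P(this and that)
p_or_by_rule = (
    prob(is_heart) + prob(is_face) - prob(lambda c: is_heart(c) and is_face(c))
)

# Direct count of cards that are a heart OR a face card
p_or_by_count = prob(lambda c: is_heart(c) or is_face(c))

print(p_or_by_rule, p_or_by_count)  # both are 11/26
```

Without subtracting the overlap, the three face cards that are also hearts would be counted twice.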

Answer 13

These numbers come from the coefficients of expanded binomials..(x+y)¹, (x+y)², (x+y)³

Answer 14

3X is just tripling one play: multiply the mean and the SD by 3. X+X+X is playing 3 separate times: multiply the mean by 3, BUT you must add variances: square the SD, add it 3 times, then take the sqrt.
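The difference between 3X and X+X+X can be sketched in a few lines. The function names and the sample numbers (mean 10, SD 2) are made up for illustration, and the three plays are assumed to be independent.

```python
import math


def triple_one_play(mean, sd):
    """3X: one play, tripled. Mean and SD both scale by 3."""
    return 3 * mean, 3 * sd


def sum_of_three_plays(mean, sd):
    """X+X+X: three independent plays. Means add; VARIANCES add."""
    total_variance = sd**2 + sd**2 + sd**2
    return 3 * mean, math.sqrt(total_variance)


# Hypothetical game with mean 10 and SD 2
print(triple_one_play(10, 2))      # SD is 3 * 2
print(sum_of_three_plays(10, 2))   # SD is sqrt(12), smaller than 6
```

Both versions have the same mean, but playing three separate times has the smaller spread because highs and lows partly cancel.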

Answer 15

A cluster sample is when the population is first divided into sections, or clusters, that each have all of the traits that the population has, so the clusters are representative. You grab a cluster as your sample. A random sample is all names in a hat, so you could get any group.

Answer 16

1. Being a DOG and being SMELLY 2. Being a FRESHMAN and being FEMALE 3. Liking ICE CREAM and liking HAMBURGERS(both can be true simultaneously)

Answer 17

Assign heads to odd numbers and tails to even numbers.

Answer 18

In an SRS, all samples are possible and all possible samples have the same chance of being picked. The other methods have lots of "impossible sample groups". Stratified: an impossible group would be all girls (you're taking some boys and some girls). Clustered: an impossible group would be all girls (each cluster has boys and girls). Systematic: an impossible group would be 4 people that are right next to each other (you are taking every nth person).

Answer 19

randomness. sophisticated answer: make as many things as random as possible

Answer 20

mean: 3y+5b sd: SQRT(9z²+25c²) var: 9z²+25c² (same as (3z)² + (5c)²)

Answer 21

Assign everyone a 2-digit number (toss out repeaters), then simply sort from lowest to highest. The lowest n get treatment 1, the next n get treatment 2, the next n get treatment 3, etc.
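The sort-by-random-number method can be sketched as below. The function name `assign_treatments` and the subject labels are hypothetical. Drawing the numbers without replacement stands in for "toss out repeaters".

```python
import random


def assign_treatments(subjects, n_treatments, rng):
    """Give each subject a unique two-digit number, sort, and split into equal groups."""
    if len(subjects) > 100:
        raise ValueError("only 100 two-digit labels (00-99) are available")
    if len(subjects) % n_treatments != 0:
        raise ValueError("subjects must split evenly across treatments")

    # sample() draws without replacement, so there are no repeated numbers
    numbers = rng.sample(range(100), len(subjects))

    # Sort subjects from lowest random number to highest
    ordered = [subject for _, subject in sorted(zip(numbers, subjects))]

    # Lowest n get treatment 1, next n get treatment 2, and so on
    size = len(subjects) // n_treatments
    return [ordered[i * size:(i + 1) * size] for i in range(n_treatments)]


groups = assign_treatments(list("ABCDEFGHIJKL"), 3, random.Random())
for number, group in enumerate(groups, start=1):
    print("treatment", number, group)
```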

Answer 22

A misinterpretation of the law of large numbers: thinking things will even out. Using this "law," if you flipped 4 heads in a row, you'd expect the next one to be tails because it should even out in the long run. Not true. 5 flips is not the long run. Infinity is. The next flip still has a 50% chance of being another head. You may hear someone say "he's due for a hit" or "it's bound to rain soon." Both bad.

Answer 23

It is "n factorial." Example: 5! = 5 × 4 × 3 × 2 × 1 = 120. It tells you how many ways you can arrange n objects.

Answer 24

You don't have to, but you might want to if you feel that the experimental units (subjects) may respond differently to the treatment because of confounding variables. Like if you were testing out a new deodorant. You might want to block according to activity level so you don't get all of the active people in one group (they sweat more).

Answer 25

Response bias is any influence that may sway the respondent to give a more favorable answer, e.g. the wording of the question or the interviewer's behavior/background. Therefore, in a survey, ask questions that allow respondents to answer comfortably and honestly. Keep the wording "indifferent" or neutral so that it does not unduly favor one response over another. CONTROL the environment so that it is similar for all subjects.

Answer 26

a misinterpretation of the law of large numbers. Using this law, if you flipped 4 tails in a row, you'd expect the next one to be another tails, because tails is "hot." A baseball player who gets three hits in a row, you expect another hit? wrong. Streaks happen randomly (actually there is a little evidence for hot hand in some sports, but more research needs to be done)

Answer 27

1. Being tall and having a high GPA 2. If it is snowing and whether it is a Thursday or not 3. Whether a person likes pizza and their gender(notice, knowing one bit of information does not impact the likelihood of the other being true also)

Answer 28

NO! we say associated

Answer 29

Split the population by %pregnant and %not among those who take the test, then split each of those by what the test says. Then look at just the groups that the test said were pregnant. Then find: %pregnant/(total percent in both groups).
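The tree-diagram recipe can be sketched as a small function. All three input percentages below are made-up numbers for illustration, and the function name is hypothetical.

```python
def prob_condition_given_positive(p_condition, p_pos_if_condition, p_pos_if_not):
    """Split the population first, then split each group by the test outcome."""
    # Branch 1: has the condition AND the test says positive
    true_positive = p_condition * p_pos_if_condition
    # Branch 2: does not have the condition AND the test says positive
    false_positive = (1 - p_condition) * p_pos_if_not
    # Look only at the "test said positive" groups
    return true_positive / (true_positive + false_positive)


# Hypothetical numbers: 30% of test takers are pregnant, the test catches
# 98% of pregnancies, and it wrongly says "pregnant" 5% of the time.
print(prob_condition_given_positive(0.30, 0.98, 0.05))
```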

Answer 30

mean: y+b sd: SQRT(z²+c²) var: z²+c²

Answer 31

exactly x successes in K trials. What is likelihood of exactly 3 heads out of 13 flips? binopdf(13, .5, 3)
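For checking calculator output by hand, here is a minimal Python sketch of the same calculation. The helper name `binopdf` just mirrors the shorthand on the card. It is a plain function written for this example, not a calculator or library API.

```python
from math import comb


def binopdf(n, p, x):
    """P(exactly x successes in n trials), each with success probability p."""
    return comb(n, x) * p**x * (1 - p) ** (n - x)


# Exactly 3 heads out of 13 fair flips
print(binopdf(13, 0.5, 3))
```

The `comb(n, x)` factor counts how many different orders the x successes can land in. The rest is the probability of any one such order.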

Answer 32

A Simple Random Sample (SRS) is our standard. Every possible group of n individuals has an equal chance of being our sample. That's what makes it simple.

Answer 33

FIRST SUCCESS ON THIS ATTEMPT geopdf (p,x) probability of FIRST SUCCESS being ON the Xth trial

Answer 34

6 or more same as not 5 or less 1- (5 or less) 1 - binocdf(12, p, 5)
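The complement trick can be sketched in Python. The helper names are hypothetical and mirror the card's shorthand, and p = 0.5 is an assumed value used only to have a number to run.

```python
from math import comb


def binocdf(n, p, x):
    """P(x or fewer successes in n trials): add up exactly 0, 1, ..., x."""
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(x + 1))


def prob_at_least(n, p, x):
    """P(x or more) = 1 - P(x - 1 or fewer)."""
    return 1 - binocdf(n, p, x - 1)


p = 0.5  # assumed success probability, for illustration only
print(prob_at_least(12, p, 6))  # 6 or more = 1 - binocdf(12, p, 5)
print(1 - binocdf(12, p, 4))    # more than 4 = 1 - binocdf(12, p, 4)
```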

Answer 35

(more than 4) not 4 or less 1-(4 or less) 1 - binocdf(12, p, 4)

Answer 36

A retrospective study is a study that looks backwards in time. They focus on estimating differences between groups or variable association because they are not based on random samples.

Answer 37

FIRST, Make a key to explain what the digits represent, whether you will use single, double or triple digits at a time and which, if any will be ignored. SECOND.. Decide when a trial will end (after 12 events, or after 12 successes), THIRD.. Make sure to clearly label the successes and where the trials end. FOURTH: KNOW YOUR RESPONSE VARIABLE, like, how many successes in n trials, or how long til n successes...

Answer 38

You want to control as much of the environment as possible so all subjects have a similar experience except for the treatments given. Control the factors in the experiment for each trial: keep them constant if you believe they would affect the outcome of the experiment. Also, having a group that is not getting treatment helps to control because it measures the effects of the natural environment.

Answer 39

SQRT(m² + p² + q² + r²)

Answer 40

Activity level could confound results in an antiperspirant deodorant experiment. If only active people used brand X and sedentary people used brand Y, Y might look like it was effective, but the people didn't sweat because they were sedentary. You would want to block according to lifestyle so that some active people used both types and some sedentary people used both types. Sunlight and seat usage could be confounding variables for a leather preserver. If you randomly choose from all chairs in an airport for treatment and brand A randomly has a lot of chairs near the sun, brand B randomly gets a lot of chairs near the main entrance, and brand C randomly gets the chairs that don't have a lot of sun or a lot of use, you may think that brand C works the best, when in fact the results were confounded by sunlight and usage.

Answer 41

mean: y+y+y sd SQRT(z²+z²+z²) .... var (z²+z²+z²)

Answer 42

Not much. In an experiment, we are not looking for a sample that is like the population. We just want to see the effectiveness of a treatment. It is fine if the subjects are all similar. In fact it is best sometimes when they are!

Answer 43

1. Playing video games and gender (knowing male makes it more likely they play) 2. Whether it is snowing and the month you are in (some months are more snowy than others, so knowing the month changes the likelihood of snowing) 3. If a pet is a dog and if it is a cat (knowing it is a dog makes it certain that it is not a cat). (Notice, knowing one bit of information changes the likelihood of the other being true also.)

Answer 44

use only the digits 1-6, ignore 0, 7, 8, 9

Answer 45

same as 4 or less binocdf(12, p, 4)

Answer 46

Those who volunteer may not be like the rest of the population. An example: you're trying to find out how often people volunteer for things, so you ask for volunteers to take the survey. A question may be "When was the last time you volunteered for something?" Well, they all just volunteered for the survey!

Answer 47

7 choose 3. 7!/(3! × 4!) Notice that the two factorials on the bottom add to the top.

Answer 48

The expected value. sum of probs times values You can use calculator to find 1 var stats L₁, L₂

Answer 49

probability A times probability B (knowing A is true), called the "general multiplication rule": P(A) × P(B given A), or P(this) × P(that given this)
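A small worked case of the general multiplication rule. The scenario (drawing two aces without replacement from a standard deck) is an assumption picked for illustration.

```python
from fractions import Fraction

# P(first card is an ace): 4 aces out of 52 cards
p_first_ace = Fraction(4, 52)

# P(second card is an ace GIVEN the first was an ace): 3 aces left in 51 cards
p_second_ace_given_first = Fraction(3, 51)

# General multiplication rule: P(A) * P(B given A)
p_both_aces = p_first_ace * p_second_ace_given_first
print(p_both_aces)  # 1/221

# Pretending the draws are independent gives a different (wrong) answer
p_if_independent = Fraction(4, 52) * Fraction(4, 52)
print(p_if_independent)  # 1/169
```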

Answer 50

In clustering you can grab one or two clusters. Clustering is when you choose at random a group from the population that looks like the population. In stratifying you must take a few from every stratum to get a representative sample. Stratifying is slicing a population into homogeneous groups (strata), then randomly sampling within each stratum before the results are combined.

Answer 51

Systematic sampling means picking every Nth one of what you are sampling (for example, people). You must still start on a random person, and from then on take every Nth person. So you can take every 10th person in a line for a survey, as long as you also start on a random individual.
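A minimal sketch of systematic sampling. The function name and the line of 100 people are hypothetical. The random start is chosen among the first N positions, which is one common way to do it.

```python
import random


def systematic_sample(population, step, rng):
    """Start on a random individual among the first `step`, then take every step-th."""
    start = rng.randrange(step)  # random starting position: 0 .. step-1
    return population[start::step]


line = list(range(1, 101))  # hypothetical line of 100 people, numbered 1-100
print(systematic_sample(line, 10, random.Random()))
```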

Answer 52

4! = 4 × 3 × 2 × 1 = 24 ways

Answer 53

I was curious about a population parameter, but a census was too costly, so I collected data for a sample, calculated a statistic and used that to make an inference about the parameter of interest.

Answer 54

To avoid bias. An experimenter might want their treatment to work, so they may choose the subjects that might respond best to show how great it is, when in fact, IT'S NO GOOD.

Answer 55

When people know they are getting a treatment, they may feel better even if the treatment doesn't work. Their previous experience with the brand might bias their reporting or something..

Answer 56

mean: 3y sd: 3z var: 9z² (same as (3z)²)

Answer 57

RANDINT( lowest, highest, how many you want to grab)

Answer 58

FIRST.. GEO not on the 4th or before 1-(fourth or before) 1 - geocdf(p, 4)

Answer 59

Use single digits on a random number table. Each digit represents a student on the bus. Ignore the zeros. Let 1 be a psych major, and 2 through 9 be other students. A trial ends when you have reached 30 students. Count the number of psych majors (ones) in the trial. Record this. Do this 20 times. Find the percent of times there were 4 or more psych majors on the "bus." If this occurred in 5 trials, then the likelihood is 5 in 20, or 25%.
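The same simulation can be sketched with a random number generator standing in for the random number table. The function names are hypothetical. Drawing digits 1 through 9 directly has the same effect as ignoring the zeros.

```python
import random


def bus_has_enough_psych_majors(rng, students=30, threshold=4):
    """One trial: 30 digits from 1-9, where a 1 represents a psych major."""
    psych_majors = sum(1 for _ in range(students) if rng.randint(1, 9) == 1)
    return psych_majors >= threshold


def estimate_likelihood(trials, rng):
    """Percent of trials (as a fraction) with 4 or more psych majors."""
    hits = sum(bus_has_enough_psych_majors(rng) for _ in range(trials))
    return hits / trials


rng = random.Random()
print(estimate_likelihood(20, rng))    # 20 trials, like the card: a rough estimate
print(estimate_likelihood(5000, rng))  # many more trials: a steadier estimate
```

With only 20 trials the answer bounces around from run to run. More trials settle it down.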

Answer 60

BINOMIAL P binocdf(12, p, 5)

Answer 61

The two are similar because they divide the subjects into groups that have similar traits.

Answer 62

Add or subtract the means, and then ADD THE VARIANCES.

Answer 63

Single blinding is when all individuals in either one of the classes are blinded; double blinding is when everyone in BOTH classes is blinded. Classes are: subjects, and treatment givers and evaluators. Note: you can't blind a tomato plant, so blind the fertilizer guy.

Answer 64

Retrospective. You could take all of the SAT Math and Verbal scores, run a regression, and find the r-squared value and linear model. This would be a Retrospective Study.

Answer 65

binopdf (12,p,5)

Answer 66

Blocking is in an experiment, when you want to tease out a possible confounding variable. stratifying is in sampling when you want to make sure to get units with a specific characteristic so your sample is representative of population.

Answer 67

The mean of the random variable. What you'd AVERAGE if you played the game A LOT!!!!!!!!!

Answer 68

NEITHER!! you always just add variances. Square the st devs, add them, then take sqrt.

Answer 69

X OR FEWER successes in N tries (cumulative) n: total number of tries p: probability of success x: # of successes binocdf(n,p,x)

Answer 70

MUTUALLY EXCLUSIVE They can't both happen at the same time! (being over 5 feet and under 4 feet)

Answer 71

It reduces bias in an experiment. If subjects don't know what treatment they're receiving, they won't change their habits based on that knowledge. If evaluators don't know which treatment each subject is receiving, they won't bias the true results based on the results they expect to see.

Answer 72

Make a table, put values in L1 and probabilities in L2, and run "1-var stats L1,L2" and you get it!
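What the calculator is doing with the two lists can be sketched by hand. The function name and the three-outcome game (values and probabilities) are made up for illustration.

```python
import math


def mean_and_sd(values, probs):
    """Expected value = sum of (value * probability); SD comes from the variance."""
    if not math.isclose(sum(probs), 1.0):
        raise ValueError("probabilities must add to 1")
    mean = sum(v * p for v, p in zip(values, probs))
    variance = sum(p * (v - mean) ** 2 for v, p in zip(values, probs))
    return mean, math.sqrt(variance)


# Hypothetical game: lose $1 half the time, break even 30%, win $5 20%
values = [-1, 0, 5]         # what would go in L1
probs = [0.5, 0.3, 0.2]     # what would go in L2
print(mean_and_sd(values, probs))
```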

Answer 73

q q q q q q p (q^6 × p). (This is a GEO prob.)

Answer 74

independent events

Answer 75

independent

Answer 76

At least 20-30.

Answer 77

NO.. If they are disjoint then knowing one tells you that the other couldn't happen, so it does impact the likelihood of the other, so they are always NOT INDEPENDENT. DISJOINT EVENTS ARE ALWAYS ASSOCIATED!!

Answer 78

1/p, or 1/.30, which is 3.333, so around the 3rd or 4th try. 1/p tells you, on average, when the first success will occur. 1/p is the mean of the geometric distribution.

Answer 79

EXACTLY X successes in N tries n: total number of tries p: prob of success x: number of successes binopdf(n,p,x) .Probability of exactly X successes in N trials. (PARTICULAR probability)

Answer 80

Humans who are experimented on are commonly called subjects in an experiment. Subjects like dogs, days, plants and anything not human are called Experimental Units

Answer 81

Multiplying P(this) × P(that) works when independent only. When there is an association, P(that) should be P(that | this), so it looks like this: P(this) × P(that given this)

Answer 82

Stratified: you grab a bit from each stratum. You divide the population up into groups according to traits, called strata (groups with similar traits, homogeneous groups), and randomly choose from each stratum to get a representative sample. Cluster: grab a cluster or two. Each cluster should be like the population. You don't need to take a little from each cluster; they are already representative.

Answer 83

Matching (a type of blocking) reduces unwanted variation. In a retrospective or prospective study, subjects who are similar in ways not under study may be matched and then compared with each other on the variables of interest.

Answer 84

assign random number then sort low to high and start with bottom.. or throw names in hat and pick.

Answer 85

Undercoverage is when either one part of the population is not included in a survey or is underrepresented in the survey

Answer 86

When we use chance to select a sample. You MUST use some real randomness ex: dice, cards, randint, number table

Answer 87

associated

Answer 88

When the percent is not a multiple of ten, like "18% of dogs eat underwear." You'll have to assign 01-18, or 00-17, as undie-eating dogs.

Answer 89

You don't have to. But you might want to if you feel that a simple random sample might not be representative of the population. You want your sample to be like the population: a representative sample (it represents the population well).

Answer 90

A sample that combines several sampling methods, like stratifying then clustering...

Answer 91

When an observed difference is too large for us to believe that it is likely to have occurred naturally (or just randomly). Basically it is Statistically Significant when we don't think it happened randomly. We use 5% as a threshold. If it was less than 5% likely to happen, then that is significantish.

Answer 92

SRS, stratified, clustered, systematic, multistage, convenience, voluntary

Answer 93

n choose k. It tells you how many ways you can choose k objects from a set of n things. The formula is n!/(k!(n-k)!). The two numbers on the bottom add to the number up top. These are coefficients in expanded binomials and can also be found in Pascal's Triangle.
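A short check of the formula n!/(k!(n-k)!) against Python's built-in `math.comb`. The helper name `choose` is hypothetical. The last lines rebuild one row of Pascal's Triangle from it.

```python
from math import comb, factorial


def choose(n, k):
    """n choose k from the factorial formula: n! / (k! * (n-k)!)."""
    return factorial(n) // (factorial(k) * factorial(n - k))


print(choose(7, 3), comb(7, 3))  # both are 35

# Row 3 of Pascal's Triangle = coefficients of (x+y)^3
row = [choose(3, k) for k in range(4)]
print(row)  # [1, 3, 3, 1]
```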

Answer 94

An alternative to an experiment could be an observational study. There are two forms: prospective and retrospective. A prospective observational study is when you identify subjects in advance and record data as you go along. A retrospective observational study is when you analyze observations from the past.

Answer 95

Sample size. A sample of 150 will say as much about a population of 2,000 as it will about a population of 2,000,000. the percent of the population isn't what matters. The sample size determines level of confidence and interval widths..

Answer 96

IT IS NOT A MISTAKE!!! Because the data in samples are generally different, the statistics calculated from one sample to another vary and are generally not equal to the parameter. This variability of the STATISTICS is called sampling error (not the variability of the data).

Answer 97

FIRST SUCCESS ON OR BEFORE p: probability of success x: xth try geocdf(p,x). Probability of the FIRST SUCCESS being ON OR BEFORE the Xth trial.
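The geometric calculations can be sketched in Python for checking by hand. The helper names `geopdf` and `geocdf` mirror the card shorthand and are plain functions written for this example. The value p = 0.3 is an assumed number.

```python
def geopdf(p, x):
    """P(first success is ON trial x): x-1 failures (q each), then one success."""
    return (1 - p) ** (x - 1) * p


def geocdf(p, x):
    """P(first success is ON OR BEFORE trial x) = 1 - P(x failures in a row)."""
    return 1 - (1 - p) ** x


p = 0.3  # assumed success probability, for illustration only
print(geopdf(p, 5))      # q q q q p
print(geocdf(p, 4))      # first success on or before the 4th trial
print(1 - geocdf(p, 4))  # first success AFTER the 4th trial
```

Adding up `geopdf` for trials 1 through x gives the same number as `geocdf(p, x)`.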

Answer 98

Basically a test based on reality with a sequence of random outcomes that model it. Like an imitation.

Answer 99

A retrospective study takes a group and looks back at its history while a prospective study watches a group for a period of time and records the data into the future. RETRO-REVERSE, PROspective- PResent and On..

Answer 100

mean: 3y+5b+12 sd: sqrt (9z² +25c²) var 9z²+25c² same as (3z)² + (5c)²

Answer 101

It means that the sample statistics will be kind of like the population parameters.. The sample "looks like" the population.

Answer 102

In experiments you don't need a representative sample of the population, you can have volunteers, convenient subjects and that is OK. You are looking at impact of treatment, not at getting a representative sample. When you use one of the sampling methods, you want a sample that looks like the population so you can make an inference about the population.

Answer 103

Run a simulation. Find the percent of trials that you observed the event occur.

Answer 104

probability A plus probability B minus the double counted (the ones that are both A and B) called "general addition rule" P(A)+P(B)-P(A and B) P(this)+P(that)-P(this and that)

Answer 105

Response bias is when the person's response is influenced by the question or questioning method (like if a parent asks if you use drugs, as opposed to a friend... there is only one true answer, but one might respond differently to each of them). Nonresponse bias is when the people who don't respond might have different opinions/views than the people who did.

Answer 106

Placebo is used for control in an experiment. It lets you know how factors other than the treatment impact the subjects. the purpose of placebo is to determine the change between the controlled treatment and the other treatments

Answer 107

People at the restaurant are probably there because they already like it. If you asked the question "Is this your first time dining here?" and if they say "yes" you survey them, that would be a better method. But then again.. the people wouldn't go into an Italian restaurant if they didn't like that type of food.

Answer 108

It depends. Generally, it is better to do a sample, since a census is expensive to execute, and because populations are always changing it is hardly more accurate than a sample. BUT, for small populations, a census is fine. Ordering sandwiches for your family? Do a census.

Answer 109

it is about FIRST SUCCESS What is likelihood first success is on 5th trial? q q q q p

Answer 110

1. To simulate the likelihood of an event occurring. (ch 11) 2. To choose a sample that is representative of the population and avoid bias.(Ch 12) 3. To assign subjects (experimental units) to treatments to evenly distribute variability and help reduce possible confounding variables.(Ch 13)

Answer 111

Subjects. Those delivering treatments. Those assessing effectiveness of treatments. and three mice.

Answer 112

st dev of combined model is: sqrt(st dev squared + st dev squared) or more if you combine more

Answer 113

In all of them, all members of population have equal chance of being selected. So.. individuals have equal chance in them all, but there are impossible sample groups for some.

Answer 114

same as disjoint

Answer 115

A level is a specific value that the experimenter chose for a factor that is manipulated. Ex.: the factor is sleep; the levels would be how many hours the subjects were allowed to sleep. 4 hours, 6 hours, 8 hours: 3 levels.

Answer 116

ADD P(this) + P(that) works when disjoint only, when not, subtract overlap.

Answer 117

1. A card being a CLUB and a RED 2. A student being a SENIOR and a FRESHMAN 3. An animal being a CAT and a GOLDFISH(both can't be true)

Answer 118

When the percent is a multiple of ten, like "30% of teachers secretly twerk", then you would assign 1-3 or 0-2 as twerking teachers.

Answer 119

To find probability of x successes in K trials.. BINOMIAL BABY!!!

Answer 120

When those who get the placebo show improvements, or show the effects of the treatment. This often happens to up to 20% of participants!

Answer 121

Researchers like to see results, they want to see an effect. If they know which treatment is the actual medicine, then they might be "looking" for it.. We want the data to say it works, not the person.

Answer 122

Deodorant: if you just randomly assign it, maybe the active people get deodorant X and the non-active get Y. The results would be confounded by lifestyle. Was it deodorant Y or the fact that the people didn't sweat all day? You want people in each group to get both deodorants. Leather preserver: if you randomly choose from all chairs in an airport for treatment and brand A randomly has a lot of chairs near the sun, brand B randomly gets a lot of chairs near the main entrance, and brand C randomly gets the chairs that don't have a lot of sun or a lot of use, you may think that brand C works the best, when in fact the results were confounded by sunlight and usage.

Answer 123

Confounding is with experiments, it is the thing that may be causing the different effects instead of the treatment (sunlight instead of leather preserver). Lurking is with regression, it is when something is causing things to go up and down together like how the weather impacts ice cream sales and beach injuries (rise and fall when more people are at the beach).

(150 cards)