UNIT 3 SAMPLING AND EXPERIMENTS AND STUFF Flashcards

1
Q

How can you estimate the probability of an event occurring?

A

run a simulation. Find the percent of trials that you observed the event occur.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How many trials should you run to have an accurate simulation?

A

At least 20-30.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Is it always better to do a census or a sample?

A

It depends. generally, it is better to do a sample since a census is expensive to execute, and because populataions are always changing it is hardly more accurate then a sample. BUT?. For small populations, a census is fine.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

To make a survey to tell of a restaurant is good, would you ask the people coming out of the restaurant?

A

People at the restaurant are probably there because they already like it. If you asked the question “Is this your first time dining here?” and if they say “yes” you survey them, that would be a better method. But then again.. the people wouldn’t go into an Italian restaurant if they didn’t like that type of food.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are humans bad at ?

A

Humans are bad at generating random numbers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How can you “randomly ask 5 people” in the hall a question?

A

You could roll dice, and if it says 5.. Then the fifth person to walk by you interview, then roll again.. Repeating this is random, but remember that the sampling frame is just the people who walk down that particular hallway?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Sampling frame.

A

The group who you may sample from. If you do a phone survey then your sampling frame is only people with phones. If you are interested in everyone, a phone survey could suffer from undercoverage?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the 3 ways we used random numbers?

A
  1. To simulate the likelihood of an event occurring. (ch 11) 2. To choose a sample that is representative of the population and avoid bias.(Ch 12) 3. To assign subjects (experimental units) to treatments to evenly distribute variability and help reduce possible confounding variables.(Ch 13)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what is a simulation?

A

Basically a test based on reality with a sequence of random outcomes that model it. Like an imitation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

when does a trial of a simulation end?

A

Generally there are two cases:1. You want to know the probability of having x successes in n attempts (getting 3 smokers in a group of 5 students). Trials end when you get to n (get to 5 students). You record the number of smokers for each trial.2. You want to know how many attempts it takes to get f successes. Trials end when you get f successes. Record the number of attempts.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

You want to simulate the likelihood of more than 4 psychology majors being on a full bus that seats 30. 1 in 9 students are psych majors.

A

use single digits on a random number table. Each digit represents a student on the bus. Ignore the zeros. Let 1 be a psych major, and 2 through 9 be other students. Trials end when you have reached 30 students. Count the number of psych majors (ones) in the trial. Record this. Do this 20 times. Find the percent of times there were 4 or more psych majors on the “bus.” If this occured in 5 trials.. then the likelihood is 5 in 20, or 25%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do you use a table of random digits?

A

FIRST.. Make a key showing what the digits represent, whether you will use single, double or triple digits, and which, if any will be ignored? SECOND.. Decide when a trial will end (after 12 events, or after 12 successes), THIRD.. Make sure to clearly label the successes and where the trials end.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When would you use two digits instead of a single on a random number table?

A

When the percent is not a multiple of ten.. Like “18% ofdogs eat underwear”.. You’ll have to assign 01-18, or 00-17 as undie eating dogs. All other digit pairs will be non-underwear eating dogs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When can you use single digits for simulations?

A

When the percent is a multiple of ten, like “30% of teachers secretly twerk”, then you would assign 1-3 or 0-2 as twerking teachers. Or to simulate rolling dice (1-6 faces, ignore 0, 7, 8, 9), or flipping coin (odds H, evens T)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Use the following words in one run on sentence: inference, sample, statistic, parameter, population, census, data

A

I was curious about a population parameter, but a census was too costly, so I collected data from a sample, calculated a statistic and used that to make an inference about the parameter of interest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is random sampling?

A

When we use chance to select a sample, when you use an actual randomizing mechanism.. Not your “random” guess! When you “randomly” do something in your head, it is not random. roll dice, shuffle cards, pick from a hat or use a random number table or calculator

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How can you simulate a coin flip with random number table?

A

Assign heads to odd numbers and tails to even numbers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

How can you simulate rolling 1 die with a random number table

A

use only the digits 1-6, ignore 0, 7, 8, 9

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

How can you simulate on your calculator

A

RANDINT( lowest, highest, how many you want to grab)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Samplin Method Types?

A

SRS, stratified, clustered, systematic, multistage, convenience, voluntary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What are the two types of observatinal studies?

A

Retrospective, and Prospective

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What do observational studies and experiments have in common?

A

In both, you are making OBSERVATIONS.. recording data… doing statistical analysis…

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is a mutlistage sample?

A

A sample that combines several sampling methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is a quality of SRS that is not a quality of Systematic, Stratified or Clustering?

A

In an SRS, all groups are possible, and ALL POSSIBLE GROUPS have the same chance of being picked. The other methods have lots of “impossible groups” SRS has no impossible groups.-Stratified- an impossible group would be all girls (you’re taking some boys and girls)-Clustered- an impossible group would be all girls (each cluster has boys and girls)-systematic- an impossible group would be 4 people that are right next to eachothe (you are taking every nth person)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is a simple random sample?

A

A sample where every possible group has the same chance of becoming a part of a sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is difference between subject and experimental unit?

A

Humans who are experimented on are commonly called subjects in an experiment. Subjects like dogs, days, plants and anything not human are called Experimental Units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is prospective study?

A

Prosepctive study is when you study the experimental unit’s present and futrue response variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is response bias? How do you avoid it?

A

Response bias is any influence that may sway the respondent to give a more favorable answer e.g wording of the question, interviewer’s behavior/background. Therefore, in a survey, ask questions that allow respondents to answer comfortably and honestly. Keep the wording “indifferent” or neutral in some way in order to unduly favor one response over another.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What is retrospective study?

A

A retrospective study is a study that looks backwards in time. They focus on estimating differences between groups or variable association because they are not based on random samples.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What is sampling error?

A

IT IS NOT A MISTAKE!!!… Because the data in samples are generally different, the statistics calculated from one sample to another vary and are generally not equal to the parameter. This variablilty of the STATISTICS is called sampling error. (not the variability of the data).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is sample size and how does it compare with the fraction of a population?

A

Sample size is the number of individuals in a sample. The sample size determines how well the sample represents the population, not a fraction of the population sampled. The fraction of the population that you’ve sampled doesnt matter. Its the sample size its self thats most important.

32
Q

What is statistically significant?

A

When an observed difference is too large for us to believe that it is likely to have occurred naturally (or just randomly). Basically it is Statistically Significant when we don’t think it happened randomly

33
Q

What is systematic sampling?

A

Systematic Sampling is one of four different ways to make a survery sample random. Systematic sampling includes picking every Nth number of what you are sampling (for example people.). You must still start on a random person and then from then on take every Nth person. So you can take every 10th person in a line in order to take a survey as long as you also start on a random individual.

34
Q

What is the difference between a cluster sample and random sample?

A

A cluster sample is when the population is first divided into sections of clusters that have traits similar to the population (the clusters are heterogeneous and have all types within them). Then we randomly select an entire cluster or clusters, and include all of the members of the clusters in the sample. As for random sample is when each member of the population, and each possible group is equally likely to be included.

35
Q

What is the difference between response bias and nonresponse bias?

A

Response is when the person’s response is influenced by the question or questioning method (like if a parent asks if you use drugs, as opposed to a friend… there is only one answer to this, but one might respond differently to them), non response is is when the people who don’t respond might have different opinions/views than the people who did.

36
Q

What is the problem with convenient sampling?

A

The sample may not be representative as it is not randomized to include every type of person. E.G Friends and family are convenient but they likely share similar opinions and thus the sample is not representative of a population.

37
Q

What is the standard sampling method?

A

A Simple Random Sample (SRS) is our standard. Every possible group of n individuals has an equal chance of being our sample. That’s what makes it simple.

38
Q

What is undercoverage?

A

Undercoverage is when either one part of the population is not included in a survey or is underrepresented in the survey

39
Q

What is wrong with using volunteers in a survey?

A

Those who volunteer may not be like the rest of the population. An example may be, if you’re trying to find our how often people volunteer for things. So you ask for volunteers to take the survey…. A question may be “when was the last time you volunteered for something?” Well. they all just volunteered for the survey!

40
Q

What is wrong with using voluteers in an experiment?

A

Not much. In an experiment, we are not looking for a sample that is like the population… We just want to see the effectiveness of a treatment. It is fine if the subjects are all similar. In fact it is best sometimes when they are!

41
Q

What type of study would find relationship beween Verbal and Math SAT?

A

You could take all of the SAT Math and Verbal scores and run a regression and find the r-quared value and linear model. This would be a Retrospective Study.

42
Q

What’s the difference between a prospective and a retrospective study?

A

A retrospective study takes a group and looks back at its history while a prospective study watches a group for a period of time and records the data. RETRO-REVERSE, PROspective- PResent and On..

43
Q

What’s the difference between cluster and stratified?

A

Stratified- you divide the population up into groups according to traits, called strata (groups with similar traits- homogeneous groups) and randomly choose from each strata.
Cluster- grab clusters of the population.. each cluster should be like the population.

44
Q

What’s the difference between lurking and confounding?

A

Lurking varibles, on one hand, infer the assoiation between the two varibles; confounding variables, on the other hand, make it unclear which variable has had an impact on which in an experiment.

45
Q

what’s the difference between response bias and nonresponse bias?

A

response bias is anything in a survey design that influences responses falls under the heading of response bias (wording of questions). Nonresponse bias is bias introduced to a sample when a large fraction of those sampled fails to respond.those who respond are likely to not represent the entire sample. Will you please take a survey? .. NO !

46
Q

Why do you have to Stratify?

A

You don’t have to.. But you might want to if you feel that a simple random sample might not be representative of the population . You want your sample to be like the population.. a representative sample (it represents the population well).

47
Q

How are voluntary and convenience samples similar?

A

With voluntary, people choose them selves, with covenience, the people are just chosen by researcher, neither uses randomness and both are prone to BIAS.

48
Q

How can the WORDING of the question lead to response bias

A

Words or phrases that impact your feelings tend to influence responses. Look for “devastating, horrific, wonderful? etc.” Sometimes there is a background story like “Many americans lose jobs to illegal aliens every year?? “

49
Q

Can you stratify in an experiment?

A

NO. stratification is a sampling method, blocking is method used in experiments. They are similar ideas.

50
Q

explain CONTROL

A

one of the principles would be the control, which are the factors that the experimentors keep constant in each trial because they believe it would effect the outcome of the experiment. Also having a group that is not getting treatment helps to control because it measures the effects of the natural environment.

51
Q

Explain two types of experimental design.

A

1.)Randomized Block Design: randomization occurs within the blocks only. 2.) Completely Randomized Design: all of the experimental units have the same chance at recieving a treatment.

52
Q

How is Blocking in an Experiment Similar to Stratefying in a Sample?

A

The two are similar because they divide the subjects into homogenous groups where the subjects are all similar

53
Q

What is common mistake when using the term BLOCKING?

A

Students often will report that they “blocked according to exercise” or “blocked according to type of fertilizer”.. These things are treatments.. We don’t block by treatments. We block by things that are ALREADY PRESENT BEFORE WE BEGIN EXPERIMENT.. Like by gender, or dog type or how close the plants are to a window.

54
Q

How is clustering and stratifying different when doing a sample?

A

Clustering is when chosen at random a group from the population that looks like the population, clusters should be heterogenous. While Stratifying is slicing a population into homogeneous groups(strata). Then randomly sample within each stratum before the results are combined.

55
Q

What four things do you need in an experimental design? (trick)

A

NEED only 3: control , randomization, replication.. BUT? Use blocking when appropriate

56
Q

What is a control group?

A

A group in an experiment without the treatment that is compared to groups with treatments to make results or conclusions. The control group helps us see what would happen anyway… without any treatment so that we can see the true effect of the treatment.

57
Q

What is a factor?

A

A variable in an experiment that the experimenter manipulates. (factors have levels.. )

58
Q

What is a level in an experiment?

A

A level is a specific value(s) that the experimenter chose for a factor that is manipulated.ex. Factor is sleep, level(s) would be how many hours the subjects were aloud to sleep.. 4 hours, 6 hours, 8 hours.. 3 levels

59
Q

What is bias?What are some common errors?

A

It’s any systematic failure of a sampling method. COMMON ERRORS: Voluntary response, undercoverage of the population, nonresponse bias and response bias. We use randomness and methods like stratifying to reduce these.

60
Q

What is Placebo used for?

A

Placebo is used for control in an experiment. the purpose of placebo is to determine the change between the controlled treatment and the other treatments

61
Q

what is the best way to reduce bias?

A

randomness. sophisticated answer: make as many things as random as possible

62
Q

What is the difference between a study and an experiment?

A

In a study you are basically just watching and in an experiment you are manipulating factors and (hopefully randomly) assigning treatments

63
Q

What is the difference between confounding and lurking?

A

Confounding is to experiment, we may think a treatment works when it was really the environment (like sunlight on plant growth…. we then block by proximity to window. to remove that confounding variable). .Lurking is to sample, y and x makes it appear that x may be causing y, like ice cream sales and surfing accidents.

64
Q

What is the difference between subjects in experiments and subjects in sample surveys?

A

Samples for surveys try to represent the entire population of interest and often experimental units are all the same type of tomato because we want to just look at impact of treatments.

65
Q

What is the difference between single-blind and double blind?

A

Single blinding is when all individuals in either one of the classes are blinded; double-blinded is when everyone in BOTH classes are blinded. Classes are: subjects, treatment givers, evaluators?

66
Q

What are the two blinding groups?

A

Group one: subjects and the people giving subjects the treatment.
Group two: those people assessing the groups to compare results.

67
Q

What is the main purpose of a placebo ?

A

To blind the subject that is being experimented on to avoid influence to the given variable therefore altering the response variable . When people think they’re getting help, they often improve anyway..

68
Q

What is the placebo effect?

A

When those who get the placebo show improvements, or show the effects of the treatment. This often happens to up 20% of participants!

69
Q

What is the purpose of matching?

A

Matching, like blocking, reduces unwanted variation. In a retrospective or prospective study, subjects who are similar in ways not under study may be matched and then compared with each other on the variables of intrest.

70
Q

What is the sure way to assign treatments correctly?

A

throw names in hat and pick

71
Q

What’s a useful alternative when you can’t run an experiment? What are they useful forms of this, and how do you preform them respectively?

A

An alternative of an experiments could be an observational study. A prospective observational study is when you identify subjects in advance and record data as you go along. A retrospective observational study is when you analyze observations from the past.

72
Q

Who can be blinded?

A

Subjects and Those delivering treatments. Those assessing effectiveness of treatments. and three mice.

73
Q

Why do you have to block?

A

You don’t have to.. But you might want to if you feel that the experimental units (subjects) may respond differently to the treatment.

74
Q

Why does it make sense to double-blind an experiment?

A

It reduces bias in an experiment. If subjects don’t know what treatment they’re receiving, they won’t change their habits based on that knowledge. If evaluators don’t know which treatment each subject is receiving, they won’t bias the true results based on the results they expect to see

75
Q

Why randomize in an experiment?

A

To avoid bias. An experimenter might want their treatment to work, so may chose the subjects that might respond best.

76
Q

what is completely randomized?

A

all subjects names in a hat and pick

77
Q

what is randomized block

A

separate subjects into blocks (cats here.. Dogs here? rabbits here..) then put dog names in dog hat and choose for treatments.. And same with others, therefore each block will get all of the treatments.