QUIZZES after midterm1 Flashcards
Which of the following is NOT one of the assumptions of hypothesis testing
a) The independent variable is measured on an interval scale
b) Participants are randomly selected from the population
c) The dependent variable is measured on an interval scale
d) The population distribution is approximately normal
A)
Only the dependent variable is required to be on a scale variable to meet our assumptions.
A distribution of z scores has a known mean and standard deviation. What are they?
a) Mean = 0, SD = 1
b) Mean = 100, SD = 15
c) Mean = 50, SD = 1
d) Mean = 1, SD = 0
A)
For our standard z distribution, our mean is always zero and our standard deviation is always 1. In this way, we can use our z distribution to look at z scores.
THESE are inferential statistical analyses based on a set of assumptions about the population.
a) nonparametric tests
b) parametric tests
c) standardized tests
d) nonstandardized tests
B)
A researcher hypothesizes that there is a significant relationship between stress and fatigue. Specifically, he hypothesizes that as stress increases, fatigue levels will also increase. This example best illustrates what type of hypothesis test?
a) null hypothesis test
b) one-tailed test
c)nondirectional test
d)two-tailed test
B)
The critical values associated with a p level of 0.05 for a two-tailed hypothesis test using the z statistic are:
a) +1.96
b) -1.96 and + 1.96
c) -1.65 and +1.65
d) +1.65
B)
Since this value is BEYOND the critical p value of 1.96, we would reject the null hypothesis. (Recall that we never support the alternative hypothesis.)
See also Chapter 7 Step 6: Make a Decision
Using a p level of 0.05 and cutoff values of z = +1.96 and z = –1.96, what decision would we make if our calculated z statistic was 1.97?
a) fail to reject the null hypothesis
b) reject the null hypothesis
c) support the null hypothesis
d) support the alternative hypothesis
B)
The p level is the:
a) probability with which a test statistic would occur if the null hypothesis were true.
b) cutoff probability at which a test statistic is considered extreme.
c) probability with which a test statistic would occur if the research hypothesis were true.
d) probability with which a test statistic would occur if both the null and research hypothesis were false.
A)
Note that we are doing a test of the NULL hypothesis. Since we are using the NULL hypothesis (or in this case, our population mean and standard deviation) to develop our comparison distribution, the p value indicates how likely that value would be found if the null hypothesis were true. We usually use a 5% cut-off; thus, we only accept the value to be found 5% at chance. See also Chapter 7 The Six Steps of Hypothesis Testing.
How does the sampling distribution of means for a sample with N = 5 differ from that of a sample with N = 100?
a) Both sampling distributions will be identical; they will not differ.
b) The mean of the sampling distribution of means with N = 5 will be closer to the mean of the population than the mean of the sampling distribution with N = 100.
c) There is more variance in the sample distribution with N =5 than with N = 100.
d) There is less variance in the sampling distribution with N = 5 than with N = 100.
C)
This question goes back to the Central Limit Theorem discussed in Chapter 6. The more data points in your sample, the less variability there will be in the sampling distribution. This is because the formula for the standard error is:
The higher N is, the smaller the standard error will be. It’s math :)
A(n) _ is based on our sample statistic; it conveys the range of sample statistics we could expect if we conducted repeated hypothesis tests using samples from the same population.
a) coefficient of determination
b) point estimate
c) interval estimate
d) estimated standard error
C)
This is the definition for an interval estimate as described in the section on Chapter 8 Confidence Intervals. They generally add and subtract a value that is our margin of error (the size of which depends on how confident we want to be!).
Which two figures below would produce larger effect sizes?
C) & D)
Look at the overlap – more overlap, less effect size.
Less overlap, a larger effect size.
THIS is a measure of our ability to reject the null hypothesis given that the null hypothesis is false.
a) R squared
b) Cohen’s d
c) Statistical power
d) Eta squared
C)
Statistical power is used to calculate how likely we are to reject the null hypothesis give it is false. These calculations are completed before you complete a study to see if you have the resources available to get enough participants in your study to find a significant effect, if one exists. See Chapter 8: Statistical Power.
Imagine The California Verbal Learning Test has a national average of 14.68 items recalled with a standard deviation of 2.01 items. A researcher administers the test to 24 patients with early onset Alzheimer’s Disease. They score an average of 8.5 items recalled. What would be the standard error for the comparison distribution? (Round to two decimal places)
0.41
Imagine that a self-esteem scale has an average of 91.60 and a standard deviation of 14.76. What is the score of a person who obtains a z score of -0.47? (Round to two decimal places)
84.66
Imagine that you are a tennis player. You calculate the number of points you get each game in order to determine how much your points change from game to game. What measure should you calculate?
THE MODE
THE MEDIAN
THE MEAN
THE STANDARD DEVIATION
STANDARD DEVIATION
Luca would like to determine the probability a person at Langara has brown hair. They sit in the cafeteria and every time a person walks in, they record their hair colour. The hair colours recorded are: 24 blonde, 58 brown, and 17 red. Using the data, what is the probability someone at Langara will have brown hair (rounded to two decimal places)?
0.59
Two students take a math test at two different high schools. Tim scores 50. His class’s average was 40 with a standard deviation of 5. Mira scores 25. Her class’s average was 21 with a standard deviation of 2. Who scored higher on the test?
Tim scored higher than Mira
Mira scored higher than Tim
Tim and Mira scored equally high
We are unable to compare test scores from different tests
TIM AND MIRA SCORED EQUALLY HIGH
You have completed a study in which people were randomly assigned to consume either no caffeine, 100 mg of caffeine, or 200 mg of caffeine. You then measure their reaction time. What type of graph should you use to visually display this data?
A BAR GRAPH
A researcher is looking at SES (low, medium, and high). Which measure of central tendency should he use for this data?
Mode
Median
Mean
Range
MODE
A researcher collects some data on the relationship between introversion and academic success. The researcher finds that, contrary to their hypothesis, there was no relationship between introversion and CGPA. She does, however, find that there is a relationship between introversion and retention rates. Despite this not being her original focus, she publishes the finding as though this was the intent of the study. This is an example of:
HARKing
You and your friends decide to complete the Narcissism Personality Inventory. You find the following values: 11, 19, 12 and 24. What is the mean value? (Round to two decimal places)
16.5
Below is a histogram looking at the frequency of test scores. Which of the following statements is correct:
The mean is higher than the median
The mean, median, and mode are approximately the same value
The mean is lower than the median
The mode is higher than the mean
THE MEAN IS HIGHER THAN THE MEDIAN
You obtain the weights of all of the students in your data analysis class. The scores range from 43 to 95 kilograms. Based on the guidelines set out in this course, which of the following intervals in a frequency table makes the most sense for your data?
40 to 42.9 kg
50 to 69.9 kg
50 to 59.9 kg
40 to 69.9 kg
50 TO 59.9
A standardized test contains a mean of 25 with a standard deviation of 5. If Harry scores 30 on the test, what is his percentile?
84%
You develop a test that measures social anxiety. The test is scored from 0 to 120, with higher scores representing higher social anxiety. You give the test to a patient that has high social anxiety. The first week they score 110, the second week they score 112, and the third week they score 111. The results suggest that the test is:
RELIABLE
NEITHER RELIABLE NOR VALID
VALID
BOTH RELIABLE AND VALID
BOTH
You receive a dataset of memory performance with the following values: 6, 10, 8, 1, 16, 14. Calculate the Interquartile range (IQR).
8
You measure the number of hours a child watches violent television each week. You get the following values: 14, 3, 15, 7, 2. Calculate the variance. (Round to two decimal places)
29.36
Imagine this following image was created to demonstrate the change in home sales over time:
What ‘lie’ is represented by this image?
INACCURATE VALUES LIE
A researcher plans to explore whether the amount of marijuana a woman smokes during pregnancy is related to fetal weight. What type of study should the researcher use?
correlational study
within-groups experiment
between-groups experiment
double-blind experiment
CORRELATION STUDY
A provincial survey is given to every student in BC. The students report studying a mean of 11.00 hours per week with a standard deviation of 5.18. Min’s class of 38 students studies an average of 9.34 hours per week. What is the z statistic for Min’s class? (Round to two decimal places
-1.98
Past research has shown that words presented in red background are better remembered that words presented in a neutral background (Mehta & Zhu, 2009). A researcher would like to know if this finding extends to math performance. She places signs around campus asking participants to participate in her study for $10. Participants are assigned to complete math questions with either a red or neutral background. They researcher finds that those who read the questions with a red background scores an average of 25 questions correct, whereas those with the neutral background scored an average of 21 questions correct. The difference was not statistically significant.
Answer the following:
a. Explain how the type of sampling used in the study affects the generalizability of the study. (1 point)
b. Identify the control and experimental groups. (1 point)
c. What is the null hypothesis? (1 point)
d. What is the research hypothesis? (1 point)
e. What decision should the researcher make and why? (1 point)
f. Given your answer in e), what type of error might the researcher have made? Explain your answer. (1 point)
a) if ppl are paid, then it’s not random sampling. It’s an incentive to participate.
c) There is no statistical difference between the two backgrounds.
d) There is a diffeence in the background scores.
MY ANSWERS:
a) if ppl are paid, then it’s not random sampling. It’s an incentive to participate.
c) There is no statistical difference between the two backgrounds.
d) There is a diffeence in the background scores
CORRECT ANSWERS:
A researcher wishes to explore how stereotype threat can influence math performance in college women. 112 college students identifying as women are asked to complete a math test. The math test consists of 45 multiple choice questions. One group is placed in a testing room with 4 confederates who are all men. The other group is placed in a testing room with confederates, 2 of whom are men and 2 are women. The researcher finds that the average score on the test is 32 for the women who complete the test in the presence of men and 38 for the women who complete the test in the presence of men and women. A statistical analysis suggests that the women who completed the test in the presence of only men performed worse than those who completed the test in the presence of both men and women.
Answer the following:
a. Who is the population? Who is the sample? (1 point)
b. What are the independent and dependent variables? (1 point)
c. What are the levels of the independent variable? (1 point)
d. Identify whether the dependent variable is nominal, ordinal, interval or ratio and if it is continuous or discrete. Explain your answers. (2 points).
e. Is this a between- or within-group design? How do you know? (1 point)
f. Do you think random sampling was used? Explain your answer. (1 point)
g. Do you think random assignment was used? Explain your answer. (1 point)
h. Explain how both descriptive and inferential statistics are represented in this example. (2 points)
MY ANSWERS:
a) pop = all college women (who can do math, maybe?) sample = just the 112 women
b) indep = the confederates dep = score of the math test
c) 2 groups of confederates (one is all men, other is half/half)
d) ratio + continuous ( ratio has a true zero and the scores could have decimal points)
e) between groups. two groups assigned to two testing groups
f) not random as it was chosen from a group specifically identifying as women. should have been randomly chosen
g) there’s not enough data to say this fro certain
h) descriptive = describes the group who identify as women (112) / inferential = it would have to be applied to the population of all the women of the college.
CORRECT ANSWERS:
A researcher wishes to explore how stereotype threat can influence math performance in college women. 112 college students identifying as women are asked to complete a math test. The math test consists of 45 multiple choice questions. One group is placed in a testing room with 4 confederates who are all men. The other group is placed in a testing room with confederates, 2 of whom are men and 2 are women. The researcher finds that the average score on the test is 32 for the women who complete the test in the presence of men and 38 for the women who complete the test in the presence of men and women. A statistical analysis suggests that the women who completed the test in the presence of only men performed worse than those who completed the test in the presence of both men and women.
**Answer the following: **
a. Who is the population? Who is the sample? (1 point)
b. What are the independent and dependent variables? (1 point)
c. What are the levels of the independent variable? (1 point)
d. Identify whether the dependent variable is nominal, ordinal, interval or ratio and if it is continuous or discrete. Explain your answers. (2 points).
e. Is this a between- or within-group design? How do you know? (1 point)
f. Do you think random sampling was used? Explain your answer. (1 point)
g. Do you think random assignment was used? Explain your answer. (1 point)
h. Explain how both descriptive and inferential statistics are represented in this example. (2 points)
MY ANSWERS:
a) pop = all college women (who can do math, maybe?) sample = just the 112 women
b) indep = the confederates dep = score of the math test
c) 2 groups of confederates (one is all men, other is half/half)
d) ratio + continuous ( ratio has a true zero and the scores could have decimal points)
e) between groups. two groups assigned to two testing groups
f) not random as it was chosen from a group specifically identifying as women. should have been randomly chosen
g) there’s not enough data to say this fro certain
h) descriptive = describes the group who identify as women (112) / inferential = it would have to be applied to the population of all the women of the college.
TEACHER ANSWERS:
MY ANSWERS:
a) 17
b) 2/40 = 0.05 = 5%
c) 35.75
d) definitely negatively skewed – more distributed towards the right side of the graph (more towards the higher numbers)
CORRECT ANSWERS:
All of the grade 8 students across Vancouver are given a reading comprehension test. The mean score is 56 with a standard deviation of 12. Interestingly, the distribution of scores appears to be slightly negatively skewed. A teacher claims his 22 students outperformed the other classes in Vancouver due to her teaching methods. His class scored 58 on the test with a standard deviation of 8.
Answer the following:
a. Is the value 12 a statistic or a parameter? Explain your answer. (1 point)
b. If the teacher wished to compare his students to the average score, what type of distribution would he need to use? Explain your answer. (1 point)
c. What would be the mean of the distribution? How do you know? (1 point)
d. Do you think his comparison distribution would be normal, positively skewed or negatively skewed? Explain your answer. (1 point)
MY ANSWERS:
A) it’s a parameter as it describes the whole population of grade 8 students in Vancouver.
B) The average score would be the ‘mean’, so he would need to compare against the distribution of means graph.
C) the mean of the distribution would be the same as the whole population of Grade 8 students (56).
D) As in, his 1 singular classroom? It would be very slightly neg scored. The distribution of means should be normally distributed as there would be many, many scores to base it off.
CORRECT ANSWERS:
a. The value of 12 is a parameter (0.5), since it has been measured for every grade 8 student in the region (the population of interest) (0.5)
b. He would need to use a distribution of means, since he would like to compare the score of his 22 students (a mean) to an average score.
c. The mean would be 56. The CLT tells us the mean of the distribution = the mean of the population (1 point)
d. Normal. Since the sample size is 22, even if the underlying distribution is slightly skewed, due to the CLT the comparison distribution should be normal. -.25 clarify CLT makes it normal
QUIZ WEEK 6:
Which of the following is an example of p-hacking?
a) A researcher fabricates some results to support their hypothesis
b) A researcher replicates their study using a new sample of participants
c) A researcher changes their hypothesis after analyzing their data
d) A researcher collects six dependent measures but only reports on the two that show significant results
D)
P-hacking refers to the practice of selectively analyzing data or manipulating statistical tests in order to obtain significant results or support a desired hypothesis. By collecting multiple dependent measures but only reporting on the ones that yield significant results, the researcher is engaging in p-hacking. This selective reporting can lead to a biased presentation of the findings and can be misleading.
WEEK 6 QUIZ:
According to a “how to stop bullying” website, 15 percent of students report experiencing bullying one to three times within the most recent month. Assume the standard deviation is 5 percent of students. Joseph collects data from 125 students at a medium-sized school in Iowa and finds that only 12 percent reported this rate of bullying. What is his 95 percent confidence interval (rounded to two decimal places)?
[11.12, 12.88]
[14.27, 15.73]
[11.33, 12.67]
[14.12, 15.88]
[11.12, 12.88]
A teacher is comparing her students’ performance to the IQ test. Recall that the IQ test has a mean of 100 and a standard deviation of 15. If the teacher’s class consists of 30 students, and her students score a value of 105 on the IQ test, what would be the effect size (rounded to two decimal places)?
0.33
- use Cohen’s d formular:
- d=M-mu/standard deviation
- d=105-100/15
- =0.33