Stats and research methods Flashcards
What does the area under the curve of a normal distribution represent?
a. the proportionality of obtaining any given range of values
b. the proportionality of obtaining the highest value in a specified range.
c. the probability of obtaining the lowest value in a specified range
d. the probability that the mean of a population exists in a specified range.
The correct answer is A. The area under the normal curve (the integral) represents the probability of obtaining a range of values. (Technically speaking, it is the area under the density function of the normal distribution.)
Imagine a situation in which the null hypothesis is true, and α is (as usual) set to .05. You collect some data, and do your statistical test and get a p-value of .01. If you kept repeating this process of collecting data and doing a statistical test, what percentage of the time would you expect to reject the null hypothesis?
A. 1%
B. 5%
C. 95%
D. 99%
Answer: B. If you refer back to where α was mentioned in the videos you will see that it’s meant to determine the long-term rate of falsely rejecting the a true null hypothesis. The fact that you got a p-value of .01 when you did your first test is irrelevant - the long-term rate of rejecting a true null hypothesis is still meant to be 5%.
By the way, when dealing with psychological data the null hypothesis is almost never true.
You have a random sample of 30 data points. Which of these facts would provide the strongest indication of non-normality in the population from which these data points were sampled?
A. The mean and mode and median are all the same
B. The mean is much lower than the median
C. The standard deviation is over 9000
D. There are 0 data points more than 2 standard deviations from the mean
B is a sign of skewness and thus non-normality. In a perfectly normal distribution mean = median = mode and outliers are rare, so A and D can’t be right. C is irrelevant because the value of the standard deviation depends on the scale of the data. The question didn’t tell you whether the range was (for example) 0 to 20,000 or 0 to 60,000, and you can’t really make any sort of judgment about how large “over 9000” is in absolute terms.
Which of these variables would be most likely to approximate a normal distribution?
A. Heights of 20-year-old women
B. Populations of human settlements
C. Total sales of single-volume books
D. Values obtained when throwing a fair die (that is, a single dice)
A is the best answer. It won’t be exactly normal, but if you have a large enough sample it will be a decent approximation. The reasons why are not relevant to this course, but simplifying a bit you can expect a (somewhat) normal distribution whenever an outcome is the result of a range of variables acting (somewhat) independently from one another. Regarding those variables, think of all the genetic and environmental factors that contribute to height. It’s true that they don’t necessarily act independently from one another, and that is one of the reasons why the distribution of heights is only approximately normal.
B isn’t right, because there are a whole lot of little townships with small populations, and a small number of colossal cities like Tokyo and Delhi.
C isn’t right for the same reason as B. There are a lot of books with relatively few sales, and then a very small few (e.g. A Tale of Two Cities, Harry Potter and the Philosopher’s Stone) with colossally more sales.
D would not resemble a normal distribution but would be uniform (otherwise known as ‘rectangular’) - each of the six values on the die is as likely as another.
Stan Statissian runs a two-sample t-test examining whether science students arrive significantly later than arts students to his class. The p value is 0.4. Which of the following statements is the most reasonable conclusion to draw from this outcome.
A. There is evidence for the proposition that arts and science students arrive at class at the same time.
B. There is evidence against the proposition that arts and science students arrive at class at the same time.
C. There is no evidence for the proposition that arts and science students arrive at class at the same time.
D. There is no evidence against the proposition that arts and science students arrive at class at the same time.
The correct answer is D. The logic of null hypothesis significance testing is somewhat obtuse: we use the obtained data as evidence against the null hypothesis (we never endorse the null hypothesis, we can only fail to reject it). Thus, in this instance, where we have a p value above any reasonable alpha level (conventionally set to 0.05), we fail to reject the null hypothesis meaning that we have no evidence against the proposition that arts and science students arrive at the same time to class.
Which of the following statements is implied if the null hypothesis is true?
A. Any observed differences are due to unobserved factors.
B. Any observed differences are due to controlled factors.
C. The experiment was not sensitive enough to pick up a true differences.
D. A false alarm cannot be discounted
The correct answer is A. If the null hypothesis is actually true, then any observed variability is simply due to random perturbations in the data generating process that have not been controlled for or specifically measured.
A p-value tells you
Answers:
A. The probability of observing results as (or more) extreme than those actually observed, if the null hypothesis is true
B. The probability that, if the experiment were repeated many times, the results you obtained would be replicated
C. The probability the null hypothesis is false
D. The probability the null hypothesis is true
ANSWER: A. This definition of a p-value is functionally equivalent to the one Simon presented in the previous section. D is always a common answer here, and indeed it would be nice if we could know the probability the null hypothesis is true. However, using the statistics in the present course we do not attempt to get at this.
Which of the following is NOT an example of doing statistical inference?
a. Trying to work out a population standard deviation
b. Trying to work out a sample standard deviation
c. Trying to work out whether a single observation could plausibly have been drawn from a certain population
d. Trying to work out whether two samples come from the same or different populations.
ANSWER: B. Working out a sample standard deviation is simply a matter of performing calculations on that sample data. By contrast, the other answers all involve inferring something about a population from a sample. This is most clear in C and D, which were drawn directly from the video. A is a little less obvious but (almost invariably) you will attempt to infer the population standard deviation from the sample standard deviation.
What variation does a one-way between-groups ANOVA compare?
A. Between-groups variance and population variance.
B. Between-groups variance and within-groups variance.
C. Sample variance and population variance.
D. Within-groups variance and population variance.
The correct answer is B.
Which of the following is the definition of the total sum of squared deviations?
A. It is the sum of the squared difference between each data point and the grand mean.
B. It is the squared sum of the difference between each data point and the grand mean.
C. It is the sum of the difference of the squared value of each data point and the grand mean.
D. Is the squared sum of the difference between the square root of each data point and the square root of the grand mean.
The correct answer is A. The easiest way to work out this question is to write each of the formulae out in full.
The sum of the squared deviations, (X-Xbar)², is also called the sum of squares or more simply SS. SS represents the sum of squared differences from the mean and is an extremely important term in statistics. Variance. The sum of squares gives rise to variance.
Which SS measures the variability of each data point from the overall mean?
C. This follows from the definition of the SSTOTAL. In order to calculate the SSTOTAL you ignore what groups people are in and calculate the overall average score on the DV. Then you measure the difference between each data point and the overall average score on the DV, and square those differences. Then you add up those squared differences. By the way, the “Error” SS is another, less common, name for the SSWITHIN.
What are the values of the following:
Sum of squares total
Sum of squares within
Sum of squares between
Sum of squares total: Take the difference between each observed scores and the mean score, square them, add them up!
Sum of squares within: Sum of squares of each group, added up
Sum of squares between: sum of squares total minus sum of squares minus the sum of squares within
What is the F-statistic?
In general, an F-statistic is a ratio of two quantities that are expected to be roughly equal under the null hypothesis, which produces an F-statistic of approximately 1
In ANOVA the different groups are called
A. Groups of the study
B. Levels of the factor
C. Observations of the sample
D. Parameters of the population
ANSWER: B. This is just a matter of terminology. ‘Factor’ is ANOVA-speak for independent variable, while we call the groups ‘levels’.
You are replicating a previous study in which the researchers conducted an ANOVA comparing blue-eyed, green-eyed and brown-eyed participants on a measure of anxiety, and obtained a statistically significant result. However, you are using twice as many participants. Which of the following statements is true?
Answers:
A. The effect size would be expected to be bigger in the your study.
B. The null hypothesis will be even more likely to be false in your study than in the previous study.
C. The p-value should be approximately the same as in the previous study.
D. The SSTOTAL would be expected to increase substantially.
ANSWER: D. This follows from the way the SSTOTAL is calculated. As previously mentioned, we don’t get to find out the probability the null hypothesis is true (or false) and hence B can’t be right. Furthermore, the null hypothesis is either true, or it isn’t, and our study doesn’t change the fact of its truth or falsity. The magnitude of effect won’t change in any systematic way due to us taking a larger sample size, so A is wrong. We would expect the p-value to decline with a larger sample size, so C is wrong.
Sum of squares total: Take the difference between each observed scores and the mean score, square them, add them up!