Hypothesis Testing in R Flashcards

Question 1

Q

Question 2

Q

independent samples t-test code
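
A minimal sketch of an independent samples t-test, assuming a made-up data frame df with a numeric column age and a two-level grouping column patch (both names are hypothetical, not from the original card):

```r
# Hypothetical data: ages of pirates with and without an eye patch
set.seed(1)
df <- data.frame(
  age = c(rnorm(20, mean = 30, sd = 5), rnorm(20, mean = 27, sd = 5)),
  patch = rep(c("yes", "no"), each = 20)
)

# Formula notation: dependent variable ~ grouping variable (exactly two levels)
result <- t.test(formula = age ~ patch, data = df)

# Printing the object shows the t-statistic, df, p-value and confidence interval
result
```

By default t.test() runs Welch's version of the test, which does not assume equal variances in the two groups.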

Question 3

Q

p value definition

Answer

A

Assuming that the null hypothesis is true (i.e., that there is no difference between the groups), what is the probability that we would have gotten a test statistic at least as far away from 0 as the one we actually got?

It’s a bullshit detector aimed at the null hypothesis. If the p-value gets too small, the bullshit detector goes off.
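
The definition above can be checked by hand. A rough sketch, using simulated data (group_a and group_b are made-up vectors): take the t-statistic and degrees of freedom from the test, then ask how much of the null t distribution lies at least that far from 0.

```r
# Simulated data for two hypothetical groups
set.seed(42)
group_a <- rnorm(25, mean = 10, sd = 2)
group_b <- rnorm(25, mean = 11, sd = 2)

result <- t.test(group_a, group_b)

# Pull out the test statistic and its degrees of freedom
t_stat <- unname(result$statistic)
deg_free <- unname(result$parameter)

# Two-sided p-value: area in both tails of the null t distribution
# that is at least as far from 0 as the observed t-statistic
p_by_hand <- 2 * pt(-abs(t_stat), df = deg_free)

# Compare with the p-value that t.test() reports
all.equal(p_by_hand, result$p.value)
```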

Question 4

Q

Does the p-value tell us the probability that the null hypothesis is true?

Answer

A

No!!! The p-value does not tell you the probability that the null hypothesis is true. In other words, if you calculate a p-value of .04, this does not mean that the probability that the null hypothesis is true is 4%. Rather, it means that if the null hypothesis were true, the probability of obtaining a result at least as extreme as the one you got would be 4%. Now, this does indeed set off our bullshit detector, but again, it does not mean that the probability that the null hypothesis is true is 4%.
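
One way to see this, sketched as a small simulation (the sample sizes and number of repetitions are arbitrary choices): when the null hypothesis is true by construction, p-values below .05 still turn up in roughly 5% of experiments. The p-value describes the data given the null, not the null given the data.

```r
set.seed(123)

# Run many experiments in which the null hypothesis is TRUE:
# both groups are drawn from the same normal distribution
p_values <- replicate(2000, {
  a <- rnorm(20)
  b <- rnorm(20)
  t.test(a, b)$p.value
})

# Proportion of experiments where the detector goes off anyway
false_alarm_rate <- mean(p_values < 0.05)
false_alarm_rate
```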

Question 5

Q

htest

Answer

A

R stores hypothesis tests in a special object class called htest. htest objects contain all the major results from a hypothesis test, from the test statistic (e.g., a t-statistic for a t-test) and estimates (e.g., a correlation coefficient for a correlation test), to the p-value, to a confidence interval.

Different hypothesis test functions need their data in different formats: some take vectors or dataframe columns, others take a table.
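
A short sketch of what this looks like in practice, using simulated data (x, y, a and b are made-up names). The t-test and correlation test take vectors; the chi-square test here takes a table() of counts. All three return htest objects.

```r
set.seed(7)
x <- rnorm(30, mean = 5)
y <- x + rnorm(30)
a <- sample(c("yes", "no"), 200, replace = TRUE)
b <- sample(c("yes", "no"), 200, replace = TRUE)

tt <- t.test(x, mu = 4)          # vector input
ct <- cor.test(x, y)             # two vectors
xt <- chisq.test(table(a, b))    # table input

# All three results share the same class
class(tt)
class(ct)
class(xt)

# Individual results are pulled out with $
tt$statistic   # t-statistic
tt$p.value     # p-value
tt$conf.int    # confidence interval
ct$estimate    # correlation coefficient
```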

Question 6

Q

names()

Answer

A

Returns the names of all of the elements in an htest object.
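
For example, a quick sketch on a one-sample t-test of simulated data:

```r
set.seed(3)
result <- t.test(rnorm(15, mean = 1), mu = 0)

# List the names of everything stored in the htest object
names(result)

# Any named element can then be accessed with $
result$p.value
```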

Question 7

Q

one sample t-test

Answer

A

You can pull the data from a df or from separate vectors; it doesn’t have to come from a table() function.
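
A sketch of both options, assuming a made-up column called tattoos and a null hypothesis mean of 8 (both are illustrative choices):

```r
# Hypothetical data: number of tattoos for 10 pirates
df <- data.frame(tattoos = c(9, 12, 7, 10, 11, 8, 13, 9, 10, 12))

# Option 1: pull the column straight from the dataframe
from_df <- t.test(x = df$tattoos, mu = 8)

# Option 2: use a separate vector
tattoos <- df$tattoos
from_vec <- t.test(x = tattoos, mu = 8)

# Same data, same test, same p-value
all.equal(from_df$p.value, from_vec$p.value)
```

The mu argument sets the mean under the null hypothesis.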

Question 8

Q

t tests compared to each other in bar chart form

Answer

A

You can pull the data from a df or from separate vectors; it doesn’t have to come from a table() function.

Question 9

Q

Using subset to select levels of an IV

Answer

A

Use the %in% operator inside the subset argument to specify which levels of an IV you want to test.
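
A sketch with a made-up three-level IV called college (the column names and level labels are hypothetical). The t-test needs exactly two groups, so subset keeps only the two levels of interest:

```r
set.seed(11)
df <- data.frame(
  age = rnorm(60, mean = 30, sd = 5),
  college = rep(c("CCCC", "JSSFP", "Other"), each = 20)
)

# Keep only rows whose college is one of the two listed levels
result <- t.test(
  formula = age ~ college,
  data = df,
  subset = college %in% c("CCCC", "JSSFP")
)

# One estimated mean per remaining group
result$estimate
```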

Question 10

Q

cor.test()

Question 11

Q

two ways to run a correlation test

Answer

A

To run a correlation test between two variables x and y, use the cor.test() function. You can do this in one of two ways. If x and y are columns in a dataframe, use the formula notation (formula = ~ x + y) together with the data argument. If x and y are separate vectors (not in a dataframe), use the vector notation (x, y).

You can pull the data from a df or from separate vectors; it doesn’t have to come from a table() function.
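
Both notations, sketched on a made-up dataframe with columns age and parrots (hypothetical names and simulated values):

```r
set.seed(5)
df <- data.frame(age = round(rnorm(40, mean = 30, sd = 6)))
df$parrots <- rpois(40, lambda = 2) + (df$age > 30)

# Way 1: formula notation, columns looked up in the dataframe
by_formula <- cor.test(formula = ~ age + parrots, data = df)

# Way 2: vector notation, two separate vectors
by_vector <- cor.test(x = df$age, y = df$parrots)

# Both give the same correlation coefficient
all.equal(by_formula$estimate, by_vector$estimate)
```

Note that the formula has nothing on the left of the ~, because a correlation has no dependent variable.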

Question 12

Q

example correlation test

Question 13

Q

using subset() in the cor.test() function

Answer

A

Just like the t.test() function, we can use the subset argument in the cor.test() function to conduct a test on a subset of the entire dataframe. For example, to run the same correlation test between a pirate’s age and the number of parrots she’s owned, but only for female pirates, I can add the subset = sex == "female" argument.
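
A sketch of that call on simulated data (the dataframe below is made up; only the column names sex, age and parrots follow the card):

```r
set.seed(9)
df <- data.frame(
  sex = rep(c("female", "male"), each = 25),
  age = round(rnorm(50, mean = 30, sd = 6)),
  parrots = rpois(50, lambda = 3)
)

# Correlation between age and parrots, female pirates only
female_test <- cor.test(
  formula = ~ age + parrots,
  data = df,
  subset = sex == "female"
)

# Degrees of freedom are n - 2, so 25 female pirates give 23
female_test$parameter
```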