Freytag Lectures 6, 7 Flashcards

1
Q

Which would be a dichotomous variable?

a. Car brand
b. Sex
c. Blood type
d. Age

A

b. Sex

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When should a chi-square distribution test be used?

a. When a data set has more than 2 variables
b. When the sample is not random
c. Only when the degrees of freedom is >3
d. When you are comparing one parameter (variable) between groups

A

d. When you are comparing one parameter (variable) between groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

A contingency table…

a. Is not an appropriate way of presenting qualitative data
b. Displays the frequency distribution of variables
c. Allows for easy estimation of p values
d. Can only be used to display polytomous variables

A

b. Displays the frequency distribution of variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Weldon rolled 12 dice 26,306 times. Assuming each side is equally likely to come up, how many 3’s would you expect to observe?

a. (12 X 26,306)/3 = 105,224
b. √(6 X 12 X 26,306) = 1376
c. (1/6) X 12 X 26,306 = 52,612
d. (1/6) X 26,306 = 4,384

A

c. (1/6) X 12 X 26,306 = 52,612

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Consider the following statement: “there is no inconsistency between the observed and the expected counts. The observed counts follow the same distribution as the expected counts”

a. This is likely to describe an alternative hypothesis
b. If this statement was true, we could conclude that there is no effect
c. The distribution of the observed counts and expected counts must be normal
d. This statement would likely be symbolised by H0 (null hypothesis)

A

d. This statement would likely be symbolised by H0 (null hypothesis)

Never claim that there is NO effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How can you control the significance level of your test?

a. By controlling for Type I errors (false positive)
b. By controlling for both Type I and Type II errors
c. By accepting the null hypothesis when p<0.05
d. By controlling for Type II errors (false negatives)

A

a. By controlling for Type I errors (false positive)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does a Goodness of Fit test measure?

a. The P value required for H0 to be false
b. How well the observe data fits the expected distribution
c. How much observed values deviate from the expected values in a normal distribution
d. The test statistic of polytomous variables only

A

b. How well the observe data fits the expected distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How is the Goodness of Fit test statistic distributed?

a. With a X^2 with k-1 degrees of freedom
b. With a X^2 with X+1 degrees of freedom
c. Using the equation (O-E)/E
d. With a skew to the left

A

a. With a X^2 with k-1 degrees of freedom

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Which statement about an X^2 distribution is FALSE?

a. The degrees of freedom is the only parameter
b. Shape, centre and spread are influenced by the degrees of freedom
c. They are always positive and often right skewed
d. They do not require random sampling

A

d. They do not require random sampling

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a condition for an X^2 test for goodness of fit?

a. Observations must not be independent of one another
b. The sample size must be small
c. Random sampling cannot be used
d. Expected table cell count should be preferably more than 10

A

d. Expected table cell count should be preferably more than 10

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

The test of significance is designed to assess the strength of evidence AGAINST the null hypothesis.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

A type II error can be controlled whilst a type I error cannot.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Type I and type II errors are used together to gauge statistical significance.

A

False. cannot be used together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

The null hypothesis must be an equal, equal or greater or equal or lesser statement.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

You can never ACCEPT the null hypothesis due to the influence of Type II errors (false negatives)

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

If the null hypothesis states that men and women suffering from heart attack in New York were equally likely to die (i.e. sex and death are independent), what would a p value of < 0.05 suggest?

a. The null hypothesis is accepted and each variable is independent
b. The null hypothesis is rejected and each variable is independent
c. The null hypothesis is rejected and the variables are dependent of each other
d. The null hypothesis is accepted and the variables are dependent of each other

A

c. The null hypothesis is rejected and the variables are dependent of each other

17
Q

When performing an independence test in R…

a. A two variable contingency table must not be used
b. The degrees of freedom is (k-1)(l-1) where k is columns and l is rows
c. A Chi square test is irrelevant
d. A Yates-corrected chi-square test should be used by multiplying the T value by 0.5

A

b. The degrees of freedom is (k-1)(l-1) where k is columns and l is rows

18
Q

Which statement is false?

a. A test of homogeneity can assess whether two or more multinomial distributions are equal
b. The same assumptions for a Goodness of Fit test apply to tests for homogeneity (Random sampling, independence, large sample size, cell count)
c. Fischer’s exact test calculates the probability of obtaining a contingency table with the observed counts using a hypergeometric distribution
d. A Fischer’s exact test can only be used on contingency tables with 1 variable

A

d. A Fischer’s exact test can only be used on contingency tables with 1 variable

No it’s used for 2 X 2 contingency tables

19
Q

What should be included in Excel data provided to a statistician? (Select all that apply)

  1. Calculations to save the statistician having to examine Raw data
  2. No empty cells. “NA” should be used instead
  3. The data should be rectangular
  4. Data should be kept in multiple sheets when there are many values
  5. More rows than columns
  6. Well labelled data, columns and rows
  7. Documentation of the experiment
A
  1. No empty cells. “NA” should be used instead
  2. The data should be rectangular
  3. Well labelled data, columns and rows
  4. Documentation of the experiment
20
Q

What is NOT a feature of tidy data?

a. Replicate values are placed in the same cell
b. Each variable forms a column
c. Each observation forms a row
d. Each type of observation unit forms a table

A

a. Replicate values are placed in the same cell

21
Q

I have to label a variable in Excel for data that I will be sending to a statistician. Which name would be most suitable?

a. Maximum Temp (C)
b. Max Temp DegreesC
c. Max_Temp
d. Max/Temp.C

A

c. Max_Temp

22
Q

Spaces can only be used when typing variable names in Excel.

A

False

23
Q

It is okay to embed a graph in your Excel data if it will help the statistician better understand your project.

A

False

send separate documentation

24
Q

Comma’s must not be used in excel data.

A

True