Exam 1 Flashcards

Question 1

Q

Definition of p-value: (2)

Answer

A

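The probability of getting results at least as extreme as ours if the null hypothesis is true, i.e. by chance alone. It is compared to alpha, the probability of making a type 1 error that we are willing to accept.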

Question 2

Q

How is p-value used to determine whether to accept or reject the null hypothesis?

Answer

A

The p-value is compared to alpha = 0.05. If “p” is less than or equal to 0.05, then we reject the null hypothesis; if it is greater than 0.05, we fail to reject (informally, “accept”) it.

Question 3

Q

What is statistical power?

Answer

A

Statistical power is the probability of correctly rejecting the null hypothesis when it is false.

Question 4

Q

Increasing “n” will ____ power

Question 5

Q

Give 2 reasons why it is not possible for researchers to study an entire population

Answer

A

It would take too many resources to study an entire population.
There are usually too many subjects in a population to be able to gather them for a study.

Question 6

Q

What is a type 1 error?

Answer

A

A type 1 error is when you reject the null hypothesis even though it is true (a false positive).

Question 7

Q

What is standard deviation a measure of?

Answer

A

Standard deviation is a measure of how data points, x, differ from the mean, (bar X).

Question 8

Q

What is random assignment and what is one advantage of using it?

Answer

A

Giving participants in a study an equal probability of being assigned to a certain group. One advantage is that it gets rid of some bias.

Question 9

Q

Which of the following is not a measure of sampling error?

a. Confidence limits
b. Power
c. Standard error
d. None of the above

Question 10

Q

Describe 95% confidence limits

Answer

A

A researcher is 95% confident that the population mean is within the confidence limits.
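
A minimal sketch of how 95% confidence limits can be computed, assuming the normal approximation (mean ± 1.96 x standard error). The sample values and the function name are illustrative only; for small samples a t critical value would normally replace 1.96.

```python
import math
import statistics

def confidence_limits_95(sample):
    """Return (lower, upper) 95% limits using the normal approximation."""
    mean = statistics.mean(sample)
    sem = statistics.stdev(sample) / math.sqrt(len(sample))  # standard error
    margin = 1.96 * sem  # 1.96 is the two-tailed z value for 95%
    return mean - margin, mean + margin

# Hypothetical measurements (not from the course data).
sample = [10.0, 12.0, 14.0, 16.0, 18.0]
lower, upper = confidence_limits_95(sample)
print(round(lower, 2), round(upper, 2))
```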

Question 11

Q

On what axis is the independent variable plotted?

Answer

A

Horizontal

Question 12

Q

Which three measures are approximately equal in a normal distribution?

Answer

A

Mean, Median, and Mode

Question 13

Q

What type of statistical analysis should be used to compare observed phenotypic frequencies for a dihybrid cross involving eye color (red, white) and wingless (wildtype or wingless) to an expected ratio of 9:3:3:1?

Answer

A

Chi Square Goodness of Fit
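
A minimal sketch of the goodness-of-fit calculation for a 9:3:3:1 ratio. The observed counts are hypothetical, and the function name is an assumption for illustration; 7.815 is the standard critical value for 3 degrees of freedom at alpha = 0.05.

```python
def chi_square_gof(observed, expected_ratio):
    """Return (chi-square statistic, degrees of freedom)."""
    total = sum(observed)
    ratio_sum = sum(expected_ratio)
    # Scale the ratio up to the observed total to get expected counts.
    expected = [total * r / ratio_sum for r in expected_ratio]
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    df = len(observed) - 1  # number of categories - 1
    return chi2, df

# Hypothetical counts for the four phenotype classes (total = 160).
observed = [90, 30, 28, 12]
chi2, df = chi_square_gof(observed, [9, 3, 3, 1])

CRITICAL_DF3 = 7.815  # critical value for df = 3, alpha = 0.05
reject_null = chi2 > CRITICAL_DF3
print(round(chi2, 3), df, reject_null)
```

Here the statistic is well below the critical value, so these made-up counts would be consistent with 9:3:3:1.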

Question 14

Q

How is Z-score Calculated?

Answer

A

(“x” - mean)/Standard deviation
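
A short sketch of the Z-score formula above. The sample values are hypothetical and the function name is only for illustration.

```python
import statistics

def z_score(x, mean, sd):
    """Number of standard deviations that x lies from the mean."""
    return (x - mean) / sd

# Hypothetical sample (illustrative values only).
heights = [10.0, 12.0, 14.0, 16.0, 18.0]
mean = statistics.mean(heights)
sd = statistics.stdev(heights)  # sample standard deviation
z = z_score(18.0, mean, sd)
print(round(z, 3))
```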

Question 15

Q

In the tattoo hepatitis test, what type of statistical test should be used to analyze the data?

Answer

A

Chi square test of Independence
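
A minimal sketch of a chi-square test of independence on a 2 x 2 table. The counts below are invented for illustration and are not the data from the tattoo case; the degrees of freedom use the standard (rows - 1) x (columns - 1) rule.

```python
def chi_square_independence(table):
    """Return (chi-square statistic, degrees of freedom) for a 2D table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if the two variables were unrelated.
            expected = row_totals[i] * col_totals[j] / grand_total
            chi2 += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df

# Hypothetical counts: rows = tattoo yes/no, columns = hepatitis C yes/no.
table = [[20, 30],
         [10, 40]]
chi2, df = chi_square_independence(table)
print(round(chi2, 3), df)
```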

Question 16

Q

What is the null hypothesis for the hepatitis/tattoo case?

Answer

A

Hepatitis C is not dependent on the tattoo parlor.

Question 17

Q

What is the alternative hypothesis for the hepatitis/tattoo case?

Answer

A

Hepatitis C is related to the tattoo parlor.

Question 18

Q

What are three examples of categorical variables?

Answer

A

Gender, Phenotype, Genotype

Question 19

Q

What are three examples of numerical variables?

Answer

A

Height, Age, Biomass

Question 20

Q

What two things must a histogram have?

Answer

A

Measured variable on x-axis, and frequency on y-axis

Question 21

Q

What is a Gaussian distribution?

Answer

A

A normal, bell-shaped curve in which the mean, median, and mode are all equal.

Question 22

Q

How is range calculated?

Answer

A

Highest-Lowest

Question 23

Q

How is variance calculated?

Answer

A

(Standard Deviation)^2
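
A small sketch tying together range, standard deviation, and variance, using Python's built-in `statistics` module on a hypothetical data set.

```python
import statistics

# Hypothetical data (illustrative values only).
data = [4, 7, 7, 9, 13]

data_range = max(data) - min(data)     # highest - lowest
sd = statistics.stdev(data)            # sample standard deviation
variance = statistics.variance(data)   # sample variance

# The variance equals the standard deviation squared.
print(data_range, round(sd ** 2, 6), variance)
```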

Question 24

Q

How is the mode of a graph recognized?

Answer

A

Always the highest peak

Question 25

Q

What do positive and negative skew look like on a graph?

Answer

A

Neg. Skew has a tail on the left, and pos. skew has a tail on the right

Question 26

Q

Describe Symmetric Distribution

Answer

A

50% of the data is above the mean, and 50% of the data is below the mean

Question 27

Q

What is sampling error?

Answer

A

How much a result from the sample differs from the true population value because we studied a sample instead of the whole population.

Question 28

Q

What can influence sampling error?

Answer

A

Anomalies: subjects that differ greatly from the mean

Question 29

Q

What is the standard error of the mean?

Answer

A

How much our sample mean is expected to differ from the population mean.

Question 30

Q

What are 3 ways to lower standard error?

Answer

A

Increase the sample size
Decrease the standard deviation
Improve measurement
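
The first two points follow from the usual formula, standard error = standard deviation / sqrt(n). A quick sketch with made-up numbers:

```python
import math

def standard_error(sd, n):
    """Standard error of the mean: sd divided by the square root of n."""
    return sd / math.sqrt(n)

baseline = standard_error(sd=10, n=25)    # 2.0
larger_n = standard_error(sd=10, n=100)   # 1.0: four times the sample size halves it
smaller_sd = standard_error(sd=5, n=25)   # 1.0: half the spread halves it
print(baseline, larger_n, smaller_sd)
```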

Question 31

Q

What do two sample t-tests test for?

Answer

A

To determine whether the difference between two sample means is statistically significant.
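
A minimal sketch of the pooled two-sample t statistic, assuming equal variances in the two groups. The data and function name are hypothetical; the resulting t would then be compared to a critical value for the given degrees of freedom.

```python
import math
import statistics

def two_sample_t(a, b):
    """Return (t statistic, degrees of freedom) using a pooled variance."""
    na, nb = len(a), len(b)
    va, vb = statistics.variance(a), statistics.variance(b)
    # Pooled variance: weighted average of the two sample variances.
    pooled = ((na - 1) * va + (nb - 1) * vb) / (na + nb - 2)
    se = math.sqrt(pooled * (1 / na + 1 / nb))
    t = (statistics.mean(a) - statistics.mean(b)) / se
    return t, na + nb - 2

# Hypothetical measurements from two independent groups.
group_a = [5, 6, 7, 8, 9]
group_b = [2, 3, 4, 5, 6]
t, df = two_sample_t(group_a, group_b)
print(round(t, 3), df)
```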

Question 32

Q

What is meant by “Statistically significant”?

Answer

A

More than we’d expect by chance

Question 33

Q

How do you decide if a 2-sample t-test or a paired t-test should be conducted?

Answer

A

If the same subjects are measured twice (e.g. before and after), or the data come in matched pairs, use a paired t-test. If the two samples are independent groups, a 2-sample t-test must be used.

Question 34

Q

What 2 things does a 2-sample t-test assume?

Answer

A

The data for each sample are normally distributed
The variances for each sample are statistically equal

Question 35

Q

What 1 thing does a paired t-test assume?

Answer

A

The differences between pairs of data are normally distributed
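
A minimal sketch of the paired t statistic, which works on the differences within each pair. The before/after values are hypothetical.

```python
import math
import statistics

def paired_t(before, after):
    """Return (t statistic, degrees of freedom) for paired measurements."""
    diffs = [a - b for a, b in zip(after, before)]
    n = len(diffs)
    mean_diff = statistics.mean(diffs)
    se = statistics.stdev(diffs) / math.sqrt(n)  # standard error of the differences
    return mean_diff / se, n - 1

# Hypothetical measurements on the same four subjects.
before = [10, 12, 9, 11]
after = [12, 15, 10, 14]
t, df = paired_t(before, after)
print(round(t, 2), df)
```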

Question 36

Q

What is a type 2 error?

Answer

A

When you fail to reject the null hypothesis but should have rejected it because it is false (a false negative).

Question 37

Q

What is the non-parametric alternative test to a 2-sample t-test?

Answer

A

Mann-Whitney U-Test (the Kruskal-Wallis Test is its extension to more than two groups)

Question 38

Q

What is the non-parametric alternative test to a paired t-test?

Answer

A

Wilcoxon Signed Rank test

Question 39

Q

What is an advantage of random sampling?

Answer

A

Eliminates some bias

Question 40

Q

What is the difference between a bar graph and a histogram?

Answer

A

Bar Graph: Shows categorical data on the x-axis and a d.v. on the y-axis
Histogram: Shows numerical data on the x-axis and frequency on the y-axis

Question 41

Q

What are the three measures of central tendency?

Answer

A

Mean, median and mode

Question 42

Q

In a Z-test, what is the p-value compared to?

Answer

A

The p-value from the Z table is compared to 0.025 instead of 0.05

Question 43

Q

When is Welch’s approximation used?

Answer

A

For samples with unequal variances, as an alternative to the standard 2-sample t-test. It still assumes normally distributed data, so it is not a nonparametric test.

Question 44

Q

What is the difference between a parametric and non-parametric test?

Answer

A

A parametric test makes assumptions about the population that must be met in order to conduct the test, such as homogeneity of variances or normally distributed data, whereas a nonparametric test does not make these assumptions about the population.

Question 45

Q

What is the nonparametric alternative to a 2-sample t-test that does not meet the assumption of normality?

Answer

A

Mann-Whitney U-Test

Question 46

Q

What is the nonparametric alternative to a 2-sample t-test that does not meet the assumption of equal variances?

Answer

A

Kolmogorov-Smirnov Test

Question 47

Q

What nonparametric test determines homogeneity of variances?

Answer

A

Levene’s Test

Question 48

Q

Why is 1-way ANOVA better than multiple t-tests?

Answer

A

It is designed to protect against alpha inflation, and it saves the time that would be spent conducting multiple tests. Reducing alpha inflation lessens the chance of getting a false positive.
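
A rough sketch of alpha inflation, assuming the pairwise t-tests are independent (a simplification): the chance of at least one false positive is 1 - (1 - alpha)^k for k tests. The function name is illustrative.

```python
from math import comb

def familywise_error(n_groups, alpha=0.05):
    """Chance of at least one false positive across all pairwise tests."""
    k = comb(n_groups, 2)  # number of pairwise comparisons
    return 1 - (1 - alpha) ** k

print(round(familywise_error(3), 4))  # 3 groups -> 3 t-tests
print(round(familywise_error(5), 4))  # 5 groups -> 10 t-tests
```

With 3 groups the overall false-positive risk is already about 14% instead of 5%.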

Question 49

Q

What is the nonparametric alternative to One Way ANOVA?

Answer

A

Kruskal-Wallis Test

Question 50

Q

What is the difference behind the purpose of correlation and the purpose of linear regression?

Answer

A

Correlation analysis specifically tests for a linear relationship between 2 measured variables, without treating either one as dependent on the other. Regression attempts to fit a line, or curve, to data to look for a dependence of one variable on another.

Question 51

Q

Describe the F-Ratio

Answer

A

Compares the variation between each group being tested, relative to the variation between individuals within each group. A large F-Ratio indicates more variation between groups relative to the variation within groups
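
A minimal sketch of the F-ratio for a 1-way ANOVA: the between-group mean square divided by the within-group mean square. The groups are hypothetical and the function name is an assumption for illustration.

```python
import statistics

def f_ratio(groups):
    """Return (F, df_between, df_within) for a list of groups."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (statistics.mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of individuals around their own group mean.
    ss_within = sum(sum((x - statistics.mean(g)) ** 2 for x in g) for g in groups)
    df_between, df_within = k - 1, n_total - k
    f = (ss_between / df_between) / (ss_within / df_within)
    return f, df_between, df_within

# Hypothetical data: the third group sits well above the other two.
groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
f, df_between, df_within = f_ratio(groups)
print(round(f, 2), df_between, df_within)  # F is about 13 with df (2, 6)
```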

Question 52

Q

“2 x 3” design is:

Answer

A

Comparing means across 2 independent variables, 1 with 2 levels and 1 with 3 levels, giving a total of 6 groups

Question 53

Q

Main Effect:

Answer

A

Explains whether the effect of an independent variable on a measured dependent variable is significant or not

Question 54

Q

Interaction:

Answer

A

Explains if two independent variables influence one another in their effect on the dependent variable

Question 55

Q

What “r” values represent a strong correlation?

Answer

A

Absolute values closer to “1”
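
A short sketch of how Pearson's “r” can be computed by hand. The data are hypothetical and the function name is only for illustration.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical paired measurements.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
r = pearson_r(xs, ys)
r_perfect_negative = pearson_r(xs, [-x for x in xs])
print(round(r, 3), round(r_perfect_negative, 3))
```

The sign of “r” gives the direction of the relationship; only the absolute value tells you how strong it is.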

Question 56

Q

What is the main assumption of repeated measures ANOVA?

Answer

A

Sphericity: The variances of all possible pairwise combinations of groups are equal

Question 57

Q

How are 2-way ANOVA results reported?

Answer

A

F(2,84)=15.75, p

Question 58

Q

How are regression results reported?

Answer

A

B=0.287, p

Question 59

Q

How is a regression equation set up?

Answer

A

Dependent = B value (for I.V.) x I.V. + B value (for the constant, i.e. the intercept)
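
A minimal least-squares sketch showing where the two B values in a simple regression equation come from (a slope and an intercept). The data and names are hypothetical.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the data."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

# Hypothetical data: xs is the independent variable, ys the dependent one.
xs = [1, 2, 3, 4, 5]
ys = [2, 4, 5, 4, 5]
slope, intercept = fit_line(xs, ys)

# Predicted dependent value for a new x: slope * x + intercept.
predicted = slope * 6 + intercept
print(round(slope, 2), round(intercept, 2), round(predicted, 2))
```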

Question 60

Q

GO INTO DETAIL WITH SPECIFIC CONCLUSIONS

Answer

A

GO INTO DETAIL WITH SPECIFIC CONCLUSIONS

Question 61

Q

What do you do after calculating a Z score?

Answer

A

Compare it to the value in the table to find your p-value, then compare that to 0.025

Question 62

Q

How are chi-square results reported?

Answer

A

χ²(1)=2.251, p=0.134

Question 63

Q

How do you calculate degrees of freedom for chi-square?

Answer

A

Number of categories - 1

Question 64

Q

How is the chi-square value compared to the critical value?

Answer

A

If the calculated value is higher than the critical value, then our p-value is lower than 0.05, so we reject the null hypothesis.

Answer 63

A

Whether “y” depends on any of the MULTIPLE independent variables

Answer 64

A

All I.V.’s are analyzed at the same time to give an overall R squared value, as opposed to step-wise multiple regression in which each I.V. gets its own R squared value

Answer 65

A

The column on the right tells us if the change in R squared is significant or not

Answer 66

A

It analyzes 2 or more CATEGORICAL variables with counted FREQUENCIES to determine if they are related

Answer 67

A

Is prostate cancer outcome dependent on the type of treatment? (categories are surgery and radiation)

Answer 68

A

r(18)= 0.254, p=0.280

Answer 69

A

n-2

Ex: sample of 20-2=18

Answer 70

A

When variable on x-axis is numerical, not categorical

Answer 71

A

A different mean

Answer 72

A

Which comparisons are significant, N.S. = Not significant

Answer 73

A

Variables on both axes are numerical, ALL the raw data are plotted. Used with small sample sizes. A measurement of X and a measurement of Y make a dot on the graph that shows trends and relationships

Answer 74

A

Histogram because we are plotting counts, not means

Answer 75

A

When there are too many statistical results to report in an analysis or show in a graph.

Answer 76

A

Photographs, flowcharts, diagrams, and maps

Answer 77

A

Prob. of “A x B”

Ex: Flipping a coin to get heads twice= 1/2 x 1/2 = 1/4

Answer 78

A

Prob. of “a” + “b”

Ex: Rolling a 2 OR a 3= 1/6 + 1/6 = 2/6 = 1/3

Answer 79

A

Find lesser probability and subtract from 1
Ex: At least 1 coin comes up heads=
Both tails is 1/2 x 1/2 = 1/4 , 1 - 1/4 = 3/4 = 75%
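
The three probability examples above (multiply for “and”, add for mutually exclusive “or”, subtract from 1 for “at least one”) can be checked with exact fractions. A small sketch; the variable names are only for illustration.

```python
from fractions import Fraction

half = Fraction(1, 2)    # chance of heads on one fair coin flip
sixth = Fraction(1, 6)   # chance of any one face on a fair die

both_heads = half * half               # heads AND heads
two_or_three = sixth + sixth           # rolling a 2 OR a 3
at_least_one_head = 1 - half * half    # 1 minus the chance of both tails

print(both_heads, two_or_three, at_least_one_head)
```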

Answer 80

A

Replicate the experiment and see if you get the same results

Answer 81

A

The extent to which a measure actually indicates what it’s intended to

Answer 82

A

Protect against falsified data

Answer 83

A

Sample size unrepresentative/small/biased
Inadequate controls/comparisons
Non-randomized

Answer 84

A

Could be biased allocation of patients to certain groups

Answer 85

A

Predicting an equation of “Y” based on “X”

Answer 86

A

The test of independence uses two or more categorical variables to find out whether they are related, while the goodness of fit test looks to see if observed frequencies match the frequencies expected under a given hypothesis (such as a 9:3:3:1 ratio).