Block 4 Quiz Flashcards

1
Q

Describe the use of a chi-square test

A

It measures the distribution between TWO categorical variables & compares the proportions of two or more categorical data sets to see if there’s statistical significance

  • Independent variable can be two or more groups (i.e treatments)
  • Dependent variable 2 or less groups (true/false)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

It measures the distribution between TWO categorical variables & compares the proportions of two or more categorical data sets to see if there’s statistical significance

  • Independent variable can be two or more groups (i.e treatments)
  • Dependent variable 2 or less groups (true/false)

Describe the statistical test

A

Chi-squared test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Key words associated with what statistical test?:

Two categorical variables
Two/+ independent variables
Two/- dependent variables
Compares proportions
Determines statistical significance.

A

Chi Square test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Measuring statistical significance of the proportion of adults with lung disease in clinics A, B, & C at a certain point in time

What Statistical test will you use?

A

Chi-Square test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Measuring statistical significance of the proportion of people with diabetes in four different ethnic groups.

What statistical test will you use?

A

Chi-square test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Describe the use of an ANOVA/F-test

A

analyzes the variance, it determines if there’s a statistically significant difference between two or more groups by comparing the means.
- Independent variable (manipulated by researcher)
- Dependent (mean)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a one-way F-test/ANOVA?

A

ONE independent variable
ONE dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Mean height of women in clinics A, B, &C

What is the stat test, the independent, & the dependent variables

A

One way ANOVA/F-test
Independent = Women
Dependent = Mean height

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Mean height of women & men in clinics A, B, &C

What is the stat test, the independent, & the dependent variables

A

Two-way ANOVA/F-test
Independent (Women & Men)
Dependent (Mean height)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a TWO way ANOVA/F-test?

A

TWO independent variable
ONE dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

(Pearsons correlation coefficient)

r = +1

A

Positive correlation
increase in y = increase in X
(vise versa)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

(Pearsons correlation coefficient)

r = -1

A

Negative correlation
decrease in y = decrease in x
(vise versa)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

(Pearsons correlation coefficient)

r = 0

A

NO correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

(Pearsons correlation coefficient)

r = 0.9

A

Weak positive correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what’s the correlation?

A

r= +1
Positive correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what’s the correlation?

A

r= -1
Negative correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

what’s the correlation?

A

r= 0
NO correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

what’s the correlation?

A

r= 0.9
Weak correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the use of a t-test?

A

It calculates the difference of the means of two samples & is used to make confidence intervals

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the use of a one sampled t-test?

A

calculates whether sample mean is different then the populations mean

p> 0.05 reject null
p< 0.05 can’t reject null

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the use of a two sampled t-test?

A

calculates whether sample mean of two different groups taken from the same population differs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the use of an unpaired t-test?

A

For independent samples, it calculates whether sample mean of two different groups sampled at the same time is different

Null = group mean is equal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is the use of a paired t-test?

A

For dependent samples, it calculates whether the sample mean of the same group sampled at two different times is different

Null= group mean is equal at both times

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Describe variance

A

The sum of the squared deviations from the mean divided by the total number of values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Variables: Describe independent variable
Doesn't depend on other variables so it can be manipulated by the researcher (i.e dosage of a drug given to a treatment group)
26
Variables: Describe dependent variable
Depends on other variables so it can't be manipulated (outcome)
27
Doesn't depend on other variables so it can be manipulated by the researcher (i.e dosage of a drug given to a treatment group)
Independent variable
28
Depends on other variables so it can't be manipulated (outcome)
Dependent variable
29
Variables: Describe discrete variables
Whole number values (number of ____)
30
Variables: Describe Continuous variables
Any number (weight, distance, temperature etc)
31
Variables: Describe categorical variables
Variables with categories
32
Variables: Describe Nominal Categorial variables
Categories without any order ( smoker vs non smoker or male vs female)
33
Variables: Describe Ordinal Categorical variables
Ordered categories (min/mod/severe)
34
Describe regression
It means making an equation to represent the relationship between a dependent (Y/outcome) & independent (X/exposure) variables
35
Describe logistic regression
Dependent variable is categorical - Simple (1 independent) - Multiple (more than 1 independent)
36
Dependent variable is categorical - Simple (1 independent) - Multiple (more than 1 independent)
logistic regression
37
It means making an equation to represent the relationship between a dependent (Y/outcome) & independent (X/exposure) variables
regression
38
Ordered categories (min/mod/severe)
Describe Ordinal Categorical variables
39
Categories without any order ( smoker vs non smoker or male vs female)
Describe Nominal Categorial variables
40
Variables with categories
Describe categorical variables
41
Describe linear regression
Dependent variable is continuous - simple (1 independent) - multiple (more than 1 independent)
42
r= +1
positive correlation
43
r= -1
Negative correlation
44
r= 0.9
strong correlation
45
r= 0
NO correlation
46
Describe the z-score
It compares populations with different means & standard deviations
47
z-score for a 95% CI is
1.96
48
z-score for a 97.5% CI is
2.24
49
z-score for a 99% CI is
2.58
50
Describe confidence intervals
A range of standard deviations that we're ____% sure our data will fall into
51
How does sample size impact CI?
Larger sample sizes mean narrower CI (small p-value aka more significant)
52
Wider CI indicates what about CI?
Less statistical significance
53
Describe a type 1 error
H0/Null is rejected when it's actually true
54
H0/Null is rejected when it's actually true
Type 1 error
55
Describe a type 2 error
H0/Null isn't rejected when it's actually false
56
H0/Null isn't rejected when it's actually false
Describe a type 2 error
57
Describe statistical power
The formula is 1-B (type 2 error) it represents the probability of correctly rejecting the H0/Null
58
How can you increase statistical power
Increase sample size
59
Describe what bimodal distribution represents
There are two subgroups in the populations
60
Describe this distribution
Bimodal, there are two subgroups within the population
61
Describe what Positive skew distribution represents
A right tailed skew mean < median < mode
62
A positive skew (right) mean < median < mode
63
Describe a negative skew distribution
Left tailed mean > median > mode
64
Negative skew (Left) mean > median > mode
65
Describe what the standard error of the mean means?
It's the deviation of the sample's mean from the populations mean Increasing standard deviation & decreasing sample both increase chance of SEM!
66
How can you reduce SEM?
Decrease standard deviation & increase sample size
67
Formula for SEM
68
What does standard deviation describe?
It describes variability or dispersion of the data in relation to the mean
69
68% translates to how many standard deviations
1SD
70
95% translates to how many standard deviations
2 SD
71
99.7% translates to how many standard deviations
3 SD
72
Formula for standard error of proportion
𝑺𝑬(𝒑)=√(𝒑(πŸβˆ’π’‘)/𝒏)
73
Describe what the standard error of proportion tells us
measure of the precision of our estimate
74
a small standard error indicates what?
a precise estimate
75
What stat test?: Numerical data + 1 group
One-sample t-test
76
What stat test?: Numerical data + 2 groups dependent on each other
Paired t-test
77
What stat test?: Numerical data + 2 groups independent on each other
unpaired t-test
78
What stat test?: Numerical data + over 2 groups independent on each other
ANOVA 1independent = One-way anova 2 independents = two-way anova
79
What stat test?: Categorical data + 1 group
z-test
80
What stat test?: Categorical data + 2 or more groups independent on each other
Chi-Square test
81
What is the best way to minimize the chances of a Type I error in a diagnostic ?
Increase the specificity of the test
82
A researcher is looking at the effect of a new medication upon blood pressure. She concludes that the results of her study are that the medication does affect blood pressure. What best describes her conclusion? a) The results of her study are not valid b) She has accepted the null hypothesis c) The power of her study was >95% d) The p-value for the relationship is <0.05 e) She has accepted the alternative hypothesis
e) She has accepted the alternative hypothesis
83
A researcher is setting up an experiment. She generates two hypotheses: first, that the medication will not lower blood pressure; and second, that the medication will lower blood pressure. What is her null hypothesis?
The medication will not lower blood pressure
84
What factors increase power?
1) bigger sample size 2) variability of observations 3) larger effects of interest 4) greater significance (narrow CI/small p-value)
85
Researchers are looking at whether or not silicone implants cause breast cancer. They are 80% confident tat their sample size of 250 will detect a difference in breast cancer rates, if in fact one does exist. What does this 80% confidence represent? a) The power of the test b) The false positive rate c) An alpha error rate of 80% d) The false negative rate e) A beta error of 80%
a) The power of the test
86
What's the stat test?: categorical data 2 paired groups
McNemars test
87
What test: Quantitative variable + 2 groups
t-test
88
What test: Ordinal variable + 2 groups
Mann Whitney test
89
What test: Categorical variable + 2 or more groups
chi-square
90
What test: Ordinal variable + paired data
Wilcoxon test
91
What test: Ordinal data + more than 2 groups
Kruskal Wallis test
92
What test Quantitative variable + paired data
paired t-test
93
What test Quantitative variable + over 2 groups
f-test/ANOVA
94
A researcher looks at the cholesterol level in men as compared to women. She is trying to determine whether or not there is a significant difference in the proportion of men with elevated cholesterol as compared to women. What is the most appropriate test? a) Regression b) T-test c) ANOVA d) Correlation e) Chi-square
e) Chi-square
95
A physician is studying the effects of drug A and drug B on cognitive performance in Alzheimer patients. He administers a memory test to two groups of subjects (those taking drug A and those taking drug B) and compares their mean scores. Which of the following statistical tests would be most appropriate for this purpose? a) T-test b) Chi-squared test c) Paired T-Test d) ANOVA e) Multiple regression
a) T-test
96
A researcher measures the cholesterol level of three demographic groups: Caucasian, Asian, and African American. What statistical test is used to compare the mean cholesterol level of these three groups? a) Regression b) Chi-squared test c) T-Test d) ANOVA e) Correlation
d) ANOVA
97
Strong positive correlation r= 0.8 to 1
98
Weak positive correlation r = 0.2 to 0.5
99
Strong negative correlation r = -0.8 to- 1
100
No correlation r = 0 to 0.2
101
Moderate negative correlation r =-0.5 to -0.8
102
Weak negative correlation r= -0.2 to -0.5
103
A researcher found that as cigarette smoking increased, cancer rates increased. How are these two variables related?
Positive correlation
104
A study of 100 women finds that as the cholesterol level increases, the blood pressure does not change. How are these variables related? Zero correlation
Zero correlation