Lecture 6: Statistical Testing II Flashcards

1
Q

What is the ANOVA test (Analysis of variance)?

A

A technique used to compare means among three or more independent populations with one test.

Each individual falls into a group described by a (categorical) grouping variable and for each individual we measure some continuous outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The P value from ANOVA answers the question?

A

Are there any differences in mean among the groups studied?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Why use ANOVA?

A

It is similar to the t-test in that it compares means

ANOVA is more suited for a large number of groups:

–Robust design
–Increases statistical power in comparison to the t-test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The kind of questions that can be answered using ANOVA?

A
  1. Is there a difference in the mean blood pressure for three different types of smokers (ex-smoker, current smoker, passive smoker)
  2. Study that compares student’s test performance following various teaching techniques (online, tutorials, lecture-based)
  3. Is there a difference between the average number of times articles are shared on social
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is Linear regression?

A
  • Assumes a linear relationship between one or many variables and an outcome
  • Looks at the extent to which a change on one (or many) variables (X) are uniquely associated with an outcome (Y)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Linear regression used for?

A

Estimating the relationship between two continuous variables

Continuous outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the simple linear regression equation?

A

y= mx + c

y= outcome or dependent variable 
c=  intercept of line at the y axis
m= slope (how much y changes with every unit increase in x)
x=  independent variable
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Explain the application of simple linear regression

Example: The linear relationship between the age of a driver and the maximum distance at which a highway sign was legible

A

Regression equation: y= a+bx

y= distance (dependent variable)
a= interval of line at the y axis 
b= slope (how much y changes with every unit increase in x) 
x= age (independent variable)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the difference correlation and simple linear regression?

A

Simple linear regression is similar to correlation in that the purpose is to measure to what extent there is a linear relationship between two variables.

  • The major difference between the two is that:
  • Correlation: provides information on the direction and strength of the relationship
  • Linear regression: allows us “predict” the value of the dependent variable based upon the values of one or more independent variables.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Simple linear regression?

A

Simple linear regression is a statistical method to generate an equation to summarize and study relationships between two (quantitative) variables, where one variable has an impact on the other…..

X: is the predictor or independent variable.
•Y: is the response or dependent variable.
•Simple linear regression concerns the study of only one independent (predictor) variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Multiple linear regression?

A

Multiple linear regression is a regression model that contains more than one regressor variable.

Multiple linear regression is a statistical method to generate an equation to summarize and study relationships between multiple (quantitative) variables, where each variable has a unique impact on the outcome

•More commonly used than simple linear regression

y=𝛽_0 + 𝛽_1 𝑥_1+ 𝛽_2 𝑥_2+ 𝛽_3 𝑥_3…..

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is Logistic regression?

A

Similar to linear regression, but…

Outcome is categorical or binary/dichotomous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Survival analyses?

A

Survival analyses is used when the dependent variable is time to some event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the types of event in a survival analysis?

A

EVENT could mean death, recovery, relapse, reoffending etc

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What do Survival analysis account for?

A

time to event

Loss to follow-up

Differences in follow up time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Example of survival analysis?

A

Cohort studies are time-to-event studies and are analysed in the frame work of survival analysis

17
Q

What is censoring?

A

Refers to when an individual cannot be observed for the full time to the event

18
Q

How can censoring occur?

A
  • Termination of the study
  • Death due to another cause
  • Loss to follow-up
19
Q

Why use survival analysis?

A

The goal of survival analysis is to provide information on the proportion of the population with the event at any given time.

20
Q

How are survival analysis presented?

A
  • Survival curves
    Proportion of intervals alive (without event)
  • Hazard ratios
    Similar to odds ratio
    Takes any value from 0 to 1