Exam 2 Flashcards
What is the Null Hypothesis (Ho) and the Alternate Hypothesis
The null hypothesis, denoted by Ho, is usually the hypothesis that sample observations result purely from chance.
The alternative hypothesis, denoted by H1 or Ha, is the hypothesis that sample observations are influenced by some non-random cause.
Null Hypothesis: chance
Alternate hypothesis: not chance
What is a p-value
What does a p-value LESS than 0.05 mean?
What does a p-value GREATER than 0.05 mean?
P-value helps you determine the significance of your results. It is a number between 0 and 1. Closer to 0 means you REJECT the null hypothesis (in other words, it isn’t random chance, but some intervention influenced the result).
P-values less than 0.05 means it is significant that you can REJECT the null hypothesis. (0.05 means 5% chance or probability)
P-values greater than 0.05 means there is weak evidence against the null hypothesis so you fail to reject the null hypothesis.
In the majority of analyses, an alpha of 0.05 is used as the cutoff for significance. If the p-value is less than 0.05, we reject the null hypothesis that there’s no difference between the means, and so we conclude that a significant difference does exist.
If a measurement is less precise, how does that effect the association between 2 variables?
The less precise a measurement is, the lower the association is between two variables.
T or F: If a measurement is reliable and fairly precise, a strong association may be found.
True
*** What is the fundamental statistic for the analysis of INTERVAL and RATIO data to estimate the strength of the association between variables.
And what is it?
Regression Analysis
RAtio = RA = Regression Analysis = Pearson’s R
Regression analysis is used to understand which among the independent variables are related to the dependent variable, and to explore the forms of these relationships.
How does regression analysis look?
Graph with dots in relation together going up a line … graph relationship between independent and dependent variables. Or how the dependent variable changes when the independent variable is changed.
______ ______ regression is the simplest form of regression.
What is it?
Simple linear
Study and plot relationship between 2 variables (independent and dependent). X influences Y
What is pearson’s r or r value?
Designation of measuring the strength of the association between pairs of data that are interval or ratio
RA = Ratio = Regression Analysis = Pearson’s R
What is ANOVA
Analysis of variance, ANOVA: a statistical method for making comparisons between two or more MEANS; a statistical method that yields values that can be tested to determine whether a significant relation exists between variables.
What is a t-test
T-test: is simplest type/form of ANOVA … is an analysis of two populations means through the use of statistical examination
Difference between ANOVA and t-test
Example questions:
“Would you use a T-test or an ANOVA to compare 2 groups?”
“What about if you compare a group of patients w/ stroke vs. a group of patients w/ MS? Independent or dependent T-test?”
“What about if you were comparing a group of children, pre and post intervention? You are using the same group.”
A hypothesis test that is used to compare the means of TWO populations is called t-test.
A statistical technique that is used to compare the means of MORE THAN TWO populations is known as Analysis of Variance or ANOVA
T-Test: A simple ANOVA where you have 2 groups you are comparing. A control group against a select group with a condition.
Example questions:
- T-test
- Independent T-test
- Dependent T-test
What is multiple regression
Plot the relationship between several independent or predictor variables and a dependent or criterion variable.
X1 and X2 and X3 and X4 —–> all lead to Y
T or F: simple linear regression is r, and multiple regression is r^2
True
Multiple regression:
R2 values (0 to 1)
PPMC (Pearson’s r):
r values (-1 to 1)
On exam, she will have two variables with data, and you need to know whether they have low correlation or high correlation:
.00 to .25 or .00 to -0.25 .26 to .49 or –.26 to –.49 .50 to .69 or –.50 to –.69 .70 to .89 or –.70 to –.89 .90 to 1.0 or –.90 to –1.0
Little to no correlation Low correlation Moderate correlation High correlation Very high correlation
So if you see r = 0.62 and p = 0.03 … how do you interpret?
It is a moderate correlation, and less than 0.05 for p-value so we can reject null hypothesis since there is evidence that it is not chance.
*** T or F: Variables that are associated do not imply cause and effect.
Give example:
True! Just because there is an association between variables, doesn’t mean there is a cause and effect relationship. (Although measures of association cannot confirm cause and effect, these analyses can build useful predictive models).
Example: So does your socioeconomic status effect BMI? They may be associated, but they may not necessarily cause / influence each other.
The analysis used most often for ORDINAL DATA is a:
Spearman rank order correlation
Difference between Pearson’s and Spearmans
The Pearson correlation evaluates the linear relationship between two continuous variables. … The Spearman correlation coefficient is based on the ranked values for each variable rather than the raw data. Spearman correlation is often used to evaluate relationships involving ordinal variables.
T or F: Spearman r and Cramer’s V are nonparametric statistics
True
What is difference between parametric and nonparametric tests
Parametric:
- assumed NORMAL distributions
- interval and ratio data
- statistics based on assumptions about the population
- Have more statistical power
Nonparametric:
- Uneven distribution
- Ordinal, ranked, nominal data
- LESS powerful (use less info in calculation)
*** If your measurement scale is nominal or ordinal, would you use parametric or nonparametric?
If it is interval or ratio?
Nominal or Ordinal = Non-parametric statistics
Interval or ratio = Parametric statistics
** If you have this type of data below, list the type of test you would use and the statistical procedure:
- Interval or Ratio
- Ordinal
- Nominal
- Interval or Ratio: Pearsons r, Parametric
- Ordinal: Spearman r, Nonparametric
- Nominal: Cramers V, Nonparametric
SPEARMAN: rank ordinary gums
CRAMERS: nominal
Difference between sensitivity and specificity?
Which one has the condition?
Test SENSITIVITY is the ability of a test to correctly identify those with the disease (true positive rate). A
Test SPECIFICITY is the ability of the test to correctly identify those without the disease (true negative rate). D
Sensitivity = HAVING THE CONDITION. The number of people that are correctly diagnosed by exam.
Specificity = NOT HAVING THE CONDITION. The number of people correctly classified as not having the condition
Remember if you have the 4 box table, A is top left and is TRUE positive. D box is bottom right, and is TRUE negative. So boxes are …
ab
cd
What is A and D?
How do you calculate:
- Sensitivity
- Specificity
- PPV
- NPV
A = sensitivity (have condition) D = specificity (without condition)
- Sensitivity: a / a+c
- Specificity: d / d+b
- PPV: a / a+b
- NPV d / d+c
In other words:
ab a / a+b=PPV
cd d / d+c=NPV
SS
Sensitivity: a / a+c
Specificity: d / d+b
How do you calculate the +LR and -LR
+LR: sensitivity / (1-specificity)
-LR: (1-sensitivity) / specificity
What is SNout and SPin
SNout = Sensitivity … rules OUT a condition (if test is neg)
SPin = Specificity … rules IN a condition (if test is positive)
When sensitivity is High (close to 1), we can confidently RULE OUT the condition when the clinical test is negative.
When specificity is High (close to 1), we can confidently RULE IN the condition when the clinical test is Positive
Gold standard is what?
Xray or MRI
What are the 2 main criteria to know how VALID a test is
Sensitivity and Specificity
If you get a 0.88 for sensitivity and a 0.90 for specificity, is that a good test?
Yes
T or F: Knowledge of the LRs influences the level of certainty that a condition does or does not exist at the end of the examination.
True
A large +LR Rules IN or OUT a disorder?
A large –LR Rules IN or OUT a disorder
IN (so higher the positive number, more likely the disease/condition is there)
OUT (so more negative the number, more likely the disease/condition is not there)
Remember how to grade area under the ROC curve
.90-1.0 = Excellent .80-.90 = good .70-.80 = fair .60-.70 = poor .50-.60 = fail
What is cut off on ROC curve
Point where curve turns … a balance between sensitivity and specificity
What is STARD
STAndards for the Reporting of Diagnostic accuracy studies
4 steps of Bahr’s “Framework of Risk Identification and Injury Prevention”
1) Establish the extent of injury or severity (PROSPECTIVE STUDY)
2) Establish cause of sports injury
3) Introduce a preventative measure
4) Reassessing injury and improvement over time, and repeating step 1
Difference between Traumatic and Atraumatic:
Traumatic = contact or extreme accident Atraumatic = osteo bone decay over time that led to ACL injury
The Gold standard for studies of injury risk factors?
What is the other option?
PROSPECTIVE COHORT DESIGN … go forward in time
A large group of participants potentially “at risk” for injury of interest are baseline tested. Participants are followed over time to determine if they go on to suffer the injury.
Other option: Retrospective design (look back over time and compare group that had injury in past with control group going forward).
*** The prevalence of an injury or illness refers to the …
proportion of a sample that has a given injury or illness at a single time point. It is presented as a proportion or a percentage.
*** The incidence of an injury or illness …
… the number of new cases of the pathology in a given period of time.
Bathtub analogy related to prevelance, incidence, mortality, recurrance
Incidence is new amount of people coming in, prevelance is how many patients in study at time, mortality is how many die or leave.