Research Exam 2 PPT 1 PPT 2 Flashcards
Standard Error of the mean
Is used to estimate the true mean of the population from which the sample was drawn
CI
a range of values from the sample data that has a given probability of encompasing the true value
95% CI is equal to 1-alpha (type I error)
95% CI for the estimated difference between two groups or within the same group over time does not include 0 and the results are significant at the 0.05 level
CI for ratio rate
the line of no effect is 1 therefore if 1 is included is passed then it will be no statistical significance
CI for differences
the line of no effect is 0 so if the interval and the value cross 0 then it will be no statistical difference but if 0 is not included than is statistical difference
Ho
null hypothesis there is no difference between the two groups been compared
Ha
alternative hypothesis there is difference between the two groups treatment A does not equal treatment B
Reject the null hypothesis
Pvalue<alpha 0.05 and conclude that the alternative hypothesis is true at the 95% Confidence level
Failed to reject Ho hypothesis
Pvalue > 0.05 and conclude that there is not enough evidence to say that the null hypothesis is false at the 95% confidence level
Type 1 error
reject the Ho when Ho is actually true
is at the 0.05 alpha p-value
there is no true difference between the two groups
Type 2 error
failed to reject Ho when Ho is false and there is actually a difference BETA
P-value
probability a number between 0 (it wont happen) and 1 (it will defenitly happen) that describes the frequency of an outcome
the probability of our experimental results take into account the H0
-P-value is probability that our observed results are due to chance
Independent (unpaired) T-Test
data is parametric is continuous is comparing two independent variables
Paired T-test
compares the mean difference of paired matched samples
ANOVA
comparing three different groups is parametric and continous data
so ex three different doses of aspirin with the birth weight
Simple linear regresion
one continous independent variable and one continous dependendent variable
Multiple linear regresion
one continous independent variable and two or more continous dependent variable
You are designing a new alert system at the hospital to investigate the impact of several factors on the risk of corrected QTc prolongation. You want to create a model, assessing several variables, to predict which patients are most likely to experience QTc prolongation after the administration of certain drugs or the presence of certain conditions.
Which statistical technique will be most useful in completing such an analysis?
Regresion
Multiple logistic regression
1 categorical independent
1 categorical dependent
1 confounding variable
Simple logistic regression
for categorical discrete data
1 categorical independent variable
1 categorical dependent variable
non parametric statistic
-Not for estimating parameters
-Does not requires a large sample size
-Is for nominal or ordinal data
-No assumption on data distribution
-Can be used for continous data if the sample size is small
Investigators would like to determine if a new statin decreases the number of elderly subjects who experience a myocardial infarction compared to standard therapy after considering the influence of a number of confounding variables. Which of the tests below is best?
A. Student t-test
B. Multiple linear regression
C. ANOVA
D. Multiple logistic regression
Multiple logistic regression
Parametric statistic
Parametric statistic:
-Requires a large sample size
-For continuous data
-Can make inference of parameters
-Data can be normal or non normal
Chi square
non parametric
nominal data
large sample size
Ho assumes no particular relation between outcome and exposure
Fisher exact
non parametric
nominal data
sample doesnt need to be large
non paired