Research Exam 2 PPT 1 PPT 2 Flashcards

1
Q

Standard Error of the mean

A

Is used to estimate the true mean of the population from which the sample was drawn

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

CI

A

a range of values from the sample data that has a given probability of encompasing the true value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

95% CI is equal to 1-alpha (type I error)

A

95% CI for the estimated difference between two groups or within the same group over time does not include 0 and the results are significant at the 0.05 level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

CI for ratio rate

A

the line of no effect is 1 therefore if 1 is included is passed then it will be no statistical significance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

CI for differences

A

the line of no effect is 0 so if the interval and the value cross 0 then it will be no statistical difference but if 0 is not included than is statistical difference

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Ho

A

null hypothesis there is no difference between the two groups been compared

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Ha

A

alternative hypothesis there is difference between the two groups treatment A does not equal treatment B

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Reject the null hypothesis

A

Pvalue<alpha 0.05 and conclude that the alternative hypothesis is true at the 95% Confidence level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Failed to reject Ho hypothesis

A

Pvalue > 0.05 and conclude that there is not enough evidence to say that the null hypothesis is false at the 95% confidence level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Type 1 error

A

reject the Ho when Ho is actually true
is at the 0.05 alpha p-value
there is no true difference between the two groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Type 2 error

A

failed to reject Ho when Ho is false and there is actually a difference BETA

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

P-value

A

probability a number between 0 (it wont happen) and 1 (it will defenitly happen) that describes the frequency of an outcome
the probability of our experimental results take into account the H0
-P-value is probability that our observed results are due to chance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Independent (unpaired) T-Test

A

data is parametric is continuous is comparing two independent variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Paired T-test

A

compares the mean difference of paired matched samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

ANOVA

A

comparing three different groups is parametric and continous data
so ex three different doses of aspirin with the birth weight

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Simple linear regresion

A

one continous independent variable and one continous dependendent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Multiple linear regresion

A

one continous independent variable and two or more continous dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

You are designing a new alert system at the hospital to investigate the impact of several factors on the risk of corrected QTc prolongation. You want to create a model, assessing several variables, to predict which patients are most likely to experience QTc prolongation after the administration of certain drugs or the presence of certain conditions.

Which statistical technique will be most useful in completing such an analysis?

A

Regresion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Multiple logistic regression

A

1 categorical independent
1 categorical dependent
1 confounding variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Simple logistic regression

A

for categorical discrete data
1 categorical independent variable
1 categorical dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

non parametric statistic

A

-Not for estimating parameters
-Does not requires a large sample size
-Is for nominal or ordinal data
-No assumption on data distribution
-Can be used for continous data if the sample size is small

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Investigators would like to determine if a new statin decreases the number of elderly subjects who experience a myocardial infarction compared to standard therapy after considering the influence of a number of confounding variables. Which of the tests below is best?
A. Student t-test
B. Multiple linear regression
C. ANOVA
D. Multiple logistic regression

A

Multiple logistic regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Parametric statistic

A

Parametric statistic:
-Requires a large sample size
-For continuous data
-Can make inference of parameters
-Data can be normal or non normal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Chi square

A

non parametric
nominal data
large sample size
Ho assumes no particular relation between outcome and exposure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Fisher exact

A

non parametric
nominal data
sample doesnt need to be large
non paired

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Researchers planned a study to evaluate the percent- age of subjects who achieved less than a target blood pressure (less than 140/90 mm Hg) when initiated on two different doses of amlodipine. In the study of 100 subjects, the amlodipine 5-mg group (n=50) versus the amlodipine 10-mg group (n=50) were compared. The investigators used blood pressure goal (i.e., the percentage of subjects who successfully achieved the blood pressure goal at 3 months) as their primary end point. Which one of the following is the most appropriate statistical test to answer the question?
A. Independent samples t-test.

B. Chi squared
C. Wilcoxon signed rank test.

D. Independent t-test.

A

Yes they did achieve/ no they didnt achieve
B chi squared

24
Q

A randomized control trial is comparing drug X and drug Y to measure the efficacy for treating cancer-associated nausea and vomiting. The end point of the trial is based on patients’ ranking their nausea and vomiting 2 hours after taking the drug that was assigned to them. The ranking is based on the following scale:

0=no nausea, 1=mild nausea, 2=moderate nausea, 3=severe nausea, 4=vomiting, 5=severe nausea and severe vomiting

Which of the following statistical tests is the most appropriate to perform to assess the difference in nausea and vomiting severity between the two drugs?

a) Wilcoxson signed rank test

B) Paired student t-test

C) Wilcoxson rank sum test

D) Independent student t-tes

A

C) Wilcoxon rank sum test

25
Q

Kruskal Wallis

A

Similar to one way ANOVA
Compared 3 groups with one dependent variable but is categorical , discrete non parametric, non normal distributed
-Is used to measure the sample means if they are identical
-Requires 5 observations

26
Q

Suppose weights of poplar trees are different based on treatments (none treatment, fertilizer, irrigation, or fertilizer & irrigation). Each weight sample determined by the treatments is independent and random, and each sample size is 5; yet, the weight samples are not normally distributed.

A

Kruskal Wallis

27
Q

Kaplan Meir method

A

Uses the survival time to estimate the proportion of people that survive under the same circumstances in a given length of time uses a table and graph

28
Q

Cox proportional hazards

A

Incidence rate vary over time are commly referred as hazard rates
IR based on follow up time contibuted by each person in the study

29
Q

Population

A

a group of individual with the characteristics of interest of the researcher
All people with characteristics
All people in defined setting

30
Q

Target population

A

population to which one would like to make an inference

31
Q

Source population

A

population to which the study subject are drawn from

32
Q

Sampling frame

A

subset of the population from which to sample from at a specific time frame

33
Q

Study sample

A

participants actually in the study

34
Q

Simple random sampling

A

randomly select individuals from the sampling frame
is useful when the target population is small and the sampling frame is done
-Assign numbers to pts
-Computer randomly select the pt
-Lottery method

35
Q

Systemic sampling

A

Random
-Use specialized intervals from which the subject are drawn from into the sample
-Disadvantage is the natural periodicities in the population
-Allows investigators to predict and manipulate the selection of the sample

36
Q

Stratified random sampling

A

The population is dividided into subgroups and via randomly selection individuals from each subgroup
-Ensures higher precision compared to simple random sampling
-Allows fro more control and precision

37
Q

Cluster sampling

A

is a random sampling of natural grouping individuals in the population
- is based on geographic location
-Can be divided into single stage, double stage or multistage
-Sampling frame may not be possible in a large population

38
Q

researcher is interested in studying the use of melatonin among adults with hypertension in a primary care clinic. The researcher divides the sampling frame into 4 subgroups based on specific age groups (i.e., 30 to 39 years old, 40 to 49 years old, 50 to 59 years old, and 60 years or older). The researcher then randomly samples 100 patients with hypertension from each subgroup. Which of the following best describes this sampling technique?
A) Convenience sampling

B) Quota sampling

C) Stratified random sampling

D) Cluster sampling

A

Stratified random sampling

39
Q

Non random sampling

A

Individuals does not have equal chance of being selected
-Unable to make inferences of the population from the sample

40
Q

Convenience sampling

A

Non random sample
The pt is there is accessible
No time consuming and easy to perform

41
Q

Quota sampling

A

Non random sample but is based guided in specific characteristics like race,sex,economic class\
-Ensures inclution of a particular segmented of the population into the sample
-Researcher set stratas within the sample

42
Q

Consecutive sampling

A

Non random sample
the subject that statisfie the characteristics of the study they wll offer them to it
-Is useful for the prediction of the number of patients from which the sample size has been selected
-concern of seasonal trends

43
Q

Power analysis

A

Using predetermined alpha a, investigators will try to determine and achieve the desired B by choose a sample size to detect the clinically/significally meaningful effect

44
Q

Calculating sample size using power analysis (determining specific formula to use)

A

-Knowledge of how the primary end point is measured (type of data)
-Type of hypothesis test

45
Q

Calculating sample size using power analysis (information input in the formula)

A

1-Measure variability and precision
2-Specification of the magnitude effect
3-Stated level of significance (a)
4-Target level of power (1-b)

46
Q

Power analysis (measure of variance)

A

More precise less variation smaller sample size
More variation larger sample size
Variability represented based of the type of data
Continous data outcome–> population sd
Categorical dichotomous outcome–>depends on the proportion of the control group compared with the treatment group

47
Q

Power analysis (magnitude of effect)

A

Clinically relevant treatment effect that the study should be able to detect
Difference depends on the type of data
Continous outcome–>Difference between the two population means
Dichotomous outcome–>difference between the proportions
-Smaller effect size larger sample size needed to detect the difference
-Researchers should be able to choose effect size based on judgment, experience and expertise
-Need to be both clinically and statistical significant

48
Q

Power analysis (significance level)

A

alpha a should be set priori to the study during the study planning
Conventionally is 0.05
other values is 0.10 and 0.01
As the alpha increases the sample size needs to decrease to detect the difference

49
Q

Power analysis (target level of power)

A

Increase the power will increase the sample size to detect difference
-1-B is detect prior to the study during the planing
-Generally is 0.8-0.9 (90%)

50
Q

Suppose you are an investigator interested in evaluating the difference in glucose levels with a new diabetes medication compared to placebo. Based on past literature, you determine a clinically meaningful difference in the glucose level as 20 mg/dL and an assumed standard deviation of 75 mg/dL. The required total sample size for this study with α of 0.05 and power of 80% is calculated to be 442 patients.
Which of the following changes to the assumptions would increase the required total sample size (assuming the all other assumptions will remain the same)?
A)Reduce standard deviation from 75 mg/dL to 60 mg/dL

B)Decrease power from 80% to 75%

C) Increase alpha from 0.05 to 0.10

D)Reduce effect size from 20 mg/dL to 15 mg/dL

A

D) reduce the effect size

51
Q

Sample size increases

A

1-Smaller effect size
2-Decreasing a
3-Increasing Power
4-Increase variability

52
Q

Sample size decreases

A

1-Larger effect size
2-Increasing a
3-Decreasing power
4-Decreasing variability

53
Q

Association

A

If there is a statistical link between the health outcome in patients with a specific exposure

54
Q

Causation

A

The exposure causes the health outcome
it can be to presence of adverse exposure or absence of preventative exposure

55
Q

Artificial (spurios) association

A

Arises bias in the study
-Biases in method selecting the case/control
Bias in recording the information
Bias in design the study - non representative study group
Bias in the conduct of the study- loss of follow up, observe bias, measurment errors

56
Q

Non causal association

A

The confounding factor causes both but there is no causal relationship between both
Ex cigarette smoking causes increase in drinking coffe and increase risk of CHD mortality

57
Q

Causal association

A

Causality is inferred
-One factor causes the disease
-The removal is shown to decrease the frequency of the disease
-Not all exposure lead to the outcome
-Multiple combination of exposure is likely to lead to the outcome

58
Q

Counterfactual definition of Cause in Pharmacoepidemiology

A

Ideal world: cause and effect of drug on a outcome
-Same pt
-Same time
-Same place
-Conditions are the same and see if they got the disease when they took the drug vs when they didnt

59
Q

Conterfactual Ideal in Pharmacoepidemiology

A

Real World: The fact (exposed group take the drug) vs the Proxy for counterfactual (not take the drug) and evaluate the associations between the drug on the outcome
-The proxy needs to have similar characteristics as the exposed group

60
Q

In pharmacoepidemiological studies, which of the following scenarios best illustrates the counterfactual ideal?
A) Patients taking Medication W are compared to a group of patients with similar characteristics who are also taking the same medication.

B) Patients taking Medication Z are compared to a group of patients with different characteristics who are also taking the same medication.

C) Patients taking Medication X are compared to a group of patients with similar characteristics who are not taking the medication.

D) Patients taking Medication Y are compared to a group of patients with different characteristics who are not taking the medication.

A

C) Pt taking medication X are compared to patient with similar characteristics that are not taking the medication