Statistics Flashcards

Question

Nominal variables (non parametric) besides chi square

Answer 1

Fishers exact - when sample is <20 or expected 2x2 cells is less than 5 Mcnemar - similar to chi sq but for paired or matched data Mantel-haenszel - to see if one factor is influencing the results - uses separate contingent tables

Answer 2

Mann Whitney- non para equiv to student t -- no paired groups Sign test - matched or paired data - tells whether pos or neg difference Wilcoxon signed rank test - determines magnitude of diff and rank order of differences

Answer 3

Kruskal wallis one way Anova - data not matche or paired Friedman two way anova - data are paired or matched

Answer 4

Pearson corr coeff- for parametric , ranges from -1 to 0 to 1 Spearman rank corr coeff - ranks the strength of correlation

Answer 5

Linear regression - continuous variables (parametric) Logistic regression - ordinal or nominal data - non parametric

Answer 6

Censoring - takes into acct that some subjects leave study for different reasons and can enter study at different time points Actuarial method - counts number subjects who reach a certain point - ex- pt who dies at 5 months 29 days isn't included in the 6 month analysis Kaplan Meier - measures time to endpoint - produces life table and survival survey Cox hazards proportional - allows researcher to adjust for differences in study groups (age, comorbidities) - produces hazard ratio and CI

Answer 7

Number of new cases that occur in a popn in a specified time (number of new cases can trend over time)

Answer 8

Number of cases in the population who HAVE disease in a specific time frame

Answer 9

Dz + Dz- Rf + A B A+B Rf- C D C+D

Answer 10

Actual or true risk Used in prospective and ecperdnral studies RR = (A/A+B) / (C/C+D) Ex: prospective cohort study to evaluate subj taking antipsychotics and development of dm - take subj with and without antipsychotic use and calculate RR to see if dm associated with antipsychotic use

Answer 11

Estimates Relative risk Used in case control and cross sectional studies OR =( A/C) / (B/D) = A*D/B*C Study subjects are selected on basis of disease status so it is not possible to calculate te rate of development of the disease given presence or absence of exposure - thus, OR used to approximate RR or estimate risk

Answer 12

Both used to determine magnitude of association between exposure to risk factor and disease Same scale - >1 means correlates with association with development of dz, < 1 means protection, and =1 means no association If 95%CI includes 1 => not stat sig

Answer 13

Estimates % of risk that is reduced by result of the intervention = 1-RR OVERestimates true risk because divided by proportion of control group outcome rate. So often the benefit of a treatment is Overstated!

Answer 14

Rate in intervention group minus rate in control group

Answer 15

Dz + Dz - Test + TP FP Test - FB TN

Answer 16

Probability that a true pos test occurs in an individual pos for dz Sensitivity = TP/(TP+FN) *100 = true pos / people who have disease Highly sensitive test rules out dz SNOUT

Answer 17

Probability that a true negative test result occurs - neg test in neg pt Specificity = TN/(FP+TN) = people who test negative / people without the disease Spin - highly specific test rules in (confirm) disease

Answer 18

PPV = TP/(TP+FP) *100 PPV = proportion of individuals who have diseas when test is positive = likelihood a person with pos text has disease

Answer 19

NPV = TN/((FN + TN) Proportion of disease free persons who test negative - likelihood that a person with negative test doesn't have disease

Answer 20

Analytical observational study - retrospective - Use in new diseases or outbreaks Measure of association = odds ratio

Answer 21

Analytical observational study Strongest observational study design Usually prospective - relative risk is measure of association

Answer 22

Aka prevalence study Descriptive obs study - user to gather info on risk factors and outcomes of interest - generate hypotheses

Answer 23

Descriptive obs study Generate a case dfn Determine adv effects , generate new info

Answer 24

Effect size = d = cohens d (mean experimental grp - mean control group) / st dev Interpretation - tells us how many st dev of difference between exp and control. Eg if d=0.25, means that there is a quarter sd difference 0. 2 = small effect size 0. 5 medium 0. 8 large

Answer 25

Analytic - case control and cohort - involve more comprehensive data Descriptive - cross sectional an case report/series - compare disease frequency in populations, generate hypotheses

Answer 26

Does the study measure what it was designed to measure? Does it address biases, confounders? ** if you do not have internal validity, you won't have external validity

Answer 27

Assumes internal validity (measures what intended, addresses biases confounders and outcomes) - external validity means outcomes can be generalized to other groups or patients, including your clinic population

Answer 28

Selection of study participants - includes sampling bias - researcher chooses study participants based on convenience rather than representativenes Detection bias- individuals who have risk factors - leads to more medical encounters - increase probability dz is identified Admission rate bias (Berkson's) - specific to using case and controls inpatients - exposure and disease being studied leads to higher exposure rate among hospital cases than controls. Example: OCuse lead to DVT- higher referral rate to hospitals Response bias - individuals who participate are different than those who decline to participate

Answer 29

- Define study cases in a detailed and objective manner | - enroll a representative study sample in the study

Answer 30

In accuracy in collecting data Recall bias- different memory of past events - people w disease recall more detail than healthy people - case control and retrospective cohort are most vulnerable Interviewer bias - differences in obtaining info from subjects

Answer 31

- Confirm pt response through medical records | - use a control group w disease other than that being studied

Answer 32

Detailed training of interviewers Directions to study staff conducting interviews and surveys Supervision of data collection process

Answer 33

- study participants lost to follow op - prospective study most vulnerable - difficult to minimize but assess reasons for loss

Answer 34

Inaccuracy in measurement or placement of study participants - mismeasurement, or if someone was thought to have disease on study entrance but does not Sources of miss classification bias: - variation among study observers and instruments - variation in underlying characteristics - misunderstanding of questions by study subjects (interview or questionnaire) - incomplete medical record data

Answer 35

One treatment that pts adhere to better than another

Answer 36

Proper study design Conduct of study - selection of pts, procedures, supervision and training statistical analysis: - difficult to accomplish because no stat test can correct for bias or fix study flaw - using appropriate stat procedures for data analysis can help with bias

Answer 37

- falsely conclude that a rf is associated with a disease without adjusting for rf that are either known or unknown 1) confounders can influence study results have the potential to influence study results 2) researchers may not account for these, or even be aware of their existence!

Answer 38

1) randomization - ensures confounders are evenly distributed - not done in epidemiology studies like case control, retro, and cohort 2) restriction - Restric admission to study to certain category of confounders - matching - equal representation of subjects with certain confounders among study groups - over matching - strong association between variable and variable of interest that decreases ability to find a result. Do not match based on factors affected by disease or exposure eg signs and sx because this decreases ability to find a result 3) analysis - stratification - data are split into non-overlapping groups called strata where a specific factor is contained in separate strata to see if each may contribute to effect - multivariate regression analysis - can control for a number of confounders at same time without losing power -

Answer 39

``` Strength of association Reproducibility - different populations different times Temporal sequence - has to happen before Biological plausibility Dose response relationship - can be, but not necessarily Coherence of relationship ``` - strength of association A) stronger the association, the less likely it is due to chance alone B) but, just because the magnitude is low doesn't mean there is no cause and effect

Answer 40

``` RCT- strongest design for cause effect and differences in tx effect Cohort Case control Case series Case report - weakest causality ```

Answer 41

Case control, cohort, cross sectional - appropriate for studying natural history of disease, accuracy of dx test, or public health policy - program planning etc

Answer 42

- is it an answerable question? - sufficiently narrow and objective? - use SMART criteria - biological, temporal, and time frame plausibility

Answer 43

It measures subjective - psych sx, but validation and standardization minimizes variability

Answer 44

Primary -'measured directly by researcher for purpose of ongoing study - rct, cohort, case control, cross sectional Secondary- from databases or pt medical records. Data is already collected, researcher gains permission to access for study (retrospective cohort, case control and cross sectional) - advantage: not as costly and doesn't take as much time to acquire data - disadvantage: missing data can impact accuracy of results and data may be miscoded eg ICD codes done wrong

Answer 45

Obs studies Case control study - Odds ratio Cohort study - relative risk w CI

Answer 46

Generating hypotheses

Answer 47

US preventive service task force level 1: evidence obtained from at least one well designed RCT level 2-1: well designed controlled trials without randomization Level 2-2: well designed cohort or case control trial, pref from >1 center Level 2-3: evidence obtained with multiple time series with or without intervention Level 3: opinion of experts, case reports or series

Answer 48

Efficacy is narrow term used to describe outcomes in studies | Effectiveness is broad and defines a real world outcome

Answer 49

- usual presented graphically, without confidence intervals | - doesnt explain the impact of drop outs on power in studies

Answer 50

One factor: | - would exclusion criteria for the study exclude pts in our practice?

Answer 51

Incidence in group a divided by incidence in group b

Answer 52

Estimates relative risk in retrospective studies

Answer 53

When the incidence is >10% the odds ratio overestimates risk - over 10 over estates When incidence is s a decent estimate

Answer 54

Exposed cases / unexposed cases Divided by Unexposed cases/ unexposed non-cases

Answer 55

``` For prospective study RR= A/(a+b) Divided by C/(c+d) ```

Answer 56

Minimized by blinding esp double blinding

Answer 57

Can occur in experimental studies when patients are randomized

Answer 58

Occurs in observational studies where must rely on existing sources of information

Answer 59

Occurs when inaccuracies or measurement or placement of study patients in specific groups. Most vulnerable: case control and retrospective studies

Answer 60

Mann Whitney U test Used she a comparison is being made with 2 non paired groups which don't have to be equal size - non paired means subjects don't have to participate in all treatment (don't have to serve as own controls)

Answer 61

Chi square for large sample Chicago large Fishers exact for sample size less than 20

Statistics Flashcards

(85 cards)