Stats Flashcards

Question

How is the standard error calculated

Answer 1

Standard error = standard deviation/ the square root of the sample size

Answer 2

When the standard deviation and the mean cannot be used due to how skewed the data is

Answer 3

The data has to be normally distributed | The sample size has to be large enough (more than 20 individuals)

Answer 4

``` If the Mean = 18,477 sample size (n) = 12 standard deviation (SD) = 3,732 ``` Standard error = standard deviation/ the square root of the sample size Standard error = 1077.3 To get a 95% confidence interval, use the 1.96 from the standard deviation To get a 99% confidence interval, use the 2.58 from the standard deviation Mean - (1.96 x standard error) = 18,477 - (1.96 x 1077.3) = 16,365 Mean + (1.96 x standard error) = 18,477 + (1.96 x 1077.3) = 20,589 95% confident that the true value of the mean lies between 16,365 and 20,589 If we wanted to get the 99% confidence, we would do mean - (2.58 x standard error) and mean + (2.58 x standard error)

Answer 5

Greater precision that the results from the sample are representative of the populations

Answer 6

The values are less spread so there is less variability in the sample

Answer 7

- how two numeric continuous variable are related | - the strength of an association

Answer 8

Y= a + bx Y is the dependent variable (or called outcome or response) the one we measure e.g blood pressure reading, pain score, hours of sleep X is the independent variable (or called predictor or explanatory) e.g age, deprivation level and family history of illness) A is the y intercept (or called the constant) B is the coefficient - the change in y when we increase x by 1 unit

Answer 9

Linear regression

Answer 10

Logistic regression

Answer 11

We can incorporate additional values (additional predictors) which allows us to account for confounding variables

Answer 12

- they can be used to create productive models - remove the effects of confounding variables - explore how a particular drug influences the outcome

Answer 13

It requires a numeric outcome variable

Answer 14

A multivariable model

Answer 15

It adjusts for confounding variables

Answer 16

Coefficients in a multiple model have taken account of background factors

Answer 17

An increase in the outcome by 1.5 units

Answer 18

Proportion of population with a disease at one point in time Number of cases at a point in time/ total population = prevalence

Answer 19

A rate Rate at which new cases appear in a population at a certain period of time Number of new cases/ at risk population = incidence

Answer 20

-uses routinely collected data - Quick, cheap -units of analysis are populations - groups of people -can examine patterns of ill-health by age, sex, ethnicity, country and/or by time -few ethical issues -useful for generating hypotheses

Answer 21

- no link between individual exposure and effect - bias - variation in diagnostic criteria -absence of records of individual attributes -unsuitable format of records -inconsistency in data presentation

Answer 22

- results used to generate hypotheses -rapid feedback of current events in the community - quick and cheap -few ethical problems

Answer 23

- could just be reporting a medical oddity -prone to bias, e.g. sampling, subject and observer variation - no time reference

Answer 24

-by concentrating effort on the identification of affected individuals and recruiting controls from the unaffected population, the number of subjects required to obtain significant results is kept to a minimum (so good for rare diseases) -results can be obtained relatively quickly because the investigation does not have to wait for the disease to develop (compare this with cohort studies – see later) and can look for multiple causes -it is a relatively inexpensive type of study

Answer 25

-generally rely on retrospective data, which has its own dangers. The ability of individuals to recall past events tends to be unreliable due to a tendency for memory to be selective. Records of past events may be incomplete. -because data are collected retrospectively, it is difficult to say if an association is causal or not. This is less of a problem when the exposure is highly specific or where the time between exposure and disease is short -prone to selection and information biases -there can be difficulties choosing controls -the incidence of disease within a population cannot be calculated from this type of study

Answer 26

-the main advantage is that it is possible to distinguish antecedent causes from concurrent associated factors (cause comes before effect) -since incidence can be determined for both exposed and non- exposed groups, we can determine absolute, relative and attributable risks -we can study more than one outcome to the same exposure -there is less chance of bias since exposure is measured before development of disease

Answer 27

-cannot be certain that exposures are causal- this requires controlled studies -long periods of study, and large populations mean that cohort studies are expensive -follow-up can be a problem- especially if the period of study is long- this needs to be considered in the design of the study -diagnosis of cases may change over the years as medical science becomes more advanced- better at detecting the disease or with different criteria for a diagnosis

Answer 28

-randomization should mean that confounding factors (age, sex etc.) are equally distributed. This helps to concentrate the study on the effect of the intervention -by randomly allocating patients to interventions, it is likely that staff and patients will not break the blinding -statistical tests for significance are easier to interpret when the study design removes confounders -confounders and many biases minimised

Answer 29

-to allow sufficient numbers to balance confounders these tend to be large and expensive trials. They are often multicentre and may even be multinational -there is always a chance that volunteer bias will be a problem: what about people that refuse to be included in the trial or those that are never asked. -there may be ethical difficulties in withholding treatment from the control group or offering what is believed to be an inferior treatment to one group -may lose statistical power if poor compliance

Answer 30

-critical appraisal is the assessment of evidence by systematically reviewing its relevance, validity and results to specific situations -by R Chambers 1998

Answer 31

Parametric tests have rules that need to be followed or assumptions that need to be met Non-parametric tests are used as an alternative - they dont need rules to be followed or assumptions to be met Assumptions include - sample size, normal distribution and linearity in regression

Answer 32

- one sample t-tests - two sample t-tests (also called students t-test) - chi square test - ANOVA test - Pearson correlation coefficient

Answer 33

- one sample Wilcoxon test - Mann-Whitney U test - Fishers Exact test - Kruskal-Wallis ANOVA - Spearman rank correlation coefficient

Answer 34

- CASP | - AXIS has 20 questions and no scoring system

Answer 35

Cystic fibrosis = 1 in 2500 alpha-1-antitrypsin deficiency 1 in 2000 Hereditary Haemorrhagic Telengretasia (HHT) 1 in 4000 Immotile cilia syndrome 1 in 20000

Answer 36

- linkage - positional cloning - sequencing

Answer 37

- sweat test | - gene mutation analysis

Answer 38

- abnormal ion transport across epithelium - salt loss - impaired mucociliary clearance - chronic infections - sterility (infertility) - impaired digestion (meconium ileus) - failure to thrive - liver disease - diabetes

Answer 39

- pancreatic enzyme supplementation - control of infection - suppression of chronic infection - antibiotic nebulisers - bronchodilation - salbutamol nebulisers - anti-inflammatory - azithromycin - diabetes - insulin - vaccinations - flu, pneumococcal

Answer 40

- autosomal recessive - chromosome 14 - 14q32.1

Answer 41

M is the normal phenotype | S and Z are associated with major disease presentation

Answer 42

Due to build up of deformed alpha-1-antitrypsin in the liver - childhood jaundice - early onset cirrhosis Due to the unopposed action of neutrophil elastase in the lungs -early onset emphysema and bronchietasis Highly sensitive to cigarette smoke

Answer 43

Hereditary haemorrhagic talengiectasis (HHT) is also known as Osler-Weber-Rendu diseases (or syndrome) -causes abnormal blood vessel formation in the skin, mucous membranes and in the organs such as the lungs, liver and brain

Answer 44

HHT1 -endoglin gene (ENG) on chromosome 9 HHT2 -ALK-1 HHT3 -chromosome 5 - talengectasia - epitaxis - PAVMs - GI blood loss

Answer 45

- Kartagner's syndrome or primary ciliary dyskinesia | - autosomal recessive

Answer 46

10 variations in dynein arm - infertility - sinusitis - bronchiectasis - situs invertus

Answer 47

- asthma - chronic obstructive pulmonary disease (COPD) - venous thrombosis and pulmonary embolism - Tuberculosis - sarcoidosis (NRAMP) - Obstructive sleep apnoea - infant respiratory distress

Answer 48

- cystic fibrosis - CFTR - alpha-1-antitrypsin - SERPINEA1 - kartagener's syndrome (immotile cilia) - DNA1 - pulmonary veno-occlusive disease - E1FZAK4

Answer 49

-chronic granulomatosis disease CYBB

Answer 50

- hereditary haemorrhagic telangectasis (HHT) - ALK, ENG | - hereditary pulmonary arterial hypertension (HPAH) - BMPR2

Answer 51

-aims to detect early disease in order to alter the course of the disease e.g screening by mammography for breast cancer in order to treat it early

Answer 52

-the proportion of people with the disease who are correctly identified by the screening test True positive/ true positive + false negative = sensitivity

Answer 53

-the proportion of people without the disease are correctly excluded by the screening test True negative/ true negative + false positive = specificity

Answer 54

-the proportion of people with a positive test result who actually have the disease True positive/ true positive + false positive = positive predictive value

Answer 55

-the proportion of people with a negative test result who do not have the disease True negative/false negative + true negative = negative predictive value

Answer 56

True positive + false negative = true positive + false negative + false positive + true negative

Answer 57

- predictive values are dependent on prevalence | - sensitivity and specificity do not affect predictive values

Answer 58

-by randomised controlled trial (individual or clusters)

Answer 59

- selection bias - lead time bias - length time bias (or length bias)

Answer 60

-people who chose to participate in screening programmes may be different from those who do not - may be at more risk - may be at less risk

Answer 61

When screening appears to increase survival time because disease was discovered and diagnosed earlier

Answer 62

An overestimation of survival because long duration cases are more likely to be detected and treated than short duration cases e.g PSA screening more likely to be detected as the tumour is slow growing

Answer 63

- population-based screening programs (national diabetes and hypertension screening like in thailand) - opportunistic screenings (prevention and control of substance abuse) - screening for communicable diseases (heaf test) - pre-employment and occupational medicals (vision test for commercial drivers) - commercially provided screening (screening is a programme not a test)