Statistics Flashcards

1
Q

What are the type of variables

A

Nominal
Ordinal
Interval
Ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

It is a type of variable with no value

A

Nominal (name)- gender, blood type

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

It is a variable with order/superiority but no magnitude difference

A

Ordinal (order)- stage of NEC

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

It is a variable with equal interval but no zero

A

Interval- body temp

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

It is a variable with equal interval with meaningful zero

A

Ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Tests for normal distribution

A

parametric test

  • T-test
  • ANOVA
  • Pearsons correlation

Can only be used on interval and ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Nominal data with only 2 groups

A

dichotomous or binary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

it is a variable that is the outcome

A

Dependent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

it is the variable that is the intervention

A

independent variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Tests for skewed distribution, ordinal and nominal variable

A

non-parameteric
Wilcoxon rank sum test
kruskal-wallis test
Spearmans rank correlation coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

measure of central tendency that is the sum of all observation

A

mean “average”
- can be influenced by outlying value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

measure of central tendency that is the middle value of date

A

Median

  • more appropriate for skewed data
  • commonly used for ordinal data like apgar score
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

measure of central tendency that is most frequently occuring

A

Mode
- commonly used with nominal data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

It is the bell shaped frequency distribution where the mean, median, mode are the same

A

Gaussian distribution/ Normal distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

It is how flat or peaked the curve

A

Kurtosis

  • peaked >0 leptokurtic
  • normal 0 mesokuritc
  • lower and broad <0 platykuritc

Note: all are symmetric

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

In skewed data, basis of terminology

A

It based on the tail

  • right- positive
  • Left- negative
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

It is the measure of dispersion which is the difference between the highest and lowest value

A

Range

  • dependent on sample size
  • influenced by extreme values
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

It is the measure of dispersion which is the difference between the median of the lower half and upper half of the data

A

Interquartile range

  • between 25th and 75th percentile
  • less influenced by extreme value
  • comprises middle 50% of the data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

It is the deviation from the mean

A

Variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

It is the square root of variance, how close a cluster is to the close to the sample mean

A

Standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Meaning of standard deviation if mean is known and has normal distribution

A

1 SD- 68.2% (34.1 %- left or right)
2 SD- 95.4% (47.7%)
3 SD- 99.8% (48.9%)

  • know this- can compute for percentage of the sample is included
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

It is the SD of the error of the sample mean in relation to the true mean of the total population

A

Standard error of the mean

  • how close is the sample mean close to the population mean
  • inc the sample size, SEM decreases
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

it is a hypothesis with one predictor and one outcome

A

simple hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

It is a hypothesis with several predictor variable

A

complex hypothesis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
It is the hypothesis that proposes no difference between groups
null hypothesis
26
It is the hypothesis that proposes an association
alternative hypothesis
27
it is a parametric test to compare 2 groups that are continuous, normal distributed
T-test 1. Paired: subject his own control (before and after) 2. Unpaired: two groups compared
28
Extension of T-test with three or more groups
Analysis of variance (ANOVA)
29
It is the comparison for further exploration of data after significant effect
Post hoc comparison
30
Test used for ordinal data
Wilcoxon rank mann-whitney U
31
Test for categorial data
Chi-squared test Fisher exact test
32
Type of error that rejects the null hypothesis
Type I false positive Reduced by more stringent P (influenced by: sample size, difference of control and expe, less variance)
33
Type of error that fails to reject the null hypothesis
Type II false negative Reduced by increasing sample size, power of the study
34
It is the probability the null hypothesis is true by chance
P-value P value 0.05 means 5% chance the null hypothesis is true to chance alone (5% na swerte lang) or 95% the sample represents different population (the groups are different, talagang magkaiba)
35
true or false lower p-value has a higher strength of association or importance of association
False Remember p-value the null hypothesis is true by chance alone
36
What is bonferroni correction
Interpreting the P value when multiple comparison- need to be more stringent due to higher likelihood of type I error - p value/ number of comparison
37
It is the range of values you expect the actual mean of the true population
Confidence interval
38
It is the probability of including the population mean within the confidence interval
Level of confidence A high level of confidence will widen the range (the lower the confidence, the narrower the range)
39
Absolute risk
the number of subjects who develop the disease among the exposed * a/a+b
40
Absolute risk reduction
the absolute effect of the exposure % outcome of exposure- % outcome from non exposure * a/(a+b)- c/(c+d) Also known as risk difference or attributable risk
41
Table for measure of association
42
Number needed to treat
the reciprocal of absolute risk reduction * 1/(a/(a+b)- c/(c+d)) As the difference in group increases, the lower NNT
43
Relative risk reduction
(control event rate- experiment event rate)/ control rate * (c/c+d)- (a/a+b)/ (c/c+d) If its negative: protective If its positive: harmful
44
It is the probability of the outcome in the exposed vs in unexposed
Relative risk or risk ratio * (a/a+b) / (c/c+d) Interpretation: \> 1 positive association \< 1 negative association
45
Its the improvement in outcomes simply as a result of being involved in astudy
Hawthorne effect
46
How does randomization reduce bias
By creating two groups of individuals that have equal likelihood of having the outcome of interest
47
It is an observational study useful for rare diseases
Case control
48
the probability of rejecting the null hypothesis when the alternative hypothesisis true
Statistical power (1-Type II error rate)
49
It is the probability of rejecting the null hypothesis when it is true
Type I error
50
It is the probability of accepting the null hypothesis when the alternative is true
Type II error
51
Statistical power depends on
1. increasing the significance criteria (p value) 2. increasing the magnitude of effect (difference or change) 3. increasing the sample size
52
Deaths that occur between 22 weeks’ gestation and 7 days of postnatal life
Perinatal mortality
53
deaths in the first 28 days of life
Neonatal mortality
54
Death occuring within the 1st year of life
Infant Mortality
55
It shows the difference in the rate of a condition between individuals with and without a specific exposure
Attributable risk Best study or research design: Cohort study
56
Of the measure of central tendency, which should be equal to show normal distrubution
Mean and median Mean\> median- skewed to the right Mean
57
Cummulative incidence = Incidence rate
period of observation is short disease prevelence is low duration of disease is same for the exposed and non exposed
58
Effect of prevalence in predictive value of a test
* increased: increase PPV, dec NPV * decreased: dec PPV, inc NPV
59
It the ability of a test to correctly identify individuals who have a condition
sensitivity
60
It is the ability of a test to correctly identify individuals who do not have a condition
specificity
61
It presents as the proportion of individuals with a positive test result who have a condition
Positive predictive value - affected by disease prevalence; when a disease is more prevalent, the PPV is higher.
62
It presents the proportion of individuals with a negative test result who do not have the condition
Negative predictive value - affected by disease prevalence; when a disease is more prevalent, the NPV is lower.
63
It is used assess a diagnostic test's accuracy whose results are continuous variables.
Receiver operating characteristic (ROC) curves
64
The best clinical design for evaluating intervention for clinical practice
randominzed controlled trial
65
what decreases bias in RCT
1. randomization 2. blinding
66
It is the statistical method in a systematic review, data from individual studies combined to give a summary of the effectiveness of an intervention with a 95% confidence interval | terms used: estimate relative risk (RR), RR reduction, or odds ratio
Meta-analysis ## Footnote It increases power and precision of estimates of treatment effects and exposure risks
67
* It is the variation among studies in a meta analysis * It is measured using I2 statistic
Heterogenicity ## Footnote Interpretation of heterogenicity: 0% to 40%: might not be important 30% to 60%: may represent moderate heterogeneity 50% to 90%: may represent substantial heterogeneity 75% to 100%: considerable heterogeneity
68
what are the definition of mortality a. fetal death b. infant death c. maternal death
a. fetal- death prior to expulsion or extraction of product of human conception regardless of duration of pregnancy b. live birth- expulsion of product of conception from mom irrespective of durwtion of pregnancy, baby has evidence of life- beating heart, pulsation of the umbilical cord or movement of voluntary muscle c. death of woman while pregnant or within 42 days of pregnancy
69
what is a SMART objective
specific measurable achievable relevant time-bound
70
What are the type of validity
1. internal validity: results of the study are true or are they a result of the way the study was designed or conducted. 2. External validity: generalizability of results to other settings or samples.