Binary data Flashcards

1
Q

Binary data

A

Categorical usually only two answers ie yes/no, o patients possess attributes or not

NB continuous data can be crudely mapped as binary.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Ordinal variable

A

If large enough with multiple categories can be analysed as continuous variable but require specialist stats. Clear ordering ie tumor stage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

π

A

Proportion of population possessing the attribute hence proportion not possessing attribute = 1-π

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Estimating π

A

π = r/n Analogous to u

r = possession of attribute
n = population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Why new methods for categorical data

A

Continuous data varies by o around a mean u. Any value of u can vary by a range described by o. In categorical data there is only one parameter.

SE error is used. Once π has been estimated there is no need for further estimate of the spread of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

SE continuous variables

A

SE = σ/√n

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

SE binary variables

A

SE = √[π(1-π)/n]. Still infers that the larger the sample the smaller the error. This can give CI

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

χ2 test

A

Binary equivalent of the unpaired t-test ie A=B

Must be performed on independent counts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What needs to be calculated in χ2 test

A

Margins = these are subtotals of the categories. Both sets sum to the total in bottom right corner

Always draw 2x2 table

Expected values = always should add up to the margins despite sometimes not being whole numbers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

K

A

Estimate of the proportion similar to both groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

χ2 test

A

Sum of ((O-E)/E)2

Ie difference between observed - expected/ expected squared

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

P value from χ2 test

A

Asymptomatic approximation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When is Fischers exact test used

A

When expected values <5

NB For larger tables the rule is that if over 20% of the
cells have expected values below 5, or any cells with expected values below 1 then the χ2 method may be unreliable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Fischers exact test

A

Calculates the p-value directly from data all possibilities for analysis are enumerated in tables. Margins ie p values from all possibles should add up to 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Pitfalls for χ2 test

A

Tables must compare raw counts not proportions/percentages as they don’t give indication of sample size

Independent entities ie school children walking to school. Ensure bottom right number = no independent units

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

P value testing χ2 test

A

Adding up all probabilities that are less than or equal to the probability of the observed table.

17
Q

π

A

Proportion of population possessing attribute always inbetween 0 and 1

P = O/ 1+0

18
Q

Odds

A

Can be above 1.

O = π /(1-π)

19
Q

Absolute difference

A

πa - πb

Null value = 0

20
Q

Relative risk

A

πa/πb

Null value = 1

21
Q

Odds ratio

A

πa/ 1 - πa / πb/ 1 - πb

Null value = 1

22
Q

CI for absolute difference

A

Using estimate for D the E is calculated using
πa(1-πa)/na + πb(1-πb)b all square rooted

Hence CI = D +/- 2SE

23
Q

CI for OR

A

1 = Natural log of OR (logOR)
2 SE = root sum of the reciprocal

Hence CI = logOR +/- 2 SE (root sum reciprocals)

24
Q

Flaw of OR

A

Skewed ie 0-1 represents all values compared to 1-infinite