Exam 1 Flashcards
Sensitivity
Probability of a positive test for individuals with a given disease
Specificity
Probability of a negative test for individuals without a given disease. Is intrinsic property of test and does not depend on prevalence. Is intrinsic property of test and does not depend on prevalence.
Cut off point
The threshold for where you decide a screening test is positive. Defines the disease. Yields different specificity and sensitivity results depending where you put it.
Yes/no (has or does not have disease) is binary but most variables like blood sugar are continuous.
Setting the cut off point will treat the lower scores as normal.
Event
Measurable outcome or case
Sample vs population
Sample average is with x bar and sample standard deviation is āsā. Pop average is mu and pop standard deviation is sigma.
Negative predictive value
Proportion of screening tests that are true negatives. Describes the performance of a screening test. Is not inherent to the test because it also depends on prevalence. Can be derived using bayes theorem. (# true negatives / all negatives)
Positive predictive value
Proportion of screening tests that are true positives. Describes the performance of a screening test. Is not inherent to the test because it also depends on prevalence. Can be derived using bayes theorem. =# true positives / all positives.
Lots of false positives in large low prevalence population unless specificity is extremely good.
Median
50th percentile. Survives outliers. Robust measure. Comparing median to mean can show skewness
IQR
Inter quartile range 25th-75th percentile. Calculate by finding the difference between the middle point between the lowest data point and the median and the highest data point and the median
Percentile
You can also index by multiplying the number of points in the set by the percentile (15 scores*.9=13.5 which means the 90th percentile would be the 14th highest score)
On excel: =percentile (A2:A19, .9)
Variance
Measures how far a set of numbers is spread out. Always non negative. Smaller means the data points tend to be very close to the mean (expected value)
Standard deviation
34/68(1)
95/47.5(2)
99.7(3)
Measures spread and summarizes data
Skewness
A tail to the left or right of data. Left slew indicates something that tends to occur later or higher, for example mortality rate. Right skew indicates something that tends to occur earlier or lower, like income.
Boxplot
Summarizes data around mean. Whiskers extend beyond 1st and 3rd percentile. Outliers indicated by dots.
Histogram
Data collected in bins. All data points should fit in a bin. Artificial categories or ranges are created.