Descriptive Analysis Exam 1 Flashcards

1
Q

Frequency tables

A
  • Used to present counts and frequencies of discrete or continuous data at any level of measurement
  • Must include a clear title, descriptive column names, adequate space and row/column delineation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Box and whisker plots

A
  • Used to visualize the range of spread of data

* Only one axis is mathematically meaningful

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Bar charts

A
  • Used to graph discrete (interval, ratio) / categorical (nominal, ordinal) data
  • Ideal for visualizing frequencies and distributions but not ideal for representing trends over time
  • Must include title and labeled axes with well-defined units of measurement
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Histograms

A
  • Similar to bar charts but used to graph numeric data that have been apportioned into discrete categories
  • May be used to illustrate changes in a variable over time
  • Must include a title and labeled axis with well-defined units of measurement
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Frequency polygons

A
  • A special kind of line graph in which we place a dot above the midpoint of each class interval and connect the dots
  • Being and end with lines touching x-axis
  • Can use two sets of data and represent each data with a separate color
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Pie charts

A
  • Used to represent proportions or relative quantities of values
  • Limit amount of represented categories
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Proportion

A
  • The number of observations with a given characteristic divided by the total number of observations in a given group
  • “Parts” divided by the “whole” using the same unit of measurement
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Rate

A
  • Computed over a specific time-period (per unit of time, for example, per year)
  • Typically use a multiplier (for example 1,000, 10,000 or 100,000), which is called the base
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Prevalence

A
  • Measures the probability of having a disease at a point in time
  • Reflects existing disease within a population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Incidence

A
  • Measures the number of new cases of a disease (or symptom or problem) that develop in a population at risk within a given period of time
  • A.K.A. Cumulative Incidence
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Sensitivity

A
  • Testing for true positives
  • Ability of a diagnostic test to correctly identify individuals with disease
  • Proportion of individuals with the disease who are correctly identified by the test
  • Have low false negative rates
  • True positive + false negative = 1
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Specificity

A
  • Testing for true negatives
  • Ability of a diagnostic test to correctly identify individuals without disease
  • Proportion of individuals without the disease who are correctly identified by the test as disease-free
  • Have low false positive rates
  • True negative + false positive = 1
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Positive predictive value

A
  • Provides information about how likely it is that the individual does, or does not, have the disease given his/her test result
  • Probability that a patient has the disease given that a positive test result was obtained
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Negative predictive value

A
  • Provides information about how likely it is that the individual does, or does not, have the disease given his/her test result
  • Probability that a patient does not have the disease given that a negative test result was obtained
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q
  1. Describe and interpret receiver operating characteristic (ROC) curves
A
  • ROC curves are used to illustrate the trade-offs between sensitivity (i.e., true positive rate) and the false positive rate
  • Each point on the ROC curve represents the sensitivity and false positive rate at a different decision threshold
  • Any ROC curve that lies above the chance diagonal has some diagnostic ability
  • The area under the chance diagonal is 0.5; the area under the ROC curve for a perfect diagnostic test is equal to 1
  • Goal = larger AUC = value of area under closest to 1
How well did you know this?
1
Not at all
2
3
4
5
Perfectly