Data and Statistics Flashcards

1
Q

Define data

A

Data
“Raw” numbers
Counts of individual events or services
Collected at local, state, national, or international level

Data Sets
Data collected and arranged logically according to definite criteria
May/may not represent entire population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define statistics

A

Statistics

  • Data already analyzed and summarized
  • Information presented as text, figures, graphs, or maps
  • Not always available freely or publicly on the Web
  • Most data on Websites is compiled statistics
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are some key features of health statistics?

A
  • Population based
  • Measure a wide range of health indicators for a population
  • Entire U.S., state, county, city, zip code
  • Often collected and analyzed over a period of time
  • Include different types of data
  • Vital (birth, death, marriage, divorce)
  • Morbidity & mortality
  • Use and cost of health care
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are some uses of health statistics?

A
  • Provide key indicators about life and health in a particular region.
  • Gauge disparities
  • Measure progress
  • Disease occurrence and potential
  • Identify prevention targets
  • Help with public health program planning and evaluation.
  • Monitor progress
  • Measure health care costs
  • Mobilize activities
  • Plan for resource allocation
  • Used in creation of health policy and legislation.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are some ways that you can assess the quality of data?

A
  • Nature (source) of the data
  • Availability of the data
  • Validity and reliability of measures
  • Completeness of population coverage
  • Strengths and limitations of study design
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Data will break your heart!

A
  • Statistics are collected to meet the needs of the collector!
  • Studies replicate previous findings
  • Collected data is imperfect… but we still act on it
  • It is collected by someone with a bias/incentive to lie
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define reliability

A
  • Reliable = consistent
  • Overall consistency of a measure
  • A reliable measure is one that is relatively free from measurement error
  • A scale is reliable if it yields consistent results over repeated applications in a short time frame
  • Same survey given at two times of the day that gives different results (low correlation) for the same participants is not reliable
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define validity

A
  • Validity = accuracy
  • Scale is valid if it measures what it intends to measure without systematic error
  • Is health status truly captured by the measure
  • Usually involves comparing two different measures of the same phenomenon (e.g., self- rating scales versus physician’s assessment)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are three things that goes into the completeness of data?

A

Representativeness
—Degree to which a sample resembles a parent population

Generalizability (external validity)
—Ability to apply findings to a population that did not participate in the study

Thoroughness
—Care taken to identify all cases of a given disease

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How do you find data?

A
  1. Formulate the question
  2. Choose the best resource for the question
  3. Evaluate the results
  4. Repeat as often as necessary
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

List some sources of health data

A
  • Statistics from vital registration system
  • Reportable disease statistics
  • Insurance data
  • Clinical data sources
  • School health programs
  • Reports from health organizations (e.g., CDC, WHO), advocacy groups
  • Economic data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the four types of data?

A
  1. Nominal
  2. Ordinal
  3. Interval
  4. Ratio
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Define nominal data

A

a measurement scale consisting of qualitative categories whose values have no inherent statistical order or rank (e.g., categories of race/ethnicity, religion, or country of birth).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Define ordinal data

A

a measurement scale consisting of qualitative categories whose values have a distinct order but no numerical distance between their possible values (e.g., stage of cancer, I, II, III, or IV).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Define interval

A

a measurement scale consisting of quantitative categories whose values are measured on a scale of equally spaced units, but without a true zero point (e.g., date of birth).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Define ratio

A

a measurement scale consisting of quantitative categories whose values are intervals with a true zero point (e.g., height in centimeters or duration of illness).

17
Q

Define normal distribution

A

a measurement scale consisting of quantitative categories whose values are intervals with a true zero point (e.g., height in centimeters or duration of illness).

18
Q

What are skewed distributions?

A

Observations “clustered” at one end of the scale
The presence of outliers
CHECK PIC IN NOTES

19
Q

What are paranormal distributions

A

Distributions that don’t exist

20
Q

Define confidence intervals

A

Goal: capture the true value (e.g., the true mean) most of the time.
A 95% confidence interval should include the value about 95% of the time.
A 99% confidence interval should include the true value about 99% of the time.

21
Q

Define p-value

A

P-value is the probability that we would have seen our data (or something more unexpected) just by chance

P-values of

22
Q

Define the measure of central tendency

A

Mean: the average
Median: the middle value
Mode: the most frequent value

23
Q

What are measures of assocaition?

A

Strength of the association between two variables, such as an exposure & an outcome (disease)
Two measures of association used most often are the risk ratio (RR) & the odds ratio (OR)
Interpretation of RR and OR:
RR or OR = 1: exposure has no association with disease
RR or OR > 1: exposure may be positively associated with disease
RR or OR < 1: exposure may be negatively associated with disease

24
Q

Define the risk ratio

A
  • Used when comparing outcomes of those who were exposed to something to those who were not exposed
  • Calculated in cohort studies
  • Cannot be calculated in case-control studies because the entire population at risk is not included in the study
25
Q

Define the odds ratio

A
  • Used in case-control studies
  • Odds of exposure among cases divided by odds of exposure among controls
  • Provides a rough estimate of the risk ratio
26
Q

Look at the 2x2 table

A

http://hihg.med.miami.edu/code/http/modules/education/Design/images/Slide405042.jpg