Data and Statistics Flashcards
Define data
Data
“Raw” numbers
Counts of individual events or services
Collected at local, state, national, or international level
Data Sets
Data collected and arranged logically according to definite criteria
May/may not represent entire population
Define statistics
Statistics
- Data already analyzed and summarized
- Information presented as text, figures, graphs, or maps
- Not always available freely or publicly on the Web
- Most data on Websites is compiled statistics
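The difference between data and statistics can be sketched in a few lines: raw counts are the data, and a summarized rate is the statistic. The county names, counts, and populations below are hypothetical, and `rate_per_100k` is an illustrative helper, not a standard function.

```python
# Hypothetical raw data: counts of individual events per county.
raw_counts = {"County A": 12, "County B": 30, "County C": 8}
# Hypothetical population sizes for the same counties.
populations = {"County A": 40_000, "County B": 150_000, "County C": 10_000}


def rate_per_100k(count, population):
    """Turn a raw count into a rate per 100,000 people."""
    return count * 100_000 / population


# The statistic: data that has been analyzed and summarized.
summary = {
    county: round(rate_per_100k(raw_counts[county], populations[county]), 1)
    for county in raw_counts
}
print(summary)
```

County B has the most events but the lowest rate, which is the kind of comparison a raw count alone cannot support.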
What are some key features of health statistics?
- Population based
- Measure a wide range of health indicators for a population
- Entire U.S., state, county, city, zip code
- Often collected and analyzed over a period of time
- Include different types of data
- Vital (birth, death, marriage, divorce)
- Morbidity & mortality
- Use and cost of health care
What are some uses of health statistics?
- Provide key indicators about life and health in a particular region.
- Gauge disparities
- Measure progress
- Disease occurrence and potential
- Identify prevention targets
- Help with public health program planning and evaluation.
- Monitor progress
- Measure health care costs
- Mobilize activities
- Plan for resource allocation
- Used in creation of health policy and legislation.
What are some ways that you can assess the quality of data?
- Nature (source) of the data
- Availability of the data
- Validity and reliability of measures
- Completeness of population coverage
- Strengths and limitations of study design
Data will break your heart!
- Statistics are collected to meet the needs of the collector!
- Studies replicate previous findings
- Collected data is imperfect… but we still act on it
- It is collected by someone with a bias/incentive to lie
Define reliability
- Reliable = consistent
- Overall consistency of a measure
- A reliable measure is one that is relatively free from measurement error
- A scale is reliable if it yields consistent results over repeated applications in a short time frame
- If the same survey, given to the same participants at two times of the day, gives different results (low correlation), it is not reliable
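As a rough illustration of the survey example, reliability over repeated applications can be checked by correlating the two sets of scores. This is a minimal sketch assuming Pearson correlation as the consistency measure; the scores and the function name `pearson_correlation` are made up for illustration.

```python
from math import sqrt


def pearson_correlation(first, second):
    """Pearson correlation between two administrations of the same survey."""
    if len(first) != len(second):
        raise ValueError("Both administrations need the same participants")
    mean_first = sum(first) / len(first)
    mean_second = sum(second) / len(second)
    cross = sum((a - mean_first) * (b - mean_second) for a, b in zip(first, second))
    spread_first = sum((a - mean_first) ** 2 for a in first)
    spread_second = sum((b - mean_second) ** 2 for b in second)
    return cross / sqrt(spread_first * spread_second)


# Hypothetical scores for the same five participants.
morning = [10, 12, 15, 18, 20]
evening_consistent = [11, 12, 16, 17, 21]    # similar results: reliable
evening_inconsistent = [18, 10, 20, 12, 15]  # scrambled results: not reliable

print(pearson_correlation(morning, evening_consistent))
print(pearson_correlation(morning, evening_inconsistent))
```

The first correlation comes out close to 1 and the second close to 0, matching the "low correlation = not reliable" rule of thumb.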
Define validity
- Validity = accuracy
- Scale is valid if it measures what it intends to measure without systematic error
- Is health status truly captured by the measure?
- Usually involves comparing two different measures of the same phenomenon (e.g., self-rating scales versus physician’s assessment)
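One simple way to picture systematic error is the average gap between a measure and a reference measure of the same thing. The sketch below assumes a hypothetical 1-10 health score and treats the physician's assessment as the reference; both the data and the helper `systematic_error` are illustrative assumptions, not a standard validation method.

```python
from statistics import mean


def systematic_error(measure, reference):
    """Average difference between a measure and a reference measure."""
    if len(measure) != len(reference):
        raise ValueError("Both measures must cover the same participants")
    return mean(m - r for m, r in zip(measure, reference))


# Hypothetical 1-10 health scores for the same five participants.
physician_assessment = [3, 4, 2, 5, 3]
self_rating = [4, 5, 3, 6, 4]  # every participant rates themselves 1 point higher

print(systematic_error(self_rating, physician_assessment))
```

Here the self-ratings track the physician's scores perfectly but sit one point higher, so the measure is consistent (reliable) yet carries a systematic error, which is why reliability alone does not establish validity.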
What are three things that go into the completeness of data?
Representativeness
- Degree to which a sample resembles a parent population
Generalizability (external validity)
- Ability to apply findings to a population that did not participate in the study
Thoroughness
- Care taken to identify all cases of a given disease
How do you find data?
- Formulate the question
- Choose the best resource for the question
- Evaluate the results
- Repeat as often as necessary
List some sources of health data
- Statistics from vital registration system
- Reportable disease statistics
- Insurance data
- Clinical data sources
- School health programs
- Reports from health organizations (e.g., CDC, WHO), advocacy groups
- Economic data
What are the four types of data?
- Nominal
- Ordinal
- Interval
- Ratio
Define nominal data
A measurement scale consisting of qualitative categories whose values have no inherent statistical order or rank (e.g., categories of race/ethnicity, religion, or country of birth).
Define ordinal data
A measurement scale consisting of qualitative categories whose values have a distinct order but no numerical distance between their possible values (e.g., stage of cancer, I, II, III, or IV).
Define interval data
A measurement scale consisting of quantitative categories whose values are measured on a scale of equally spaced units, but without a true zero point (e.g., date of birth).
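The three scales defined above differ in which operations make sense on them. The sketch below uses the same examples as the definitions (country of birth, cancer stage, date of birth) with made-up values. It leans on Python's own behavior as an analogy, not as a formal definition of the scales.

```python
from collections import Counter
from datetime import date

# Nominal: categories with no order, so only counting (and the mode) makes sense.
country_of_birth = ["US", "Mexico", "US", "India", "US"]
most_common_country = Counter(country_of_birth).most_common(1)[0][0]

# Ordinal: categories can be ranked, but the distance between ranks is undefined.
STAGES = ["I", "II", "III", "IV"]
cancer_stages = ["II", "IV", "I", "III", "II"]
ranked = sorted(cancer_stages, key=STAGES.index)
median_stage = ranked[len(ranked) // 2]

# Interval: equally spaced units, so differences are meaningful...
birth_a = date(1990, 1, 1)
birth_b = date(2000, 1, 1)
days_apart = (birth_b - birth_a).days

# ...but with no true zero point, a ratio of two dates has no meaning.
# Python's date type happens to reflect this by refusing the division.
try:
    birth_b / birth_a
    ratio_is_defined = True
except TypeError:
    ratio_is_defined = False

print(most_common_country, median_stage, days_apart, ratio_is_defined)
```

Each step down the list allows more arithmetic: counting for nominal data, ranking for ordinal data, and subtraction for interval data.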