ch 15 Flashcards
Define Data Management
The process of organizing, coding, and cleaning data in a usable format for the purpose of analysis and reporting
What is coded data?
The data is assigned labels so they can be read and processed by a computer.
Outlines the process through which raw data becomes translated for various forms of analysis
Examples
Frequency counts, descriptive statistics, cross tabulations and other statistical procedures
Yes = 1, No = 2
Describe Data Cleaning
Organized into a usable format
Entails checking that the values are valid and consistent aka All values correspond to valid question responses
Example: If possible range of answers for a particular question is 1 to 3 and the frequency distribution identifies some 4s, those response forms with the 4s must be identified and checked to determine if the person completing the instrument made an error or if there was an error made by the person coding the data.
If coder made a mistake it should be corrected
If person completing the instrument made an error it should be treated as no response or missing data
Variable
A construct, characteristic, or attribute that can be measured or observed
Independent Variable
A variable that is manipulated, selected, or measured by the evaluator that causes or exerts some influence on the dependent variable
Independent variables influence dependent variables
Examples:
Exposure to an intervention
Gender, Race, Age
Education or Income
Independent Variable
An outcome variable or end result indicator in an evaluation or study
Examples:
Awareness, Knowledge, Attitudes, Skills, Behaviors
Descriptive Statistics
Data used to organize, summarize, and describe characteristics of a group
Inferential Statistics
Data used to determine relationships and causality in order to make generalizations or inferences about a population based on findings from a sample
Univariate
Analysis of one variable
Example:
What was the average score on the cholesterol knowledge test?
How many participants at the worksite attended the healthy lifestyle presentation?
What percentage of the participants in the corporate fitness program met their target goal?
Bivariate
Analysis of two variables
Example:
Is there a difference in smoking behavior between the individuals in the experimental and control groups after the healthy lifestyle program?
Is peer education or classroom instruction more effective in increasing knowledge about the effects of drug abuse?
Do students attitudes about bicycle helmets differ in rural and urban settings?
Multivariate Data Analysis
Analysis of more than two variables
Example:
Can the risk of heart disease be predicted using smoking, exercise, diet, and heredity?
Can mortality risk among motorcycle riders be predicted from helmet use, time of day, weather conditions, and speed?
Which of the following most accurately predicts successful management of stress among program participants: physical activity, diet, meditation, anger management, yoga, or deep breathing?
Identify the measures of central tendency
Other forms of univariate data analysis.
The mean is the arithmetic average of all the scores.
The median is the midpoint of all the scores, dividing scores ranked by size into equal halves.
The mode is the score that occurs most frequently.
Identify the measures of spread or variation
Refer to how spread out the scores are in the data set.
The range is the difference between the highest and lowest scores.
What is null hypothesis
The hypothesis that holds there is NO difference between two groups, treatments, or interventions
Bivariate Data Analysis
Example:
Might state there is no difference between two groups, men and women, in knowledge about cancer risk factors.
What is alternative hypothesis
The hypothesis that holds there is a difference between groups, treatments, or interventions
Bivariate Data Analysis
Example:
Might state there is a difference between two groups, men and women, in knowledge about cancer risk factors.