Quiz #11 Flashcards
Data processing includes
1) Data Co
2) Data En
3) Data Cl
- Data coding
- Data entry
- Data cleaning
assigning values to the data for statistical analyses
Data coding
What is this?
data are entered into some kind of computer software
for management and analysis
what is this?
Data entry
____ _____ refers to the preliminary “cleaning” of data by fixing
coding errors, mistakes, or discrepancies with variables, coding
schemes, scales, or files in general
What is this?
Data cleaning
This refers to ensuring that responses that could only be answered by certain
participants were not answered by others
- For example, a measure of whether or not someone gave birth to a child
should not be answered by any biological males in the data set (as this is
physically impossible and thus would represent an error in data collection)
What is this?
Hint: Con Cle
Contingency cleaning
Causes fir missing data:
1) D E O/E
2) N
3) M/S D C W
4) A
5) F R
- Data entry oversight/errors
- Nonresponse
- Missing/skipped data collection waves
- Attrition
- Flawed responses
If a measure is missing between -%, listwise delete is acceptable
If a measure is missing between _-__%, imputation methods may be used
If a measure is missing more than __%, the variable should be dropped from the analysis altogether (except in special circumstances)
1-5
5-15
15 +
____ ____ involves modifying an original measure with a new numerical coding scheme
Recoding data
We can always recode a variable ________ to reduce its complexity (make it more simple)
– this means we can always recode in the following direction: Ratio Interval Ordinal
Nominal.
Downward
Broadly speaking, there are 3 types of data analysis
U
B
M
Univariate
Bivariate
Multivariate
Involves examining the characteristics/attributes of one variable at a time (Descriptive
statistics
Univariate
Involves examining the relationship between two variables (both comparative and inferential)
bivariate
Involves examining the relationship between three or more variables (both comparative and
inferential
multivariate
Data analysis can involve _______ and/or ________ approaches
Comparative and Inferential
involves analysis of the attributes of the variables being examined to describe better the relationship between two variables
Comparative or inferential?
Comparative
involves not only describing relationships, but makes predictions and/or inferences about dependent variables based on the influence of independent variables
Comparative or inferential?
Inferential
Broadly speaking, there are 3 types of statistical approaches:
1) Des
2) Comp
3) Infer
Descriptive
Comparative
Inferential
There are four (4) kinds of frequency distributions
A F Dist
R F Dist
C F Dist
C r F Dist
Absolute frequency distributions
Relative frequency distributions
Cumulative frequency distributions
Cumulative relative frequency distribution
Displays data based on the assigned numbers per category
Which of the four (4) frequency distributions is this?
Absolute frequency distributions
displays percentage breakdown of each category as a percentage of the total
Which of the four (4) frequency distributions is this?
Relative frequency distribution
Every subsequent category is added to the previous value, creating a cumulative total
Which of the four (4) frequency distributions is this?
Cumulative frequency distribution
Every subsequent percentage breakdown of each category is added to the previous
percentage breakdown, creating a cumulative percentage total
Which of the four (4) frequency distributions is this?
Cumulative relative frequency distribution
Methods of displaying frequencies:
1) P C
2) B G
3) H
4) P
5) L C
6) M
1) Pie charts
2) Bar graphs
3) Histogram
4) polygons
5) Line charts
6) maps