Lecture 2 - Data processing and coding Flashcards
1
Q
What is data coding?
A
Converting text into numerical form so that it can be analysed
2
Q
What is a quantitative variable?
A
Numeric/continuous
- requires a choice of scale and level of precision
3
Q
What is a qualitative variable?
A
Categorical
- coded using unique integer values
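Example (a minimal sketch, not from the lecture): one way to code a categorical variable with unique integers. The `responses` data and the `codebook` name are made up for illustration, and assigning codes in alphabetical order is an assumption; any one-to-one mapping would do.

```python
# Hypothetical text responses for a categorical variable (illustrative only).
responses = ["rent", "own", "rent", "other", "own"]

# Give each distinct category its own integer code.
# Sorting first keeps the mapping reproducible between runs.
codebook = {
    label: code
    for code, label in enumerate(sorted(set(responses)), start=1)
}

# Replace each text response with its integer code.
coded = [codebook[r] for r in responses]

print(codebook)  # {'other': 1, 'own': 2, 'rent': 3}
print(coded)     # [3, 2, 3, 1, 2]
```

Keeping the codebook alongside the coded data lets the integers be translated back into labels later.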
4
Q
What are missing values traditionally defined as?
A
A period (.)
5
Q
What should you inspect your dataset for?
A
- Clear mistakes: coding, recording, or data entry errors
- Outliers: extreme but plausible values
- Distribution of the data: look for skewness
- Prevalence of missing values for each variable
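Example (a rough sketch, not from the lecture): the last item, prevalence of missing values per variable, could be tallied as below. The `rows` data, the variable names, and the use of `None` as the missing-value marker are all assumptions made for illustration.

```python
# Hypothetical dataset: one dict per observation, None marks a missing value.
rows = [
    {"age": 34, "income": 52000},
    {"age": None, "income": 48000},
    {"age": 29, "income": None},
    {"age": 41, "income": None},
]


def missing_share(rows, variable):
    """Return the fraction of rows in which `variable` is missing (None)."""
    missing = sum(1 for row in rows if row[variable] is None)
    return missing / len(rows)


# Report the share of missing values for each variable.
for variable in ("age", "income"):
    print(variable, missing_share(rows, variable))
```

A variable with a high share of missing values may need separate attention before analysis.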
6
Q
What would a range check identify?
A
Clear mistakes and outliers
- found using a frequency table, histogram, or line plot
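Example (a minimal sketch, not from the lecture): a range check built on a frequency table. The `ages` values and the plausible range of 0 to 120 are assumptions chosen for illustration; the right bounds depend on the variable.

```python
from collections import Counter

# Hypothetical recorded ages, including two likely data entry mistakes.
ages = [23, 35, 35, 41, 230, 19, -4, 35]

# Assumed plausible range for this variable (illustrative bounds).
LOW, HIGH = 0, 120

# Frequency table: how often each distinct value occurs.
frequency = Counter(ages)

# Distinct values that fall outside the assumed range.
out_of_range = sorted(v for v in frequency if not LOW <= v <= HIGH)

for value in sorted(frequency):
    print(value, frequency[value])
print("Out of range:", out_of_range)
```

Values inside the range but far from the rest would still need a judgment call on whether they are genuine outliers.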
7
Q
How could you perform a consistency check?
A
Cross-tabulations (cross-tabs) or a scatter plot
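Example (a rough sketch, not from the lecture): a cross-tab used as a consistency check between two categorical variables. The `records` data and the rule that someone coded as not employed should report no hours worked are assumptions made for illustration.

```python
from collections import Counter

# Hypothetical records with two related categorical variables.
records = [
    {"employed": "yes", "hours_worked": "some"},
    {"employed": "no", "hours_worked": "none"},
    {"employed": "no", "hours_worked": "some"},  # conflicts with the assumed rule
    {"employed": "yes", "hours_worked": "some"},
]

# Cross-tab: count of observations for each combination of the two variables.
crosstab = Counter((r["employed"], r["hours_worked"]) for r in records)

# Under the assumed rule, this cell should be empty.
inconsistent = crosstab[("no", "some")]

for cell, count in sorted(crosstab.items()):
    print(cell, count)
print("Inconsistent records:", inconsistent)
```

A non-zero count in a cell that should be empty points to records worth checking against the original source.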