Intro to R Flashcards

1
Q

What are common data cleaning/preparation tasks?

A
  • dealing with missing values
  • dealing with incorrect data types on import
  • splitting the data into separate datasets (e.g., training and test sets) for use in model fitting and testing
  • transforming variables so models can pick up signal more effectively (e.g., applying PCA, grouping the levels of variables with many distinct values into fewer categories, or creating additional variables/flags to highlight certain relationships in the data)
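The tasks listed above can be sketched in base R. This is a minimal illustration, not a prescribed workflow: the data frame `raw`, its column names, the median imputation, the "seen fewer than 2 times" threshold, and the 80/20 split are all assumptions made for the example.

```r
# Hypothetical toy data; column names and values are illustrative only.
raw <- data.frame(
  age    = c("34", "51", NA, "28", "45", "39"),  # numbers imported as text
  region = c("N", "S", "N", "E", "W", "N"),
  claims = c(0, 2, 1, NA, 0, 3),
  stringsAsFactors = FALSE
)

# Incorrect data type on import: convert the text column to numeric.
raw$age <- as.numeric(raw$age)

# Missing values: impute age with its median (one option among many),
# and drop rows where the claims value is missing.
raw$age[is.na(raw$age)] <- median(raw$age, na.rm = TRUE)
clean <- raw[!is.na(raw$claims), ]

# Reducing categories: lump regions seen fewer than 2 times into "Other".
counts <- table(clean$region)
rare   <- names(counts)[counts < 2]
clean$region <- factor(ifelse(clean$region %in% rare, "Other", clean$region))

# Additional flag variable: 1 if the row has at least one claim.
clean$has_claim <- as.integer(clean$claims > 0)

# PCA on the numeric columns (centered and scaled).
pca <- prcomp(clean[, c("age", "claims")], center = TRUE, scale. = TRUE)

# Multiple datasets: a random 80/20 split for model fitting and testing.
set.seed(1)
train_idx <- sample(nrow(clean), size = floor(0.8 * nrow(clean)))
train <- clean[train_idx, ]
test  <- clean[-train_idx, ]
```

Whether to impute or drop missing values, and where to set the threshold for rare categories, depends on the data and the model; the choices here only keep the example short.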