Intro to R Flashcards
1
Q
Data cleaning/preparation tasks
A
- dealing with missing values
- dealing with incorrect data types on import
- creating multiple datasets for use in model fitting and testing
- transforming variables to allow models to pick up signal more effectively (ie. PCA, reducing the number of categories for variables with a large number of values, creating additional variables/flags to highlight certain relationships in the data)