Data science terms and Data handling terms Flashcards
Learn about the groupings of terms related to data science and one of this group i.e. Data handling terms
List the 4 groups of terms related to data science
-Data handling
-Data features terms
-Artificial intelligence
-Model development terms
-Model performance terms
What terms are data handling terms
Training set, Testing set, outlier, data cleansing
Define training set
The dataset used by the machine learning model that will help it to learn the desired task
Define testing set
Dataset that is used to measure the performance of the developed machine learning model
Define outlier
The data record that is seen as exceptional and outside the distribution of the normal input data
Define data cleansing
Process of removing redundant data, handling missing data entries, and removing or at least alleviating other data quality issues