Data science based methods for customer needs mapping Flashcards
Explain supervised learning
Builds a mathematical model of a set of data that containts both the inputs and the desired outputs
Explain unsupervised learning
Unsupervised learning algorithm in data science is where the algorithm is given unlabeled data, that contains only inputs and is tasked to find patterns, relationships, overall structure in the data, like clustering or grouping of data points.
Criterias for good datasets?
Answers a relevant question
Has a good variability in the key variable we are interested to predict
Has a good variability in the explanatory variables
Has man instances (rows in the table)
Does not have too many missing values
What is clustering in unsupervised learning?
Cluster analysis is the task of grouping a set of objects in such a way that objects in the same group (called clusters) are more similar to eaach other than those in other groups
How can we predict the probability of purchase? (demand)
- From discrete choice modeling
Goal: Predict likelihood of customer purchasing a particular product
Dependent variables: Often binary and represents whether a customer will buy product (1) or not (0)
Independent var: predictions, may include factors as price, advertising, or other that could influence purchasing decision
Method: multinominal logistics regression
- From historical product data
Goal: Analyze relationship between multiple independent variables and dependent variables
Dependent var: quantity of product sold
Independent var: factors that may impact sale, like price, packaging, branding etc
Method: Multinominal regression