Module 7 Flashcards
Overview of logistic regression
- Used to estimate the probability that an event will occur as a function of other variables
- Can be considered a classifier as well
Describe Inputs and outputs of logistic regression
Input - variables can be continuous or discrete
Output - Set of coefficients that indicate the relative impact of each driver + A linear expression for predicting the log-odds ratio of outcome as function of drivers
List logistic regression use cases
- probability of an event
- Binary classification
- Multi-class classification
What is the goal of logistic regression?
- Predict the true portion of success, pi, at any value of the predictor
- pi = # of success / # of trials
Describe Y X and PI in Binary logistic regression model
Y = Binary Response X = Quantitative predictor pi = success
Logistic regression Pros
- Explanatory value
- Robust
- Concise
- Easy to score data
- returns good probability estimates
- preserves summary stats of training data
Logistic Regression Cons
- Does not handle missing values well
- Doesnot work well with discrete drivers with distinct values
- Cannot handle variables that affect outcome in discontnues way ( step functions)
- Assumes each var affects log-odds
Describe Neural Network Concept
- constructed and implemented to model the human brain
- performs pattern matching, classification, etc tasks that are difficult for traditional computers
Describe an artificial neural network
- posses a large number of processing elements called nodes/neurons operating in parallel
- neurons connected by link
- each link has weight regarding input signal
- each neuron has internal state called activation level
What are the components of a single-layer neural network
Input layer, Hidden layer, output layer, parameters are weights and intercepts are biases
What are Ak and g(z) in a neural network
Ak is activations in the hidden layer
g(z) is called the activation function - popular functions are sigmoid and rectified linear
g(z) are typically non-linear derived features
Describe details of the output layer in ann and fitting model
- Output activation function encodes softmax function
- Fit model by minimizing cross entropy/ negative multinomial log-likelihood
Describe how CNN works
- builds up an image in a hierarchical fashion
- hierarchy is constructed through convolution and pooling layers
- Edges and shapes are recognized and pieced together to form shapes/target image
Describe the convolution filter ( learned, score)
- filters are learned during training
- Input image and filter are combined using the dot product to get a score
- score is high if sub-image of the input image is similar to filter
What is the idea of convolution, its result, and the weight in the filters?
- the idea is to find common patterns that occur in different parts of the image
- Result is a new feature map
- weights are learned by the network
What are Pooling and its adv
- each nonoverlapping 2 x 2 block is replaced by maximum
- sharpens feature identification
- allows for locational invariance
- reduces dimensions by a factor of 4