First Part Flashcards

To learn and answer well

You may prefer our related Brainscape-certified flashcards:
1
Q

What are the properties of normal distribution

A

Properties of Nornal Distribution:

Unimodal -one mode
Symmetrical -left and right halves are mirror images
Bell-shaped -maximum height (mode) at the mean
Mean, Mode, and Median are all located in the center
Asymptotic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the goal of A/B testing

A

It is a statistical hypothesis testing for a randomized experiment with two variables A and B.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is sensitivity ,specificity ,accuracy and precision

A

Sensitivity or TPR(True Postive Rate)= TP/(TP+FN)
Specificity or TNR(True Negative Rate)= TN/(TN+FP)
Precision or PPV(Positive Predictive Value)=TP/(TP+FP)
ACC=(TP+TN)/(TP+FP+TN+FN)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is over-fitting

A

In over-fitting, a statistical model describes/follows the random error or noise instead of the underlying relationship. Over-fitting occurs when a model is excessively complex, such as having too many parameters relative to the number of observations. A model that has been over-fit has poor predictive performance, as it overreacts to minor fluctuations in the training data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is under-fitting

A

Under-fitting occurs when a statistical model or machine learning algorithm cannot capture the underlying trend of the data. Under-fitting would occur, for example, when fitting a linear model to non-linear data. Such a model too would have poor predictive performance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Univariate analysis

A

Univariate analyses are descriptive statistical analysis techniques which can be differentiated based on the number of variables involved at a given point of time. For example, the pie charts of sales based on territory involve only one variable that can be referred to as univariate analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are bivariate and multivariate analysis

A

Bi variate tries to see how two variables interact with each other and understand what are the differences between the two. Example is a scatter plot. Multi-Variate analysis is to do the same but with more than 2 variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are eigen values and eigen vectors

A

Eigen Vectors are used for understanding linear transformations. In analysis they ae generally used for correlation or covariance matrix.Eigenvectors are the directions along which a particular linear transformation acts by flipping, compressing or stretching.

Eigenvalue can be referred to as the strength of the transformation in the direction of eigenvector or the factor by which the compression occurs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is machine learning

A

Machine Learning explores the study and construction of algorithms that can learn from and make predictions on data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is supervised learning

A

Supervised learning is the machine learning task of inferring a function from labeled training data. The training data consist of a set of training examples.
It essentially means that there is a target variable
Eg: Support Vector Machines, Regression, Naive Bayes, Decision Trees, K-nearest Neighbor Algorithm and Neural Networks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is unsupervised learning

A

Unsupervised learning is a type of machine learning algorithm used to draw inferences from datasets consisting of input data without labeled responses.
There is no target variable.
eg: Clustering, Anomaly Detection, Neural Networks and Latent Variable Models

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is logistic regression

A

Logistic Regression often referred as logit model is a technique to predict the binary outcome from a linear combination of predictor variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the logit model

A

Logit model=log(p/(1-p)) where p is the probability of the event occurring

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are recommender systems

A

Recommender Systems are a subclass of information filtering systems that are meant to predict the preferences or ratings that a user would give to a product.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is collaborative filtering

A

The process of filtering used by most of the recommender systems to find patterns or information by collaborating viewpoints, various data sources and multiple agents.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What do you mean by Deep Learning

A

Deep Learning is nothing but a paradigm of machine learning which has shown incredible promise in recent years. This is because of the fact that Deep Learning shows a great analogy with the functioning of the human brain.

17
Q

What are Artificial Neural Networks

A

Artificial Neural networks are a specific set of algorithms that have revolutionized machine learning. Neural Networks can adapt to changing input so the network generates the best possible result without needing to redesign the output criteria.

18
Q

What is Gradient Descent

A

A gradient measures how much the output of a function changes if you change the inputs a little bit. It simply measures the change in all weights with regard to the change in error. You can also think of a gradient as the slope of a function.
Gradient Descent can be thought of climbing down to the bottom of a valley, instead of climbing up a hill. This is because it is a minimization algorithm that minimizes a given function (Activation Function).

19
Q

What is an activation function. What are the various types of activation functions

A

Activation function A = “activated” if Y > threshold else not
Alternatively, A = 1 if y> threshold, 0 otherwise
1. Step Function
2.sigmoid function=1/(1+e^-x)
3. Tanh function
The tanh function is very similar to a scaled sigmoid function

20
Q

What is a tanh function

A

The tanh function is very similar to a scaled sigmoid function
(2/(1+e^-x))-1=2sigmoid(2x)-1

21
Q

How to solve gradient descent

A

It is based on the observation that if the multi-variable function F(x) is defined and differentiable in a neighborhood of a point a,-(diveregence)F(a).
When the function F is convex, all local minima are also global minima, so in this case gradient descent can converge to the global solution.

22
Q

What is Back Propagation and Explain it’s Working

A

Backpropagation is a training algorithm used for multilayer neural network. In this method, we move the error from an end of the network to all weights inside the network and thus allowing efficient computation of the gradient.

23
Q

What are the steps to back propagation

A
  1. Forward Propagation of Training Data
  2. Derivatives are computed using output and target
  3. Back Propagate for computing derivative of error wrt output activation
  4. Using previously calculated derivatives for output
  5. Update the Weights
24
Q

What are the variants of Back Propagation?

A

Stochastic Gradient Descent: We use only single training example for calculation of gradient and update parameters.
Batch Gradient Descent: We calculate the gradient for the whole dataset and perform the update at each iteration.
Mini-batch Gradient Descent: It’s one of the most popular optimization algorithms. It’s a variant of Stochastic Gradient Descent and here instead of single training example, mini-batch of samples is used.

25
Q

What is the role of Activation Function?

A

The Activation function is used to introduce non-linearity into the neural network helping it to learn more complex function. Without which the neural network would be only able to learn linear function which is a linear combination of its input data. An activation function is a function in an artificial neuron that delivers an output based on inputs

26
Q

What is the formula on which decision tree is based on

A

For decision trees we use 2 methods
Impurity
Impurity is when we have a traces of one class division into other. This can arise due to following reason
1.We run out of available features to divide the class upon.
2.We tolerate some percentage of impurity (we stop further division) for faster performance. (There is always trade off between accuracy and performance).
For example in second case we may stop our division when we have x number of fewer number of elements left. This is also known as gini impurity.
Entropy
Entropy is degree of randomness of elements or in other words it is measure of impurity
Formula is