Intro Flashcards

1
Q

Overfitting

A

the production of an analysis that corresponds too closely or exactly to a particular set of data, and may therefore fail to fit to additional data or predict future observations reliably

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Underfitting

A

a data model is unable to capture the relationship between the input and output variables accurately, generating a high error rate on both the training set and unseen data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Neural network

A

a biologically inspired mathematical function made up of artifical “neurons” that interact with one another. Neurons are typically represented by circles and arrows between neurons represent weights that describe the relationships between the neurons

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Sigmoid function

A

A weighted sum of inputs is passed through an activation function and this output serves as an input to the next layer. When the activation function for a neuron is a sigmoid function it is a guarantee that the output of this unit will always be between 0 and 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Recurrent neural networks (RNN)

A

a neural network made up of identical “units” of neurons that feed into one another in a series. The most common type of network that interacts with time series or sequential data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Long Short-Term Memory (LSTM) networks

A

variant of RNNs. Its Gating mechanism sets it apart. This feature addresses the short term memory problem of RNNs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Regularization

A

step taken to reduce Overfitting (high variance) and underfitting (high bias)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Tensor

A

mathematical objects that can be used to describe physical properties, just like scalars and vectors. In fact tensors are merely a generalization of scalars and vectors; a scalar is a zero rank tensor, and a vector is a first rank tensor.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Scalar

A

has only magnitude, no direction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Vector

A

having magnitude and direction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Backpropagation

A

short for “backward propagation of errors,” is an algorithm for supervised learning of artificial neural networks using gradient descent. Given an artificial neural network and an error function, the method calculates the gradient of the error function with respect to the neural network’s weights.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Gradient descent

A

an iterative first-order optimization algorithm used to find a local minimum/maximum of a given function. This method is commonly used in machine learning (ML) and deep learning(DL) to minimize a cost/loss function (e.g. in a linear regression)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Gradient

A

calculated by taking the partial derivative of a function with respect to each variable. Result is expressed as a vector

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Jacobian matrix

A

the Jacobian matrix of a vector-valued function of several variables is the matrix of all its first-order partial derivatives.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Normalization

A

The goal of normalization is to transform features to be on a similar scale. This improves the performance and training stability of the model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Data lake

A

datastore where data is stores from multiple datasources. Typically dirty with missing fields and inconsistent data types. Can have schema applied and data permissioning

17
Q

Data pipeline

A

Retrieve data from datasource (ingestion) -> apply data lake schema/permissioning parameters -> store in data lake (aggregation of data stores) -> pull data from data lake and clean/structure it (feature engineering)

18
Q

Imputation

A

filling in missing data values

19
Q

Data warehouse

A

different from data lake. Take slice of historical data, put data in column format as opposed to row, and apply additional transformations. Proxy between data lake and analyst. Aggregations are faster on column structured data

20
Q

Confusion matrix

A

visualize false positives, false negatives, etc. Uses to grade your ML algorithm, not train

21
Q

Classifier

A

In data science, a classifier isa type of machine learning algorithm used to assign a class label to a data input. An example is an image recognition classifier to label an image (e.g., “car,” “truck,” or “person”).

22
Q

Hyperparameter

A

In machine learning, a hyperparameter is a parameter whose value is used to control the learning process. By contrast, the values of other parameters are derived via training.

23
Q

Matrix factorization

A

Matrix factorization is a class of collaborative filtering algorithms used in recommender systems. Matrix factorization algorithms work by decomposing the user-item interaction matrix into the product of two lower dimensionality rectangular matrices.

24
Q

Convolutional layer

A

A convolutional layer isthe main building block of a CNN. It contains a set of filters (or kernels), parameters of which are to be learned throughout the training. The size of the filters is usually smaller than the actual image

25
Q

Examples of machine learning use cases

A

Recommender systems
Sentiment analysis
Classification/Clustering
Spam detection
Anomaly detection

26
Q

Support Vector Machines

A

Supervised learning algorithms that analyze data for classification and regression analysis.

A simple example may be drawing a lime on a graph where the points above the line are one category and those below the line are another category.

27
Q

Hyperplane

A

subspace whose dimension is one less than that of its ambient space. For example, if a space is 3-dimensional then its hyperplanes are the 2-dimensional planes, while if the space is 2-dimensional, its hyperplanes are the 1-dimensional lines.

28
Q

Linear regression

A

Simple regression machine learning algorithm that predicts your response variable y based on one or more independent variables X, fitting a linear relationship between the two.

29
Q

Cost function

A

Function used to provide a quantitative measure of a machine learning algorithms prediction inaccuracy. Mean squared error is a commonly used example

30
Q

Learning curves

A

A good tool to understand whether overfitting is going on.