Midterm 1 Flashcards by A B

What is Machine Learning?

A training set has attributes, where one of the attributes is the class.
We want to find a model for class attribute as some function of the values of other attributes using a test set.

How well did you know this?

Not at all

Perfectly

The dataset is split into?

a training and testing set

How well did you know this?

Not at all

Perfectly

Confusion Matrix

A way to lay out how many predicted categories/classes were correctly predicted and how many were not.
- true positive, true negative, false positives, false negatives

How well did you know this?

Not at all

Perfectly

Which of the following statements is/are correct?

a) In machine learning, most of the data is used for testing.
b) In machine learning, most of the data is used for training.
c) Training set is used to determine the accuracy of the model.
d) b and c.

b) In machine learning, most of the data is used for training.

How well did you know this?

Not at all

Perfectly

which machine learning technique should be applied to the following problem?
“In information retrieval, a search engine needs to find groups of documents that
are similar to each other based on important term appearing in them”.
a) Clustering
b) Classification
c) Regression
d) Validation

a) Clustering

How well did you know this?

Not at all

Perfectly

Which of the following tasks is an unsupervised learning technique?

a) Clustering
b) Classification
c) Regression
d) All of the above

a) Clustering

How well did you know this?

Not at all

Perfectly

Which of the following methods requires having a training set and test set?

a) Supervised Learning
b) Unsupervised Learning
c) a and b
d) None of the above

a) Supervised Learning

How well did you know this?

Not at all

Perfectly

Which of the following is NOT an example of a machine learning problem?
a) Optical character recognition: categorize images of handwritten characters by
letters represented
b) Face detection: find faces in images
c) Topic spotting: categorize news articles
d) None of the above.

d) None of the above.

How well did you know this?

Not at all

Perfectly

In classification problems, there may be multiple ways of classifying data items,
i.e., a data item may belong to more than one classification category. T/F

How well did you know this?

Not at all

Perfectly

Which of the following is an example of a flag variable?

a) Gender: female/male
b) Weather: clear/rainy/cloudy
c) Temperature: [21, 80]
d) a and b

d) a and b

How well did you know this?

Not at all

Perfectly

K-means clustering is an unsupervised technique to partition the dataset into K
pre-defined distinct non-overlapping subgroups. T/F

How well did you know this?

Not at all

Perfectly

Association rules are good means to predict sequential dependencies among
different events. T/F

How well did you know this?

Not at all

Perfectly

Why do we use regularization on models?

a) To measure the accuracy of a model
b) To prevent overfitting
c) To train a model
d) All of the above

b) To prevent overfitting

How well did you know this?

Not at all

Perfectly

What does the loss function measure?

a) residual error
b) prediction error
c) model parameters
d) all of the above

b) prediction error

How well did you know this?

Not at all

Perfectly

Training set is used to determine the accuracy of the model. T/F

How well did you know this?

Not at all

Perfectly

In a 2-layered Neural Network, the perceptron takes an input, calculates the weighted
sum of the inputs and weights, and returns 1 if the weighted sum is above a threshold
value (T/F)

Study These Flashcards

When training a model, the main goal is to:

a) Update model coefficients
b) Minimize the error by updating model coefficients
c) Add bias
d) None of the above

Study These Flashcards

b) Minimize the error by updating model coefficients

N-fold Cross validation is a method used to prevent overfitting. T/F

Study These Flashcards

OLS method is used when the relationship between input and output is very complex.
T/F

Study These Flashcards

What is Ordinary Least Squares Method for?

a) Minimize the loss function
b) Maximize the loss function
c) Update the parameters of a model
d) a and c

Study These Flashcards

d) a and c

Why do we use regularization on models?

a) To measure the accuracy of a model
b) To prevent overfitting
c) To train a model
d) All of the above

Study These Flashcards

b) To prevent overfitting

Gradient Descent method is used when the relationship between input and output is very
complex. T/F

Study These Flashcards

Regularization is a method that penalizes model coefficients to reduce overfitting.
T/F

Study These Flashcards

Lasso is an example of regularization method. T/F

Study These Flashcards

what is the name of a 3-layered neural network? a) Perceptron b) Multilayer Perceptron c) Deep Neural Network d) None of the above

b) Multilayer Perceptron

What is the popular technique to find the parameters of a Deep neural network? a) OLS b) Stochastic gradient descent c) Mini-batch gradient descent d) None of the above

c) Mini-batch gradient descent

``` What is the popular technique to find the parameters of a shallow neural network? a) OLS b) Stochastic gradient descent c) Mini-batch gradient descent d) None of the above ```

b) Stochastic gradient descent

How many output neurons in ANN is needed to perform a binary classification? a) 1 b) 2 c) 3 d) 4

a) 1

``` how many output neurons in ANN is needed to perform multiclass classification when the output labels are ordered? a) 1 b) 2 c) 3 d) 4 ```

a) 1

In a deep neural network, different activation functions may be used at different layers. T/F

A perceptron model can be used to emulate the functionality of AND logical gate. T/F

A 2-layered Neural network can be used to emulate the functionality of XOR gate. T/F

when is 1-of-n output encoding implemented in an Artificial Neural Network? a) To perform binary classification b) To perform multiclass classification when output variables are ordered c) To perform multiclass classification then the output variables are not ordered d) All the above

c) To perform multiclass classification then the output variables are not ordered

MLE uses natural log to optimize the computation cost of the MLE. T/F

Given a dataset with x1,..., x6 input attributes, how are the terms in a polynomial regression model constructed for this dataset? a) Features of the dataset are converted to their higher order polynomial to represent the terms in the model b) Features of the dataset are used in the same way as linear regression model c) Always two features are used d) None of the above

a) Features of the dataset are converted to their higher order polynomial to represent the terms in the model

What does the loss function measure? a) residual error b) prediction error c) model parameters d) all of the above

b) prediction error

K-means clustering is an unsupervised technique to partition the dataset into K pre-defined distinct non-overlapping subgroups. T/F

Midterm 1 Flashcards

(38 cards)