MODULE 2 S2.3.1 Flashcards

Linear Model for Classification (Logistic Regression)

1
Q

T/F In Naïve Bayes, training and testing are fast and easy to understand and process.

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

3 Kinds of Naïve Bayes Classifier in Scikit-learn

A

GaussianNB - continuous data
BernoulliNB - binary data, text data
MultinomialNB - integer count data, text data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Classification algorithms in linear models

A
  • How well weights and intercepts fir the training data
  • Cost function or loss function
  • Regularization methods can be used
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Logistic Regression is a ______________ algorithm, not a regression algorithm.

A

classification

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Default C value:

A

1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

T/F Using a higher value of C stresses the importance that each individual data point be classified correctly.

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

______________ and ____________ are a text-like used to count sparse data.

A

BerniuolliNB
MultinomialNB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

T/F In Naïve Bayes, Larger alpha decreases the complexity of the model but does not change the performance.

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

______________ is a high-dimensional dataset.

A

GaussianNB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

It is a family of classifiers that are quite similar to the linear models.

A

Naïve Bayes Classifier

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

T/F Logistic Regression is also a regression algorithm.

A

FALSE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

A regression where the dependent variable is in an ordered state (i.e., ordinal). The dependent variable (y) specifies an order with two or more categories or levels.

A

Ordinal Logistic Regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Linear SVM is implemented in svm.LinearSVC.

SVC stands for

A

Support Vector Classifier

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

It predicts the relationship between the independent and binary dependent variables.

A

Binary Logistic Regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

For LogisticRegression and LinearSVC the trade-off parameter that determines the strength of the regularization is called ______.

A

C

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

A regression type that has more than two possible outcomes.

A

Multinomial Logistic Regression

17
Q

It is a statistical technique used to develop predictive models with categorical dependent variables having dichotomous or binary outcomes

A

Binary Logistic Regression

18
Q

T/F The reason that Naïve Bayes are so efficient is that they learn parameters by looking at each feature individually and collect simple per-class statistics from each feature.

A

TRUE

19
Q

The two most common linear classification algorithms

A

Logistic Regression
Linear Support Vector Machines (Linear SVMs)

20
Q

In Naïve Bayes, generalization performance is ___________________.

A

slightly slower

21
Q

A categorical dependent variable has two or more discrete outcomes in a __________ regression type.

A

Multinomial

Multinomial Logistic Regression

22
Q

T/F Naïve Bayes works well with sparse high-dimensional datasets and is not parameter sensitive.

A

TRUE

23
Q

Higher values of C correspond to _________ regularization.

A

less

24
Q

In Naïve Bayes, training speed is ____________ than linear classifier.

A

faster

25
Q

T/F Using low values of C will cause the algorithms to try to adjust to the “majority” of data points.

A

TRUE