Introduction to Machine Learning Flashcards

1
Q

Why is machine learning becoming mainstream and essential for IT professionals?

A

They use collections of large amounts of data to gain insight and predict trends.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a dedicated module for machine learning within Apache Spark?

A

Spark ML

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does Spark ML do?

A

Spark ML is a dedicated module for machine learning within Apache Spark, integrating machine learning algorithms with Big Data for speed and distributed computation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the process for Machine Learning?

A

Data collection, feature engineering, algorithm selection, model training/evaluation, and live analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the types of algorithms?

A

Supervised and Unsupervised.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are Supervised algorithms?

A

Regression and classification.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are unsupervised algorithms?

A

Clustering.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is Feature Engineering?

A

Supervised learning involves inputs (features/independent variables) and outputs (labels/dependent variables).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the steps in Feature Engineering?

A

Data cleaning, feature analysis, feature preparation, and feature scaling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is Linear Regression used for?

A

Linear regression is used to predict a continuous value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are some of the methods in Linear Regression and what do they do?

A

Ordinary Least Squares Method: Minimizes the sum of squared errors to find the best-fit line.

Multivariate regression: involves predicting a label using multiple features.

Accurate measurement: coefficient of determination measures how well the model fits the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly