1: Intro to ML Flashcards

1
Q

What is meant by Machine Learning?

A

Machine learning builds a mathematical model that captures the relationship between inputs and outputs, then uses that model to predict outputs for new scenarios or to generate insights about new data.

2
Q

What are parameters?

A

Constants that define specific characteristics of the system (e.g., growth rate, decay constant).

3
Q

What does ML use?

A

Machine learning uses a computing device to process data and integrates many distinct mathematical tools such as probability and statistics, optimization, and control theory.

4
Q

Generally speaking, what types of outputs can a dataset have?

A

Either continuous or discrete.

5
Q

What kind of model is used to predict continuous outputs?

A

Regression.
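
As a minimal sketch of regression, assuming a single input feature and a straight-line model fitted by ordinary least squares. The data and the name `fit_line` are made up for illustration and do not come from the cards:

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The slope is the covariance of x and y divided by the variance of x.
    cov_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    slope = cov_xy / var_x
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Illustrative data lying exactly on the line y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]

slope, intercept = fit_line(xs, ys)
# Predict a continuous output for an input the model has not seen.
prediction = slope * 4.0 + intercept
```

The fitted model returns a real number for any new input, which is what distinguishes regression from classification.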

6
Q

What kind of model is used to predict discrete (finite, categorical) outputs?

A

Classification.

7
Q

What ML techniques are used to discover the hidden patterns within the data (i.e., there are no target outputs provided)?

A

Cluster analysis (clustering).

8
Q

What are the four major machine learning paradigms?

A

Supervised learning, unsupervised learning, semi-supervised learning, and reinforcement learning.

9
Q

What is unsupervised learning about?

A

Concerned with discovering the hidden patterns in the data inputs and includes clustering as an important sub-domain.
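
One way to picture clustering is k-means, sketched here for one-dimensional data. The choice of k-means, the function name `k_means_1d`, the data, and the starting centers are all assumptions for illustration; the cards do not name a specific algorithm:

```python
def k_means_1d(points, centers, iterations=10):
    """Group 1-D points around the given initial centers using k-means."""
    centers = list(centers)
    clusters = [[] for _ in centers]
    for _ in range(iterations):
        # Assignment step: each point joins its nearest center.
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
            clusters[nearest].append(p)
        # Update step: move each center to the mean of its cluster.
        # An empty cluster keeps its previous center.
        centers = [
            sum(c) / len(c) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
    return centers, clusters


# Unlabeled, illustrative data with two visible groups.
points = [1.0, 1.2, 0.8, 5.0, 5.2, 4.8]
centers, clusters = k_means_1d(points, centers=[0.0, 6.0])
```

No target outputs are supplied anywhere: the grouping emerges from the distances between the inputs alone.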

10
Q

What is supervised learning about?

A

Denotes learning tasks in which data inputs and their corresponding target outputs are provided; it includes classification and regression approaches.

11
Q

What is semi-supervised learning about?

A

Covers problems where only partial label information exists. First, in the semi-supervised classification step, a basic classification model is built on the few labeled data instances. Then, in the semi-supervised clustering step, the model is tuned to operate without supervision on the large remaining set of unlabeled instances and assigns them to the classes from the first step.

12
Q

What is reinforcement learning about?

A

Denotes the learning setup where the goal is to find an action policy that achieves a given goal. It follows a "cause and effect" method: a reward function acts as feedback to the agent.

13
Q

What are the two datasets used for building the model?

A

The training set is used to develop the model. The testing set is used to evaluate the accuracy of the developed model.
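
A minimal sketch of how the two sets might be produced from one dataset. The helper name `train_test_split`, the 25% test fraction, and the fixed seed are assumptions chosen for illustration:

```python
import random


def train_test_split(samples, test_fraction=0.25, seed=0):
    """Shuffle the samples, then split them into training and testing sets."""
    shuffled = list(samples)
    # A seeded generator keeps the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


# Illustrative dataset of 20 sample identifiers.
samples = list(range(20))
train_set, test_set = train_test_split(samples)
```

Keeping the testing set out of model development is what makes its accuracy estimate meaningful for new data.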

14
Q

What questions do precision and recall answer?

A

Precision: Of all the predicted positive cases, how many are actually positive?
Recall: Of all the actual positive cases, how many did the model identify correctly?
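
Written as ratios, the two questions become the formulas below. The symbols TP (true positives), FP (false positives), and FN (false negatives) are standard confusion-matrix notation and are not used in the cards themselves:

```latex
\[
\text{Precision} = \frac{TP}{TP + FP},
\qquad
\text{Recall} = \frac{TP}{TP + FN}
\]
```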

15
Q

How do you validate the best classification model?

A

Several techniques may be tested in parallel, and the technique that returns the highest evaluation performance is selected.
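
The selection step can be sketched as follows, assuming accuracy on held-out data is the evaluation measure. The three threshold "models", their names, and the data are hypothetical stand-ins for real techniques:

```python
def accuracy(model, inputs, labels):
    """Fraction of inputs for which the model predicts the true label."""
    correct = sum(1 for x, y in zip(inputs, labels) if model(x) == y)
    return correct / len(labels)


# Hypothetical candidates: each labels a number 1 if it exceeds a threshold.
candidates = {
    "threshold_2": lambda x: int(x > 2),
    "threshold_5": lambda x: int(x > 5),
    "threshold_8": lambda x: int(x > 8),
}

# Illustrative held-out data generated by the rule x > 5.
test_inputs = [1, 3, 4, 6, 7, 9]
test_labels = [0, 0, 0, 1, 1, 1]

# Evaluate every candidate on the same data and keep the best scorer.
scores = {
    name: accuracy(model, test_inputs, test_labels)
    for name, model in candidates.items()
}
best_name = max(scores, key=scores.get)
```

Every candidate is scored on the same data so that the comparison is fair.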

16
Q

How do you validate a regression model?

A

The evaluation metric that is routinely calculated to judge the model’s performance is the mean squared error (MSE).
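
A short sketch of the MSE calculation, using made-up target and prediction values:

```python
def mean_squared_error(y_true, y_pred):
    """Average of the squared differences between targets and predictions."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


# Illustrative targets and model predictions.
y_true = [3.0, 5.0, 7.0]
y_pred = [2.0, 5.0, 9.0]

# Squared errors are 1, 0, and 4, so the mean is 5 / 3.
mse = mean_squared_error(y_true, y_pred)
```

Because the errors are squared, large misses are penalized more heavily than small ones, and a lower MSE indicates a better fit.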

17
Q

Please define classification and give an example.

A

In classification, one assigns objects (instances) to one of a set of predefined classes designated by class labels. This is done based on information extracted from training data that has already been classified. Classification is a form of supervised machine learning. Example: determining whether a given e-mail is spam.

18
Q

Please list the evaluation metrics in a classification algorithm.

A

The confusion matrix is used to evaluate the model. Metrics based on the confusion matrix include accuracy, precision, recall, and F-score.
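
The sketch below derives the four metrics from a binary confusion matrix. It assumes the F-score meant here is the common F1 score (the harmonic mean of precision and recall); the labels and function names are illustrative:

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary problem."""
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1
        elif pred == positive:
            fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def classification_metrics(tp, fp, fn, tn):
    """Compute accuracy, precision, recall, and F1 from confusion counts."""
    accuracy = (tp + tn) / (tp + fp + fn + tn)
    # Guard against division by zero when a class is never predicted or present.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Illustrative true labels and predictions (1 = positive class).
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 1, 0, 0, 1, 0, 1, 0]

tp, fp, fn, tn = confusion_counts(y_true, y_pred)
metrics = classification_metrics(tp, fp, fn, tn)
```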

19
Q

Describe the difference between classification and clustering.

A

Classification: Assigning data to predefined categories.

Clustering: Partitioning data into groups that are not defined in advance.

20
Q

What are the ingredients of reinforcement learning?

A

Agent: performs the actions.
Action: one of the possible moves.
Environment: the scenario the agent faces.
State: the current situation returned by the environment.
Reward: the immediate return.
Policy: the strategy that the agent employs.
Value: the long-term return.

21
Q

What are the applications of Reinforcement Learning?

A

Well-known applications of reinforcement learning are game playing and robotic movement control.

22
Q

How does RL work?

A

If the agent starts in state S1 and takes action A1, it reaches state S2 and gains a reward R2. Moving further by taking action A2, it reaches state S3 and gains reward R3.
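
The state, action, reward sequence can be traced with a toy environment. The `step` function, the four-state line world, and the fixed sequence of actions are hypothetical, chosen only to make the loop concrete:

```python
def step(state, action):
    """Hypothetical environment: states 1..4 on a line; arriving at 4 pays 1."""
    next_state = max(1, min(4, state + action))
    reward = 1.0 if next_state == 4 else 0.0
    return next_state, reward


state = 1  # S1
trajectory = []
# A1, A2, A3: a fixed, illustrative policy that always moves right.
for action in [+1, +1, +1]:
    next_state, reward = step(state, action)
    # Record (state, action, reward received on arrival, next state).
    trajectory.append((state, action, reward, next_state))
    state = next_state
```

Each tuple in `trajectory` is one round of the loop: the agent acts, and the environment answers with a new state and a reward.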

23
Q

Please list the reinforcement learning approaches.

A

There are three major approaches to performing reinforcement learning: model-based (and model-free), value-based, and policy-based.

Model-based: Learns a model of the environment’s dynamics (how actions affect states) to plan the best actions.
Value-based: Focuses on learning the value (expected return) of actions in states, e.g., using Q-learning.
Policy-based: Directly learns a policy (mapping from states to actions) to maximize rewards, e.g., using policy gradient methods.
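
As a sketch of the value-based approach, here is tabular Q-learning on a tiny line world. The environment (states 0 to 3, with a reward of 1 for reaching state 3), the hyperparameters, and all names are assumptions made for illustration:

```python
import random

ACTIONS = [-1, +1]  # move left or move right
GOAL = 3            # terminal state of the hypothetical line world


def q_learning(episodes=200, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Learn action values Q(s, a) with epsilon-greedy tabular Q-learning."""
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(GOAL + 1) for a in ACTIONS}
    for _ in range(episodes):
        s = 0
        while s != GOAL:
            if rng.random() < epsilon:
                a = rng.choice(ACTIONS)  # explore
            else:
                # Exploit: pick a best-valued action, breaking ties at random.
                best = max(q[(s, b)] for b in ACTIONS)
                a = rng.choice([b for b in ACTIONS if q[(s, b)] == best])
            s_next = max(0, min(GOAL, s + a))
            r = 1.0 if s_next == GOAL else 0.0
            # The terminal state has no future value.
            future = 0.0 if s_next == GOAL else max(q[(s_next, b)] for b in ACTIONS)
            # Q-learning update: move Q(s, a) toward r + gamma * max Q(s', .).
            q[(s, a)] += alpha * (r + gamma * future - q[(s, a)])
            s = s_next
    return q


q_table = q_learning()
# The greedy policy picks the highest-valued action in each non-terminal state.
greedy_policy = {s: max(ACTIONS, key=lambda a: q_table[(s, a)]) for s in range(GOAL)}
```

The policy is never stored directly during learning; it is read off the learned values afterwards, which is the defining trait of a value-based method.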