General Machine Learning & Ethics Flashcards

Question 1

Q

What is the difference between supervised and unsupervised learning?

Answer

A

Supervised learning involves training a model on labeled data, while unsupervised learning deals with unlabeled data and seeks to find patterns or structures within it.

Question 2

Q

What is Machine Learning and what does it do?

Answer

A

Machine Learning is the development of algorithms and statistical models that enable computers to learn from data and make predictions or decisions without explicit programming

Question 3

Q

Supervised ML

Answer

A

uses labeled datasets
to train algorithms to
classify or predict outcomes.

Question 4

Q

What is required for Supervised ML?

Answer

A

you need labeled data

Question 5

Q

Unsupervised ML

Answer

A

uses algorithms to analyze and cluster unlabeled datasets.

Question 6

Q

Once an algorithm is deployed, ____ learning will manage data as it comes in and classify or analyze it.

Answer

A

Unsupervised

Question 7

Q

When is Linear Regression used?

Answer

A

Linear regression models are used when the result must be a continuous variable

Ex. predict rainfall amounts in inches

Question 8

Q

Supervise ML - Classification

Answer

A

Classification models will deliver results as a categorical variable, where there is a finite set of values that the variable can be. two results: Will Rain or Won’t Rain.

Question 9

Q

What is the 3 main goal of the analyze stage in machine learning?

Answer

A

Understanding response variables and how they’re structured. continuous? categorical?
Explore predictor variables.
Featuring Engineering

Question 10

Q

Will my machine learning model change over time?

Answer

A

For a model to predict accurately, the data that it is making predictions on must have a similar distribution as the data on which the model was trained.

Because data distributions can be expected to drift over time, deploying a model is not a one-time exercise but rather a continuous process.

Continuous monitoring of incoming data can help retrain your model on newer data if the data distribution has deviated significantly from the original training data distribution.

Question 11

Q

How can I determine that machine learning is the right solution?

Answer

A

Requires complex logic
Requires scalability
Requires personalization
Requires responsiveness

Question 12

Q

What are the reasons to NOT use machine learning?

Answer

A

Can be solved with traditional algorithms
Does not require adapting to new data
Requires 100% accuracy
Requires full interpretability

Question 13

Q

Is my data ready for a machine learning solution?

Answer

A

Is it easily accessible?
Does it respect privacy?
Is it relevant?

Question 14

Q

What is popularity bias in the context of machine learning?

Answer

A

Popularity bias refers to the phenomenon where more popular items are recommended more frequently by a system, often overlooking other items that could be just as pleasing to users.

Question 15

Q

Why is it important for data professionals to prioritize fairness in their data

Answer

A

reduce the potential for unintended consequences of machine learning applications, including the perpetuation of human biases.
It is part of responsible data stewardship.

Question 16

Q

What are some ethical considerations when building a model

Answer

Study These Flashcards

A

ensuring informed consent for the use of personal data
considering who is affected by the model and potential harm
ensuring data is appropriate and representative
considering the explainability of the model’s predictions
and regularly reviewing and monitoring the model’s performance.

Question 17

Q

What is a black box model?

Answer

Study These Flashcards

A

A black box model refers to a type of model where it’s difficult to understand how the model arrived at its predictions.

Question 18

Q

What is one way to evaluate model fairness?

Answer

Study These Flashcards

A

One way to evaluate model fairness is by checking how the model’s error is distributed over a population. If the model mainly makes errors in specific, similar cases it could carry higher ethical risk.

Question 19

Q

What is a potential risk in decision-making in machine learning?

Answer

Study These Flashcards

A

A potential risk is exposing a business and the people it serves to negative consequences

Question 20

Q

Where does bias in machine learning originate from?

Answer

Study These Flashcards

A

from human bias

Question 21

Q

Why can bias in machine learning be deceptive?

Answer

Study These Flashcards

A

It can be deceptive because even though the bias stems from humans, the computer making the prediction can give the result an appearance of objectivity.

Question 22

Q

What are some questions to help consider faireness of your model?

Answer

Study These Flashcards

A

If your model uses personal information, have these people given their consent for you to collect and use this data?
Is there a way for them to withdraw their consent?
Are they aware of what you’re doing with their information?

Question 23

Q

Explain the bias-variance trade-off in machine learning.

Answer

Study These Flashcards

A

the trade-off between a model’s ability to fit the training data (low bias) and its ability to generalize to new, unseen data (low variance).

High bias can result in underfitting,
High variance can lead to overfitting

Balancing these aspects is essential for optimal model performance.

General Machine Learning & Ethics Flashcards

(23 cards)