General Machine Learning & Ethics Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

What is the difference between supervised and unsupervised learning?

A

Supervised learning involves training a model on labeled data, while unsupervised learning deals with unlabeled data and seeks to find patterns or structures within it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is Machine Learning and what does it do?

A

machine learning is the development of algorithms and statistical to teach computers to find patterns in data and make predictions or decisions without being explicitly programmed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Supervised ML

A
  • uses labeled datasets
  • to train algorithms to
  • classify or predict outcomes.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is required for Supervised ML?

A

you need labeled data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Unsupervised ML

A

uses algorithms to analyze and cluster unlabeled datasets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Once an algorithm is deployed, ____ learning will manage data as it comes in and classify or analyze it.

A

Unsupervised

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

When is Linear Regression used?

A

Linear regression models are used when the result must be a continuous variable

Ex. predict rainfall amounts in inches

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Supervise ML - Classification

A

Classification models will deliver results as a categorical variable, where there is a finite set of values that the variable can be. two results: Will Rain or Won’t Rain.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the 3 main goal of the analyze stage in machine learning?

A
  1. Understanding response variables and how they’re structured. continuous? categorical?
  2. Explore predictor variables.
  3. Featuring Engineering
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Will my machine learning model change over time?

A

For a model to predict accurately, the data that it is making predictions on must have a similar distribution as the data on which the model was trained.

Because data distributions can be expected to drift over time, deploying a model is not a one-time exercise but rather a continuous process.

Continuous monitoring of incoming data can help retrain your model on newer data if the data distribution has deviated significantly from the original training data distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How can I determine that machine learning is the right solution?

A
  1. Requires complex logic
  2. Requires scalability
  3. Requires personalization
  4. Requires responsiveness
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are the reasons to NOT use machine learning?

A
  1. Can be solved with traditional algorithms
  2. Does not require adapting to new data
  3. Requires 100% accuracy
  4. Requires full interpretability
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Is my data ready for a machine learning solution?

A
  • Is it easily accessible?
  • Does it respect privacy?
  • Is it relevant?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is popularity bias in the context of machine learning?

A

Popularity bias refers to the phenomenon where more popular items are recommended more frequently by a system, often overlooking other items that could be just as pleasing to users.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Why is it important for data professionals to prioritize fairness in their data

A
  • reduce the potential for unintended consequences of machine learning applications, including the perpetuation of human biases.
  • It is part of responsible data stewardship.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are some ethical considerations when building a model

A
  1. ensuring informed consent for the use of personal data
  2. considering who is affected by the model and potential harm
  3. ensuring data is appropriate and representative
  4. considering the explainability of the model’s predictions
  5. and regularly reviewing and monitoring the model’s performance.
17
Q

What is a black box model?

A

A black box model refers to a type of model where it’s difficult to understand how the model arrived at its predictions.

18
Q

What is one way to evaluate model fairness?

A

One way to evaluate model fairness is by checking how the model’s error is distributed over a population. If the model mainly makes errors in specific, similar cases it could carry higher ethical risk.

19
Q

What is a potential risk in decision-making in machine learning?

A

A potential risk is exposing a business and the people it serves to negative consequences

20
Q

Where does bias in machine learning originate from?

A

from human bias

21
Q

Why can bias in machine learning be deceptive?

A

It can be deceptive because even though the bias stems from humans, the computer making the prediction can give the result an appearance of objectivity.

22
Q

What are some questions to help consider faireness of your model?

A
  • If your model uses personal information, have these people given their consent for you to collect and use this data?
  • Is there a way for them to withdraw their consent?
  • Are they aware of what you’re doing with their information?
23
Q

Explain the bias-variance trade-off in machine learning.

A

the trade-off between a model’s ability to fit the training data (low bias) and its ability to generalize to new, unseen data (low variance).

  • High bias can result in underfitting,
  • High variance can lead to overfitting

Balancing these aspects is essential for optimal model performance.