Internet of Things, Bias Flashcards

Also Mis v. Dis & Big Data

1
Q

True or False: It’s impossible to create a perfect machine learning model that doesn’t contain any bias or any variance.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define Implicit Biases

A
  • occur when we make assumptions based on our personal experience
  • manifests itself as attitudes and stereotypes we hold about others unconsciously
  • might look for information that would support our beliefs and hypotheses and disregard information that doesn’t
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define Group Attribution Bias

A

occurs when humans assume that one’s characteristics are always determined by what the group believes, or that a group’s decisions are influenced by members’ feelings

How it Manifests
1. In-Group Bias = When you give preference to your own group
2. Out-Group Bias = When you stereotype members of groups you don’t belong to
* ex. Engineers likely to view you as better qualified if you attended the same school)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define Over-Generalization Bias

A

occurs when a person applies something from one event to all future events (assuming what you see in a data set, so you’ll likely find what you think you’d see)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define Reporting Bias

A

occurs when you include only a subset of results in an analysis, which typically only covers a small fraction of evidence.

Forms
* Citation Bias = Analyzing data based on studies found in citations of other studies
* Language Bias = Excluding reports not written in the scientist’s native language
* Publication Bias = Choosing studies with positive findings rather than negative findings

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Define Selection Bias

A

occurs when you try to do so in an attempt to mitigate bias

  • ex. Choosing random people in a survey, and then only selectively using those people based on variables like first names, age, etc
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Define Automation Bias

A

occurs when you take this AI-based recommendation and use it before verifying if the information was right

  • ex. When typing in something in a search engine, suggestions may come up that may not be what you truly want it to, but you chose it because it was the first thing that showed up
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Define Systemic Bias

A

occurs when certain social groups are favored and others are devalued (usually occurs when it’s not intentional)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

It is possible to create a perfect machine learning model that does not contain any bias or variance.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Internet of Things

A

collective network of connected devices and the technology that facilitates communication between devices and the cloud (sensors, software, etc.) , as well as between the devices themselves

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Synthetic Data

A
  • potential ability to control the output that allows to produce a more balanced/clean/useful synthetic set
  • mitigate bias by complimenting it what it unseen
  • balance data
  • provide insight on what kind of data is in the model
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Variance

A

the variability in the model prediction

  • High variance models pay too much attention to training data and don’t generalize on data they haven’t seen before, leading to good results on training data but high error rates on test data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Underfitting

A

when a machine learning model fails to capture the underlying trend of the data (because it is too simple to explain variance)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Overfitting

A

when a model is trained with so much data that it begins to learn from the noise and inaccurate data entries in the data set (forcefitting, too good to be true)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Underrepresented

A

saying something is more than what it is
* ex. Saying everyone did phenomenal on a test when only 5/18 did

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Overweight

A

too much of one specific type
* ex. Too much data concerning one race

17
Q

Machine Learning

A
  • How computers recognize patterns and make decisions w/o being explicitly programmed. You can program a computer to learn through trial and error, practice, and experience. But, it is only as good as the training data you put into it.
  • enables search algorithms to search out the underlying meaning of words
18
Q

Bias

A
  • Biased data favors some things and de-prioritizes/excludes others. There’s a chance of human bias in data.
  • Data that does not accurately reflect the full population or phenomenon being studied