Module 1 - Data and Foundations Flashcards

1
Q

The 2 types of data

A

Quant: temperature (continuum), integer variables (discrete), etc.

Qual: toggles that switch betw. controls (logical/state), categorical (ie. red buttons)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

“Event”?

A

Event = a specific outcome with a probability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Axioms of probabiltiy

A

P(S) = 1 (probability of some outcome S in sample space happening is 1)

0 <= P(S) <= 1

If I have 2 MUTUALLY EXCLUSIVE events: P(E1 U E2) = P(E1) + P(E2)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Random variables

A

Attach numerical labels to an outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Example of random variable

A

If random variable value is ‘grey’ then number assigned is 4

T is the random variable for any temperature value on a continuum scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Cumulative probability function for discrete random variable

A

F_X(R) - P(x <= k) = SUM(xi*Pi) (xi=i) for all i = 0 –> k

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Cumulative probability function for continuous random variable

A

f_X(a) = P(a - st.dev < x < a + st.dev)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Cumulative distribution for continuous random var

A

P(x < k) = INT(f_X(x) dx)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Expected value for discrete cases

A

xi*P_X(xi) –> sum for all xi and P_X(xi)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Expected value for continuous cases

A

E{X} = INT(x*f_X(x) dx)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Linear operator conditions for expected value

A

Commutative: E{X1 + X2} = E{X1} + E{X2}

Mult. by scalar: E{kX} = kE{X}

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Human-centred insights

A

Ethnography

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

________ ________ brings in human-centred insights, which is another kind of _____.

A

Design thinking

Data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Anthropogenic

A

Made by humans

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Problems with surveillance tech to collect data from the public

A
  • Social policies bring in questions of privacy
  • Who owns the tech? Are they ethical?
  • Misidentification, inappropriate conclusions drawn from data
  • Gaps in monetary network (too expensive)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

We need a ___________ data set to feed a model so it is trained for the appropriate task

A

Representative

17
Q

In engineering, we use data to help develop solutions that lie at the intersection of…

A

Technical feasibility, social desirability, financial viability

18
Q

Sources of variability

A

Ambient fluctuations

Instrumentation and measurement fluctuations

(For chem processes) Models and system/process representation

19
Q

Ambient fluctuations

A

Disturbances within a process or physical system

Human interventions

External disturbances

20
Q

Instrumentation and measurement fluctuations

A

Electronic noise

Physical location of instrument

21
Q

Models and system/process representation

A

Assumption of well-mixed is spatially distributed

Associated with simplification in model form

22
Q

Active collection

A

Make a series of planned moves on process

Increases info content, guarantees “causality”/”cause and effect” relationships

22
Q

Passive collection

A

Record process values without actively intervening

“Historical” databases = passively collected data, ie. browser history

23
Q

Role of statistical methods

A
  1. Decision-making under uncertainty
  2. Categorizing and modelling variability (ie. Poisson distr w/ mean, var)
  3. Basis for “variability accounting” –> how it propogates
  4. Data “microscope”
  5. Effective presentation of results in graph/quant forms