probability / gradient descent Flashcards

1
Q

what is gradient descent

A

numerical method for finding the input to a function f that minimizes the function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

most common use for gradient descent

A

minimizing empirical risk

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

when is gradient descent guaranteed to work

A

convex function (happy face)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

convex function

A

a function f is convex if, for every a, b in the domain of f, the line segment between (a, f(a)) and (b, f(b))

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

if f(t) is a function of a single variable and twice differentiable

A

f(t) is convex if and only if (d^2f)/(dt^2) (t) >= 0 for all t

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

if f(t) is convex and differentiable

A

then gradient descent converges to a global minimum of f as long as the step size is small enough

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

nonconvex functions and gradient descent

A

gradient descent might work, but not guaranteed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

choosing a step size

A

constant step size, alpha
t(i+1) = ti - alpha(df/dt)(ti)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

experiment

A

some process whose outcome is random

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

set

A

an unordered collection of items, usually denoted with curly brackets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

sample space

A

set of all possible outcomes of an experiment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

event

A

subset of the sample space or a set of outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what do probabilities mean

A

if probability(E) = p, then if we repeat our experiment infinitely many times, the proportion of repetitions in which event E occurs is p

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

the sum of the probabilities of each outcome

A

must be exactly 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

probability distribution

A

describes the probability of each outcome s in a sample space S

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

probability of each outcome

A

must be between 0 and 1

12
Q

the probability of an event

A

sum of the probabilities of the outcomes in the event

13
Q

if S (sample space) w/ n possible outcomes and all outcomes are equally likely

A

then the probability of any one outcome occurring is 1/n

14
Q

probability of an event

A

of outcomes in event/ # of outcomes in sample space

15
Q

complement rule

A

it is a complement if it contains the set of all outcome in the space that are not in the circle or A

16
Q

mutually exclusive

A

no overlap, or they both can’t happen at the same time

17
Q

if A and B are mutually exclusive, then the probability that A or B happens is:

A

P(A U B) = P(A) + P(B)
U = union or symbol for or

18
Q

principle of inclusion-exclusion

A

if A and B are any two events, then P(A U B) = P(A) + P(B) - P(A intersect B)

19
Q

the probability that events A and B both happen is

A

P(A intersect B ) = P(A)P(B|A)

<– given that

20
Q

P(B|A)

A

the probability that B happens given that A happened

21
Q

if P(B|A) = P(B)

A

then A and B are independent

22
Q

probability that events A and B both happen

A

P(A intersect B) = P(A)P(B|A)

23
Q

if P(B|A) = P(B)

A

A and B are independent

24
Q

conditional probability

A

the probability of an event may change if we have additional information about outcomes

25
Q

population

A

choosing k elements randomly from a group of n possible elements

26
Q

sequence of length k

A

obtained by selecting k elements from a group of n possible elements with replacement, ORDER MATTERS

27
Q

number of ways k elements from a group of n possible elements with replacement

A

n^k

28
Q

permutation

A

selecting k elements from a group of n possible elements without replacement

29
Q

number of ways to select k elements from a group of n possible elements without replacement and order matters

A

P(n,k) = (n)(n-1) … (n - k + 1) = n! / (n - k)!

30
Q

combination

A

set of k elements selected from a group of n possible elements without replacement , order doesn’t matter

31
Q

law of total probability

A

if A is an event and E1, E2, and Ek is a partition of S, then Probability(A intersect E1) + P(A intersect E2) + …. + P(A intersect Ek)

32
Q

Bayes Theorem

A

P(B|A) = ((P(B)P(A|B))/(P(A))

33
Q
A