Information Theory Flashcards
log(A) + log(B)
log(AB)
Entropy for N equiprobable events
Entropy = E[-log p(x)] = -sum over classes p(x_c) log p(x_c)
For N equiprobable events, p(x_c) = 1/N, so
entropy = log(N)
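A minimal sketch (assuming NumPy; the helper `entropy` and the example size C = 8 are illustrative, not part of the flashcards) checking that the entropy of C equiprobable events is log(C):

```python
import numpy as np

def entropy(p, base=2):
    """H(p) = -sum_c p_c * log(p_c); zero-probability classes are skipped."""
    p = np.asarray(p, dtype=float).ravel()
    p = p[p > 0]
    return -np.sum(p * np.log(p)) / np.log(base)

C = 8
uniform = np.full(C, 1.0 / C)        # equiprobable: p(x_c) = 1/C
print(entropy(uniform))              # 3.0 bits
print(np.log2(C))                    # log2(8) = 3.0, matches log(C)
```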
Is entropy ever negative?
No. Since 0 <= p(x) <= 1 for a discrete distribution, each term -p(x) log p(x) >= 0, so the sum is never negative.
When is entropy additive?
For independent events
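A small check of additivity (assumed NumPy sketch; the distributions px and py are made up for illustration): for independent X and Y, H(X, Y) = H(X) + H(Y).

```python
import numpy as np

def entropy(p, base=2):
    p = np.asarray(p, dtype=float).ravel()
    p = p[p > 0]
    return -np.sum(p * np.log(p)) / np.log(base)

px = np.array([0.5, 0.5])
py = np.array([0.9, 0.1])
joint = np.outer(px, py)             # independence: p(x, y) = p(x) * p(y)

print(entropy(joint))                # H(X, Y)
print(entropy(px) + entropy(py))     # H(X) + H(Y), same value
```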
Interpretation of -log p_i
Number of bits required to represent the i-th symbol efficiently (e.g., a symbol with p_i = 1/8 needs -log2(1/8) = 3 bits)
Shannon bit
0 or 1, but not both simultaneously (as opposed to qubit)
1 bit = amount of info gained from observing one of two equiprobable outcomes
If we have a priori info about which event is more probable, the amount of info gained is < 1 bit
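A quick illustration (assumed NumPy sketch; the example probabilities are made up): a fair coin yields exactly 1 bit per flip, while a biased coin, whose likelier outcome we already suspect, yields less.

```python
import numpy as np

def entropy_bits(p):
    p = np.asarray(p, dtype=float)
    p = p[p > 0]
    return -np.sum(p * np.log2(p))

print(entropy_bits([0.5, 0.5]))      # 1.0 bit (two equiprobable events)
print(entropy_bits([0.9, 0.1]))      # ~0.469 bits, i.e. < 1 bit
```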
Shannon Coding
Minimize the number of bits required to represent a message
Done by assigning shorter codes to symbols with higher probabilities
Ideal mean length = -sum_x p_x log_k(p_x), where k = arity of the code (e.g., 2 for binary) and p_x is the probability of symbol x
The code must be efficient but also decodable without ambiguity (unique, prefix-free codewords)
Here, -log_k(p_x) is the optimal code length (in base-k digits) for symbol x and p_x is the probability of x occurring
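To make this concrete, here is a sketch using a Huffman construction (standard library only; Huffman coding is used as a stand-in for "shorter codes for more probable symbols", and the symbol probabilities are made up). The mean code length is compared against the base-2 ideal -sum_x p_x log2(p_x).

```python
import heapq
import math

probs = {"a": 0.5, "b": 0.25, "c": 0.125, "d": 0.125}

# Build a binary Huffman code: repeatedly merge the two least probable nodes,
# prefixing '0'/'1' to the partial codes accumulated so far.
heap = [(p, i, {sym: ""}) for i, (sym, p) in enumerate(probs.items())]
heapq.heapify(heap)
counter = len(heap)
while len(heap) > 1:
    p1, _, codes1 = heapq.heappop(heap)
    p2, _, codes2 = heapq.heappop(heap)
    merged = {s: "0" + c for s, c in codes1.items()}
    merged.update({s: "1" + c for s, c in codes2.items()})
    heapq.heappush(heap, (p1 + p2, counter, merged))
    counter += 1
codes = heap[0][2]

mean_len = sum(probs[s] * len(code) for s, code in codes.items())
ideal = -sum(p * math.log2(p) for p in probs.values())
print(codes)                 # more probable symbols get shorter codewords
print(mean_len, ideal)       # 1.75 == 1.75 for these dyadic probabilities
```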
Convert base of log
log_c(a) = log_b(a)/log_b(c)
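A one-off numeric check of the conversion rule (plain math module; the value a = 10 is arbitrary):

```python
import math

a = 10.0
print(math.log2(a))                  # log_2(10) directly
print(math.log(a) / math.log(2))     # log_e(10) / log_e(2), same value
```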
Interpretation of cross-entropy
- sum_x p_x log(q_x) where p is the true distribution and q is the predicted distribution
Avg bits required to transmit symbols from p when a code optimized for q is used instead
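A sketch of the cross-entropy computation (assumed NumPy; p and q below are illustrative distributions). Note H(p, q) >= H(p), with equality only when q = p.

```python
import numpy as np

def cross_entropy(p, q):
    """H(p, q) = -sum_x p_x log2(q_x), skipping x with p_x = 0."""
    p, q = np.asarray(p, dtype=float), np.asarray(q, dtype=float)
    mask = p > 0
    return -np.sum(p[mask] * np.log2(q[mask]))

p = np.array([0.5, 0.25, 0.25])      # true distribution
q = np.array([0.8, 0.1, 0.1])        # predicted distribution
print(cross_entropy(p, p))           # 1.5 bits = H(p), the minimum
print(cross_entropy(p, q))           # ~1.822 bits, the cost of coding with q
```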
Interpretation of KL divergence
KLD(P||Q) = H(P, Q) - H(P)
i.e., cross-entropy - entropy (not symmetric)
i.e., the additional bits required to transmit p because we use the predicted distribution q instead of the true distribution p
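The same example distributions (assumed NumPy sketch) show KLD as the gap between cross-entropy and entropy:

```python
import numpy as np

def entropy(p):
    p = np.asarray(p, dtype=float)
    p = p[p > 0]
    return -np.sum(p * np.log2(p))

def cross_entropy(p, q):
    p, q = np.asarray(p, dtype=float), np.asarray(q, dtype=float)
    mask = p > 0
    return -np.sum(p[mask] * np.log2(q[mask]))

def kld(p, q):
    return cross_entropy(p, q) - entropy(p)     # H(P, Q) - H(P)

p = np.array([0.5, 0.25, 0.25])
q = np.array([0.8, 0.1, 0.1])
print(kld(p, q))                     # ~0.322 extra bits; 0 only when q == p
```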
Is KLD a distance measure? Why/Why not?
No, since it is not symmetric (KLD(P||Q) != KLD(Q||P) in general) and does not satisfy the triangle inequality
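A quick asymmetry check (assumed NumPy sketch, same illustrative distributions as above):

```python
import numpy as np

def kld(p, q):
    p, q = np.asarray(p, dtype=float), np.asarray(q, dtype=float)
    mask = p > 0
    return np.sum(p[mask] * np.log2(p[mask] / q[mask]))

p = np.array([0.5, 0.25, 0.25])
q = np.array([0.8, 0.1, 0.1])
print(kld(p, q))                     # ~0.322
print(kld(q, p))                     # ~0.278, so KLD(P||Q) != KLD(Q||P)
```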
Mutual Information
I(X, Y) = H(X) + H(Y) - H(X, Y)
I(X, Y) = H(Y) - H(Y|X)
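Both identities can be checked on a small joint distribution (assumed NumPy sketch; the joint table is made up for illustration):

```python
import numpy as np

def entropy(p):
    p = np.asarray(p, dtype=float).ravel()
    p = p[p > 0]
    return -np.sum(p * np.log2(p))

joint = np.array([[0.3, 0.1],        # p(x, y): rows index x, columns index y
                  [0.2, 0.4]])
px = joint.sum(axis=1)
py = joint.sum(axis=0)

# I(X, Y) = H(X) + H(Y) - H(X, Y)
mi_1 = entropy(px) + entropy(py) - entropy(joint)

# I(X, Y) = H(Y) - H(Y|X), with H(Y|X) = sum_x p(x) * H(Y | X=x)
h_y_given_x = sum(px[i] * entropy(joint[i] / px[i]) for i in range(len(px)))
mi_2 = entropy(py) - h_y_given_x

print(mi_1, mi_2)                    # equal, ~0.125 bits for this joint
```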