Chapter 3 PMF Flashcards
What is a Probability Mass Function (PMF)?
a map of each value to its probability.
What is the class size paradox?
The process of getting biased results due to more observations of large population. (ex. larger class size are observed more, so observations must be normalized)
Give the three sentaxs for selecting serises data from a dataframe?
df[‘a’] (simple indexing to select a column)
df. loc[‘a’] (loc attricutre to select row by label)
df. iloc[‘0’] (loc attribute to select row by int position)
Calculate a Pmf from this list
[1, 2, 2, 3, 5]
{1: 0.2, 2: 0.4, 3: 0.2, 5: 0.2}
Mathmatical definition for mean from a Pmf?
Where xi are the unique values and pi is the probability of that value from the Pmf