Lecture 2: Probability Distributions Flashcards
Frequency distributions
Summarise observed outcomes in a sample
Contingency table
Used to describe the joint frequency distribution, and possibly relationship, between two categorical variables
The standard normal distribution (Z-distribution)
A standardised version of the normal distribution, rescaled to have a mean of 0 and a standard deviation of 1
Discrete random variables
Variables with a finite or countable number of possible outcomes
Continuous random variables
Variables with an infinite number of possible outcomes
Indexing a contingency table
Telling other people which cell in the contingency table to look at
Row = i, column = j, frequency = f
E.g., look at index i = 2 means to look at row 2
Joint frequency distribution
Frequency of a combination of two (or more) variables
E.g., the frequency of Dutch students with a tattoo
Marginal frequency distribution
Looks at multiple variables and how often they occur
E.g., “tattoo” and “Dutch”
Marginal probability distribution
Divide marginal totals by the global totals (i.e., the entire sample)
If the entire sample is 74 and 21 people of this sample are non-Dutch, then the probability of someone being non-Dutch is 21/74 = 0.28
Conditional probability distribution
Divide the row- or column frequencies by the total of the row/column that we are interested in
E.g., what is the conditional probability of having a tattoo for international students? P (Tattoo | Dutch => chance of having a tattoo, given that you are Dutch)
Joint probability distribution
Dividing the cell frequency by the global total
E.g.., what is the joint probability of someone being Dutch and having a tattoo?
P (Dutch ∩ Tattoo)
The symbol ∩ refers to ‘and’