Khan Academy: Analyzing categorical data Flashcards
What does it mean when two variables have “ no association”, or are “independent”?
It means that P(A|B)=P(A)
Ref
Ref 2 Example
For this quiz, no association between IVY school and taking piano is like saying:
if there’s no association (dependence) then:
P(IVY|Piano)= P(IVY)
or
P(Pinao|IVY)=P(Piano)
What is meant by “two”-way frequency table?
Notice that there are two variables– gender and preference– this is where the two in two-way frequency table comes from.
What are two-way frequency tables good for?
Two-way relative frequency tables show us percentages rather than counts. They are good for seeing if there is an association between two variables.
Two-way relative frequency tables are useful when ____.
there are different sample sizes in a dataset
What is marginal distribution? How can it be represented?
The sum of the numbers row wise and column wise in a two-way table.
Can be represented as counts or percentages
What is conditional distribution? How can it be represented?(Percentage/Number/Both)
distribution of one variable given something true about the other variable.
It’s represented as precentages