Overall Flashcards
What are the two major types of data?
Categorical (qualitative) and metric (quantitative)
What are the two subtypes of categorical (qualitative) data?
Nominal and ordinal
What are the two subtypes of metric (quantitative) data?
Continuous and discrete
What does nominal data relate to?
It labels variables without any order or quantitative value. It usually relates to named things, has no units of measurement, and each value is allocated to a specific category
What does ordinal data relate to?
The values can be meaningfully ordered, but the data is still categorical because each value is assigned to a specific category
What does discrete data relate to?
The values are distinct and countable, and they can have units of measurement. They are typically integers, such as counts
What does continuous data relate to?
Values that result from measurement and can be fractional numbers; they can have units of measurement
In a box (and whisker) plot, what are the adjacent values (defined in this specific course)?
The data points furthest from the median that still lie within 1.5 times the interquartile range
In a box (and whisker) plot, what are the points outside the adjacent values?
Potential outliers
What is the interquartile range?
The upper quartile (the 3/4 point) minus the lower quartile (the 1/4 point)
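The box plot cards above can be sketched in code. This is a minimal illustration, not the course's official procedure: the function name `box_plot_summary` is hypothetical, the 1.5 x IQR distance is assumed to be measured from the quartiles (the edges of the box), and the "inclusive" quartile convention is an assumption, since other conventions give slightly different quartiles.

```python
import statistics


def box_plot_summary(values):
    """Return the IQR, adjacent values and potential outliers of a sample."""
    data = sorted(values)
    # Three quartile cut points. method="inclusive" is an assumed convention.
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1  # upper quartile minus lower quartile

    # Assumption: the 1.5 * IQR limit is measured from the quartiles.
    lower_limit = q1 - 1.5 * iqr
    upper_limit = q3 + 1.5 * iqr

    inside = [x for x in data if lower_limit <= x <= upper_limit]
    outliers = [x for x in data if x < lower_limit or x > upper_limit]

    return {
        "median": median,
        "iqr": iqr,
        # Adjacent values: the most extreme points still inside the limits.
        "adjacent": (inside[0], inside[-1]),
        # Points beyond the adjacent values are potential outliers.
        "outliers": outliers,
    }


summary = box_plot_summary([1, 2, 3, 4, 5, 6, 7, 8, 9, 30])
print(summary)
```

With this sample, the value 30 falls beyond the upper limit, so the whisker would stop at 9 and 30 would be drawn as a potential outlier.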
What is the sample standard deviation?
The square root of the sum of (each value minus the mean) squared, divided by (sample size - 1): s = sqrt( sum((x_i - mean)^2) / (n - 1) )
What are the residuals in the standard deviation equation?
Value minus the mean
What is the variance when the sample standard deviation is s?
s squared
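The standard deviation cards above can be checked with a short calculation. This is a sketch under stated assumptions: the helper name `sample_std` and the example data are made up for illustration, and the result is compared against Python's built-in `statistics.stdev` and `statistics.variance`, which also divide by n - 1.

```python
import math
import statistics


def sample_std(values):
    """Sample standard deviation: sqrt(sum of squared residuals / (n - 1))."""
    n = len(values)
    mean = sum(values) / n
    # Residuals: each value minus the mean.
    residuals = [x - mean for x in values]
    # Variance: sum of squared residuals divided by (n - 1).
    variance = sum(r ** 2 for r in residuals) / (n - 1)
    return math.sqrt(variance)


data = [2, 4, 4, 4, 5, 5, 7, 9]  # illustrative values only
s = sample_std(data)

print("standard deviation s:", s)
print("variance s squared:", s ** 2)
print("matches statistics.stdev:", math.isclose(s, statistics.stdev(data)))
```

Squaring `s` recovers the variance, which is why the two cards give the same quantity in different forms.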
What is skewness and how is it measured?
A measure of the asymmetry of a distribution. It is measured by the skewness coefficient, which can vary between -1 and +1
What is the skewness coefficient for a symmetric distribution?
0
What is the skewness coefficient for a distribution with the mean to the left of the mode (most values are at the larger end of the range, with a long tail to the left in the negative direction)?
Closer to -1 (left or negative skew)
What is the skewness coefficient for a distribution with the mean to the right of the mode (most values are at the smaller end of the range, with a long tail to the right in the positive direction)?
Closer to +1 (right or positive skew)
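The skewness cards above do not say which coefficient the course uses. One coefficient that always lies between -1 and +1 is the quartile (Bowley) skewness, (Q3 + Q1 - 2*Q2) / (Q3 - Q1), so the sketch below uses it as an assumption. The function name `quartile_skewness`, the example data and the "inclusive" quartile convention are all illustrative choices.

```python
import statistics


def quartile_skewness(values):
    """Quartile (Bowley) skewness, which is bounded between -1 and +1.

    Assumption: this may not be the coefficient used in the course.
    Raises ZeroDivisionError if the upper and lower quartiles are equal.
    """
    q1, q2, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return (q3 + q1 - 2 * q2) / (q3 - q1)


symmetric = [1, 2, 3, 4, 5]
right_skewed = [1, 1, 2, 2, 3, 4, 6, 9, 15]  # long tail to the right
left_skewed = [-x for x in right_skewed]     # mirror image, tail to the left

print(quartile_skewness(symmetric))     # 0.0
print(quartile_skewness(right_skewed))  # 0.5
print(quartile_skewness(left_skewed))   # -0.5
```

The sign follows the direction of the long tail: positive for a tail to the right, negative for a tail to the left, and zero for a symmetric sample.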
Probability theory is based on set theory. What is contained in the set S (called the space)?
All sets are subsets of set S
What is the null set?
The set that contains no elements
For experimental events, what is an event represented by and what is an impossible event?
An event is a set and an impossible event is the null set
If sets A and B are mutually exclusive, what are P(A+B) and the intersection of A and B?
P(A+B) = P(A) + P(B), and AB = {} (the null set)
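The set-theory cards above can be illustrated with a fair six-sided die. This is a sketch under stated assumptions: the die, the equally likely outcomes and the helper name `prob` are illustrative choices, not part of the cards. The cards' A+B (union) and AB (intersection) correspond to Python's `|` and `&` set operators.

```python
from fractions import Fraction

# The space S: every possible outcome of one roll of a die (assumed fair).
S = frozenset({1, 2, 3, 4, 5, 6})


def prob(event):
    """Probability of an event (a subset of S) with equally likely outcomes."""
    return Fraction(len(event & S), len(S))


A = frozenset({1, 2})   # event: roll a 1 or a 2
B = frozenset({5, 6})   # event: roll a 5 or a 6
null_set = frozenset()  # the impossible event

print("A and B are subsets of S:", A <= S and B <= S)
print("AB is the null set:", A & B == null_set)
print("P(A+B):", prob(A | B))
print("P(A) + P(B):", prob(A) + prob(B))
print("P(null set):", prob(null_set))
```

Because A and B share no outcomes, the probability of their union equals the sum of their separate probabilities.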
The conditional probability of A given B is defined as: P(A|B) =
P(AB)/P(B), provided P(B) > 0
For conditional probability, must there be a causal or temporal relation between A and B?
No, there may or may not be one
What does it mean if conditioning on B has no effect on the probability of A, that is, P(A|B) = P(A)?
Events A and B are statistically independent
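The conditional probability and independence cards above can be checked on a small example. This is a sketch, assuming a fair six-sided die; the helper names `prob` and `cond_prob` and the chosen events are hypothetical illustrations.

```python
from fractions import Fraction

# The space S: outcomes of one roll of a die (assumed fair).
S = frozenset(range(1, 7))


def prob(event):
    """Probability of an event (a subset of S) with equally likely outcomes."""
    return Fraction(len(event & S), len(S))


def cond_prob(a, b):
    """Conditional probability P(A|B) = P(AB) / P(B), defined only if P(B) > 0."""
    if prob(b) == 0:
        raise ValueError("P(B) must be greater than 0")
    return prob(a & b) / prob(b)


A = frozenset({2, 4, 6})     # event: the roll is even
B = frozenset({1, 2, 3, 4})  # event: the roll is at most 4
C = frozenset({4, 5, 6})     # event: the roll is at least 4

print("P(A):", prob(A))            # 1/2
print("P(A|B):", cond_prob(A, B))  # 1/2, so A and B are independent
print("P(A|C):", cond_prob(A, C))  # 2/3, so A and C are not independent
```

Knowing B leaves the probability of A unchanged, while knowing C changes it, even though no event here causes or precedes another.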