Terms/Everything else Flashcards
Based of the correlation coefficient, how do we know if there is a strong or weak association?
The closer the correlation coefficient value is to +/- 1, the stronger the association
What is the range the correlation coefficient will always fall into?
The correlation coefficient will always fall between -1 and 1. If it doesn’t you did something wrong
What is the correlation coefficient used for?
The correlation coefficient is used to determine the strength of the association
How do we know what type of association we have based on the covariance?
If they covariance value is positive, we have a positive association. If it is negative, we have a negative association
What is the covariance used for?
The covariance is used to measure the linear association of data.
How do we know if data is skewed to the right or left?
Whatever direction has the tail with the least amount of the data is the direction of the skewness
What are the different types of skewness of a distribution?
No skewness (normal distribution), moderately skewed to the left/right, highly skewed to the left/right
How do we know if a z-score value is considered an outlier?
If the absolute value of a z-score is greater than 3, then we know if the value we are looking at is an outlier
What are z-score used for?
Z-scores are used to determine if a specific value is an outlier
What is a census?
A census is data collected on an entire population
What is Time-Series data?
Time-series data is data that is collected over several periods in time.
What is Cross-sectional data?
Cross-sectional data is data that is collect at roughly the same point in time
What is Quantitative data?
Qualitative data is numerical data (a.k.a quantitative variable)
What is Qualitative data?
Qualitative data is non-numerical data (a.k.a categorical variable)
What is ratio data?
Ratio data is numerical data that is used to compare elements