Statistical Methods - Lectures 3-4 Flashcards
What are scatterplots useful for?
Scatterplots are useful for visualizing the relationship between two numerical variables.
How do you describe the distribution of a quantitative variable in a graph?
Describe the overall pattern (shape, centre, spread, etc.) and any deviations from that pattern (e.g. outliers).
How do you calculate the sample mean?
The sample mean, denoted by x̄, can be calculated as:
x̄ = (x1 + x2 + … + xn) / n
The sample mean is a sample statistic, and serves as a point estimate of the population mean.
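As a quick illustration of the formula above, here is a minimal Python sketch. The function name sample_mean and the five-value sample are made up for this example.

```python
def sample_mean(values):
    """Return x̄ = (x1 + x2 + ... + xn) / n for a non-empty sample."""
    if not values:
        raise ValueError("sample_mean requires at least one observation")
    return sum(values) / len(values)


# Hypothetical sample of five observations
sample = [2, 4, 4, 5, 10]
x_bar = sample_mean(sample)
print(x_bar)  # 5.0
```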
What is the population mean?
The population mean is a population parameter computed the same way using all values in the population and is denoted by μ.
What do histograms provide?
Histograms provide a view of the data density. Higher bars represent where the data is more common.
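To see what the bars of a histogram count, the sketch below groups a small made-up sample into bins of equal width and prints one "#" per observation. The helper name bin_counts, the bin width of 2, and the data are all assumptions for illustration; empty bins are simply omitted.

```python
from collections import Counter


def bin_counts(values, bin_width):
    """Count observations per bin [left, left + bin_width), keyed by left edge."""
    counts = Counter((v // bin_width) * bin_width for v in values)
    return dict(sorted(counts.items()))


# Hypothetical sample: most values fall between 2 and 4, one lies far to the right
sample = [1, 2, 2, 3, 3, 3, 4, 4, 5, 9]
counts = bin_counts(sample, 2)

# Longer rows of '#' mark the bins where the data is more common
for left, count in counts.items():
    print(f"[{left}, {left + 2}): {'#' * count}")
```

The longest row, for the bin [2, 4), corresponds to the highest bar in the histogram.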
What are commonly observed shapes of distributions?
Modality: unimodal, bimodal, multimodal, uniform
Skewness: right skew (tails off to the right), left skew (tails off to the left), symmetric
What are outliers?
An outlier is an observation that lies an abnormal distance from other values in a random sample from a population.
How do you find the sample variance?
The sample variance, denoted by s^2, is roughly the average squared deviation from the mean:
s^2 = [∑_{i=1}^{n} (xi - x̄)^2] / (n - 1)
Why do we use the n-1 in the calculation of variance?
Dividing by n-1 rather than n makes the sample variance an unbiased estimate of the population variance.
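One way to check this claim on a small case is to list every possible sample and average the resulting variance estimates. The sketch below assumes a made-up population {1, 2, 3} and samples of size 2 drawn with replacement; exact fractions are used to avoid rounding. The helper name scaled_squared_deviation is hypothetical.

```python
from fractions import Fraction
from itertools import product

# Hypothetical tiny population; its variance uses the divisor N (all values known)
population = [1, 2, 3]
N = len(population)
mu = Fraction(sum(population), N)
pop_var = sum((x - mu) ** 2 for x in population) / N


def scaled_squared_deviation(sample, divisor):
    """Sum of squared deviations from the sample mean, divided by `divisor`."""
    x_bar = Fraction(sum(sample), len(sample))
    return sum((x - x_bar) ** 2 for x in sample) / divisor


n = 2
# All 9 equally likely ordered samples of size 2, drawn with replacement
samples = list(product(population, repeat=n))

avg_with_n_minus_1 = sum(scaled_squared_deviation(s, n - 1) for s in samples) / len(samples)
avg_with_n = sum(scaled_squared_deviation(s, n) for s in samples) / len(samples)

print(pop_var, avg_with_n_minus_1, avg_with_n)  # 2/3 2/3 1/3
```

In this example, averaging over all samples with the divisor n-1 recovers the population variance exactly, while the divisor n underestimates it.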
How do you find the standard deviation?
The standard deviation is the square root of the variance: s = √(s^2).
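The sample variance and standard deviation can be computed step by step as in the sketch below. The function names and the sample are made up for illustration; Python's standard library also offers statistics.variance and statistics.stdev, which use the same n-1 divisor.

```python
import math


def sample_variance(values):
    """Sum of squared deviations from the sample mean, divided by n - 1."""
    n = len(values)
    if n < 2:
        raise ValueError("sample_variance requires at least two observations")
    x_bar = sum(values) / n
    return sum((x - x_bar) ** 2 for x in values) / (n - 1)


def sample_sd(values):
    """Square root of the sample variance."""
    return math.sqrt(sample_variance(values))


# Hypothetical sample with mean 5: deviations are -3, -1, -1, 0, 5
sample = [2, 4, 4, 5, 10]
print(sample_variance(sample))  # 9.0
print(sample_sd(sample))        # 3.0
```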
How can you describe the centre and spread of a distribution?
Centre:
- Mean
- Median
Spread:
- Variance
- Standard Deviation
- Range
- Interquartile Range
How do you find the interquartile range and the range of the data?
IQR = Q3 - Q1
Range = Max value - Min value
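A short sketch of both calculations follows. Textbooks and software use slightly different conventions for locating Q1 and Q3, so the choice here (statistics.quantiles with method="inclusive") is an assumption and may differ from the convention used in the lectures. The function names and the sample are also made up.

```python
import statistics


def interquartile_range(values):
    """IQR = Q3 - Q1, with quartiles from the 'inclusive' convention."""
    q1, _median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return q3 - q1


def data_range(values):
    """Range = max value - min value."""
    return max(values) - min(values)


# Hypothetical sample of nine observations: Q1 = 4 and Q3 = 9 under this convention
sample = [2, 4, 4, 5, 7, 8, 9, 10, 12]
print(interquartile_range(sample))  # 5.0
print(data_range(sample))           # 10
```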
What is the five-number summary of the data?
The minimum, Q1, the median, Q3 and the maximum are called the five-number summary of the data.
What does the box in a box plot represent?
The box in a box plot represents the middle 50% of the data, and the thick line in the box is the median.
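What are the whiskers of a box plot?
The whiskers extend from the box to the most extreme observations that lie within the maximum reach:
Max upper whisker reach = Q3 + 1.5 × IQR
Max lower whisker reach = Q1 - 1.5 × IQR
A potential outlier is defined as an observation beyond the maximum reach of the whiskers.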
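The 1.5 × IQR rule can be sketched as below. The quartile convention (statistics.quantiles with method="inclusive"), the function names, and the sample are assumptions for illustration; other quartile conventions can give slightly different limits.

```python
import statistics


def whisker_limits(values):
    """Return (Q1 - 1.5 * IQR, Q3 + 1.5 * IQR), the maximum whisker reach."""
    q1, _median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr


def potential_outliers(values):
    """Observations that fall beyond the maximum reach of either whisker."""
    lower, upper = whisker_limits(values)
    return [v for v in values if v < lower or v > upper]


# Hypothetical sample: Q1 = 4, Q3 = 9, so IQR = 5 under this convention
sample = [2, 4, 4, 5, 7, 8, 9, 10, 30]
print(whisker_limits(sample))      # (-3.5, 16.5)
print(potential_outliers(sample))  # [30]
```

Here the upper whisker would stop at 10, the largest observation within reach, and 30 would be drawn as a separate point.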