Outliers And Data Flashcards
How to find an outlier
Outlier is any value greater then Q3 +k(Q3-Q1)
Or
Less than Q1 -k(Q3-Q1)
In histograms, what is proportional to frequency
Area is proportional to frequency in histogram bars
Benefits of histograms
Histograms give a good picture of how the data is distributed. Enables you to see rough location, general shape and how spread out the data is
How to calculate height of bar histogram
Area of bar = k x frequency
Frequency density
Frequency density is frequency/class width
What are used to c9mpare data
Mean and standard deviation or median and interquartile range are used to compare data
Discrete data
Discrete data is data which falls into distinct catagories
Continuous data
Continuous data is data which can be anywhere along a range of values
What distribution does continuous data use
Continuous data uses normal distribution
What distribution does continuous data use
Continuous data uses binomial distribution
How to measure strength of correlation
Measure strength of correlation is when points are closer to a straight line
Association
Association is connected ness of 2 variables which aren’t necessarily linear. Data can be curved
What is explanatory variable
Explanatory variable is independent variable.
What is correlation
Correlation is description of linear relationship between 2 variables.
What is hypothesis
Hypothesis is statement made about value of a population parameter. E.g lamps have 0.1 chance of breaking