Numerical Methods Part 1 M1A:3 Flashcards
A measurement that represent the middle or center of the data
Central tendency
The average of the population data is what?
Population mean (u)
A number that is calculated from all of the population measurements that describes some aspect of the population
Population parameter
A number calculated using the sample measurements that describes some aspect of the sample
Sample statistic
What are the 3 measures of central tendancy and how do you get them?
Mean: the average or expected value. Add all and divide by number.
Median: the middle number. If there are two get the middle two numbers mean
Mode: the most frequent number is the set
The sample mean (x line over it) is what?
For a sample of size n the sample mean is a point estimate of the population mean.
It is the value to expect, on average, in the long run.
What must you do so the median is accurate
Arrange in order highest to lowest
A population or sample measurement that occurs most frequently
Mode
When data is arranged in classes, the class with the highest frequency is what?
The modal class, the highest class in the fequency
Of the mean median and mode are equal then the distribution is called what?
A normal distribution otherwise it is left or right skewed.
When do you want to use the mode as central tendency?
When there are outliers that will throw off the average.
Name the three measures of variation
- Range
- Variance
- Standard deviation
The largest minus the smallest measurements of a distribution is called what?
The range
The average of the squared deviations of all the population measurements from the population mean
Variance
The square root of the variance
The standard deviation
What is this calculation?
= (x1-u)^2 + (x2-u)^2 + (Xn-u)^2
-——————————————
N
This one?
= (x1-u)^2 + (x2-u)^2+ (xn-u)^2
-——————————————
N-1
- The population variance
- The sample variance don’t have the population total so had to calculate a sample variance so you use -1 and make a degrees of freedom
What are the three things you must have in order to use the empirical rule?
- A normal curve
- A mean
- A standard deviations
If the population measurements fall within one standard deviation of the mean it falls in what percentage?
68.4% of all measurements will fall within one standard deviation from the mean.
If the population measurements fall within two standard deviation s of the mean it falls in what percentage?
95.5% of all measurements will fall within one standard deviation from the mean.
If the population measurements fall within three standard deviations of the mean it falls in what percentage?
99.7% of all measurements will fall within one standard deviation from the mean.
For any x in a population sample, the associated z score is what (use the formula) and what is it?
Z= x-mean
—————
Stan dev
The z score is the number of standard deviations that x is from the mean
Positive and negative z scores are associated with the mean how?
- A positive z score is for x above the mean
- A positive z score is for x below the mean
When there are two or more populations that have different means and different standard deviations you use this formula to measure the size of the standard deviation relative to the mean.
Tell which population is less variation
The coefficient of variation
= standard deviation/mean x 100%
If you don’t have a normal distribution and have a skewed curve.
For set measurements arranged in increasing order, measurements fall at or below the value and (100-p) percent of fall at or above the value.
This is what?
Percents and quartiles
Explain the breakdown of the quartiles. Give an example of when we use them.
- the first quartile Q1 is the 25th percentile
- the second quartile Q2 is in the 50th percentile (median)
- the third quartile Q3 is in the 75th percentile
The interquartile range IQR is Q3-Q1
This is used in grades with classes
If you arrange the data in increasing order and calculate using this formula
į= (p/100)n
What are you trying to obtain and what is it called?
You are calculating percentiles; p is the percentile to find.
If your answer to į is 1.2 then what would you do?
Round to the higher integer so the answer is 1.2=2
To summarize data from a distribution that is skewed you’ll want to obtain what?
- The smallest measurement
- The first quartile
- The median
- The third quartile
- The largest measurement
Measurements that are different from the other measurements: much higher or lower
An outlier