Lesson 6 Flashcards

1
Q

When talking about a quantitative
variable, there are (at least) three
important aspects to discuss:

A

The center, The variability
/ spread,The shape

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When we talk about the center, we’re talking
about an “average” or “typical” value in the
dataset. This could refer to any of the following:

A

Mean (most common), Median, mode (rare cases)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

If we’re talking about a sample mean, we use

A

𝑥 (called “x bar”)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

If we’re talking about a population mean, we
use

A

µ (the Greek letter mu)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

In each case below, would it make more sense to use the mean, median, or mode to
describe the center of the data?
1. Income for residents of Elon, NC
2. Heights of newborn babies
3. Number of siblings
4. Exam scores in a class where all students did well

A
  1. Income for residents of Elon, NC
    Median since there would likely be outliers and it would not be spread out evenly.
  2. Heights of newborn babies
    Mean since the data values would be spread out evenly.
  3. Number of siblings
    Mode (also median or mean)
  4. Exam scores in a class where all students did well
    Both Mean and Median
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

The simplest measure of spread / variability is the range. The range is
the biggest value minus the smallest value. In other words,

A

Range = maximum value – minimum value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

An alternative way to measure the variability / spread is with the

A

Standard Deviation, The standard deviation is like a “typical” distance that a value might be
away from the mean. Let’s think about this with our exam example.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Which will be more resistant to outliers? The range or the standard
deviation?

A

The standard deviation is more resistant to outliers than the range.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Do you think the measure of spread you chose will be impacted much
by outliers or not really? What is your reasoning?

A

However, it is probably still impacted by them (you can test this with
the Mike example). It does use all of our data values in the calculation,
but it will be less impacted than the range.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Percentiles are..

A

Percentiles are considered a measure of location. They tell
you where a data value lies within your data by describing
the percentage of the values that are below a specific
value.
E.g. Ben Wyatt’s SAT score was at the 98th percentile.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Traditionally, percentiles are

A

Whole numbers,If you are at
the kth percentile, then k% of the sample (or population)
is below you.
Ben Wyatt’s SAT score was at the 98th percentile → 98% of
people had an SAT score below Ben’s score.
To calculate these, we count the number of other
observations below a value, divide by the total number of
observations, then convert to a percentage.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

During the 2019-2020 season, Elon’s women’s soccer team
scored 2.1 goals per game. This was the 31st best out of 335
NCAA Division 1 teams. Find their percentile rank.

A

335 – 31 = 304 teams below Elon.
(304 / 335) * 100 = 90.7
Elon’s goals per game was at the 90th percentile
(Note: we will always round down for percentiles)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Quartiles are a..

A

specific type of percentile. They split our data into
quarters by representing the 25th, 50th, and 75th percentiles.
* First quartile (or Q1) has 25% of the data below it.
* Second quartile (or Q2) has 50% of the data below it.
* Third quartile (or Q3) had 75% of the data below it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Question: What is another name for the second quartile?

A

Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

One other measure of spread/variability that you may
encounter is called the

A

interquartile range or IQR.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Rather than looking at the entire range (max – min) we look at the range between our
quartiles (Q3 – Q1).
Why won’t the IQR be impacted by outliers like the range was?

A

Any outliers will be below the first quartile or above the third quartile! This means the
IQR is more resistant to outliers than the range (and the standard deviation).

17
Q

How do you find the weighted avg

A

Multiply each data value by its weight
* Add these together (first sum)
* Add your weights together (second sum)
* Divide the first sum by the second sum
As a formula: σ(𝑣𝑎𝑙𝑢𝑒∗𝑤𝑒𝑖𝑔ℎ𝑡)