Chapter 2 - Frequency Distributions Flashcards

Question

# **APPLYING THE CONCEPTS:** The United Nations Development Programme(2015) published life expectancy rates—the number of years an adult can expect to live—for 195 countries around the world. Following is a randomly selected sample of 30 of them (see graph below). **a**. Create a grouped frequency table for these data. **b**. The data have quite a range, with the lowest life expectancy of 50.72 years in Côte d’Ivoire and the highest life expectancy of 83.58 years in Japan. What research hypotheses come to mind when you examine these data? State at least one research question that these data suggest to you. **c**. Create a grouped histogram for these data. As always, be careful when determining the midpoints of the intervals. **d**. Examine the histogram and give a brief description of the distribution. Are there unusual scores? Are the data symmetric, or are they skewed? If they are skewed, in which direction?

Answer 1

A

The data that is gathered from participants. All the numbers that have not been organized or graphed or cleaned up.

WHY not use raw data?
* Finding a pattern in raw data is difficult
* We want to visualize and summarize the data
* Need to also inspect for outliers and for data entry errors.

Answer 2

A

Frequency Distribution Table = a visual depiction of data that shows how often each value occurred (how many scores were at a certain value – how many students got exactly 7 hrs sleep? 5 hours of sleep?) SEE PIC BELOW for how it’s done.
Grouped Frequency Table: (Groups the data) 2 reasons -
1. when data has a large range of potential values (like IQ going from 70 - 149 ) see table on next card
2. When the data has decimal points (is continuous)

Principles to keep in mind for a Grouped Table:
a) you need to determine the full range of data and include the points that have zero frequency (Top Value - Bottom Value: 8 - 3.5 (then + 1) = 5.5)

b) aim for between approx. 5-10 intervals (no less than 5, no more than 15)

c) for continuous data, use lower and upper limits (the lowest and highest possible values)

Frequency Distribution Table

Answer 3

A

GROUPED FREQUENCY TABLE (the data initially)

Answer 4

A

HISTOGRAM - for continuous data

Answer 5

A

When you want to show proportions of the whole picture.

Answer 6

A

Visual depictions of data when the independent variable is nominal and the dependent variable is interval (specifically, scale) :

TWO WAYS:

Present frequency or proportion Data. EX: graph showing the % of girls and boys getting over 9 hours of sleep per night.
Present mean or average values EX: the previous graph shows the mean score of the two variables, neutral and emotional. The black stick bars on top are ‘standard error bars’.

EX: develop a chart demonstrating the cost of tuition (dep. variable) for 3 types of schools - public, semi-public, & private (indep. variable)

1st way

Answer 7

A

Used to depict the relationship between 2 scale variables

ex: amount of abdominal fat & dementia symptoms

Answer 8

A

A histogram is a bar graph of data that shows the frequency of each value of a variable. Same info as a frequency table, but visualised differently.

Answer 9

A

When the choices are biased towards an outcome, such as when a scale has ‘Not Satisfactory, Good, Excellent, Truly Superior’…… and there’s no negative ratings on there! Another example is ‘Rate Toronto as 1st, 2nd, 3rd. or 4th’ and then the person reports ‘Toronto is in the top 4 cities in Canada!’. It is set up to have a biased outcome.
sometimes there is a dichotomy amoung the data because either people had very good experiences or very bad experiences (Travel Advisor, Rate my Professor, Yelp). People self-select to participate and it’s not randomized sampling!:)
When a line is drawn between data points that have been selectively placed on the graph
When a line is drawn outside of the data points and the graph assumes the model line will go down, up or across.
Uses scaling to distort the graph data. Looking at the pic below, the Tim Hortons and the Starbux uses different scales so the whole thing is hard to read at a glance! (Should start at 0 and label the scales)

All of these need to have representative sampling.

#5

Answer 10

A

is a graph showing the typical bell curve in the middle – meaning most of the participants scores were in the middle of the graph.

Answer 11

A

Instead of being a ‘normal’ graph with the bell graph in the middle, there is a tail to one side. It is non-normal and non-symmetrical.

POSITIVE — generally has ‘floor’ effects
NEGATIVE — generally has ‘ceiling’ effects

Answer 12

A

to look at the shape of the distribution

Answer 13

A

A situation in which a constraint prevents a variable from taking values below a certain point. Pushes the distribution to the LEFT side of the graph (positive skew)