Week 9 : Interpreting graphs Flashcards by Catherine Fielding

Q

Data

Types of data:

A

categorial data
numerical data

How well did you know this?

1

Not at all

2

3

4

5

Perfectly

Q

Types of data

Categorial data

A

each value represents a discrete category
order does not matter (e.g. 1=tiger, 2=lion, 3=ape)
represent with pie charts, bar graphs & stacked column charts

How well did you know this?

1

Not at all

2

3

4

5

Perfectly

Q

Types of data

Numerical data

A

each value represents either a real number (e.g., age) or place on a continuum (e.g. a rating scale)
Order matters (1= very unhappy, 7= very happy)
histograms & scatterplots
Time series graphs if data are collected over time
discrete vs continuous

How well did you know this?

1

Not at all

2

3

4

5

Perfectly

Q

numerical data

discrete

A

the variable has a discrete, finite number of values
e.g. binned volume (e.g. day of month on which u bought ur last avocado… only 31 possibilities)

How well did you know this?

1

Not at all

2

3

4

5

Perfectly

Q

numerical data

Continuous

A

the variable has an infinite number of values
e.g. log-transformed volume, average number of avodados you buy each month
assumed to be normally distributed

How well did you know this?

1

Not at all

2

3

4

5

Perfectly

Q

What is a graph?

A

visual representation of data that can present complex information quickly and clearly & assist the reader to see patterns and trends in data
graph is good when… precise numbers arent required, trend or comparison can be demonstrated and there are relationships between data values

How well did you know this?

1

Not at all

2

3

4

5

Perfectly

Q

Graphs need to include the following…

A

titles
axis labels
legends
footnotes
representation of axes
scale and error
a visualstyle that is easy to interpret

How well did you know this?

1

Not at all

2

3

4

5

Perfectly

Q

Graphs need to include the following…

Title

A

summarize what graph is showing
placed in centre above/below graph
apply numbering system to the titles of all graphs
explain what the X and Y axez represent

How well did you know this?

1

Not at all

2

3

4

5

Perfectly

Q

Graphs need to include the following…

Labels & legend

A

X-axis (horizontal) and Y-azis (vertical)
brief
explain exactly what each aspect of the graph is showing
include units of measurement
Legend… key to the various data plotted on a graph (e.g. colours or shading)

How well did you know this?

1

Not at all

2

3

4

5

Perfectly

Q

Graphs need to include the following…

Footnotes

A

further explain data
e.g. in sample survey include footnote describing the sample that’s being represented and the number of respondents in the sample (n)
incluse a base on th egraph that allows reader to see how many ppl answered the question & make a quick assessment of the likely accuracy of the results based on sample size
should mention the source of data

How well did you know this?

1

Not at all

2

3

4

5

Perfectly

Q

Graphs need to include the following…

Axes & scales

A

vertical axis starts at 0
Only exception to this rule is when there are negative values, in which case the scale would start at less than 0

How well did you know this?

1

Not at all

2

3

4

5

Perfectly

Q

Indicating range of error

Confidence intervals

A

gives an estimated range of values taken from a set of sample data
The range of values is likely to include what the ‘true’ value would be if the entire population were to be surveyed
when reporting exact known figures, confidence intervals are not necessary
statistical results are often presented using the 95% confidence interval… range of valyes within which there is a 95% chance the true population value lies
typically displayed by using error bars… if they do not overlap there is usually a statistically significant difference in the estimates for those response categories

How well did you know this?

1

Not at all

2

3

4

5

Perfectly

Q

Visual style

how to make the graph look the best

A

reduce clutter
highlight what is importan
data ink… numbers (scale) and vital points representing data (non-data ink is titles, headings, legends, etc. should not be overused)
colours and patterns… do not over use to distract
dimension… graphs should be 2D whenever possible

How well did you know this?

1

Not at all

2

3

4

5

Perfectly

Q

Graphs

Bar graphs

A

compares a series of categories by representing each one as a bar
used to evaluate categorial data
simple & easy to interpret
easy to include error bars
populat in survey reporting

How well did you know this?

1

Not at all

2

3

4

5

Perfectly

Q

Bar graphs

Vertical/horizontal & clustered bar graph

A

vertical… best for comparing estimates (means or percentages) & between 2-7 groups
Horizontal… best for showing categorial data when comparing estimates, 8+ different groups, use when category labels are too long to appear neatly on x-axis
clustered…

How well did you know this?

1

Not at all

2

3

4

5

Perfectly

Q

Graphs

Stacked column graphs…

Study These Flashcards

A

compare percentage that each value contributes to a total of 100% across categories
best used for categorial data when each column is comprised of no more than 3 components

Q

Graphs

Line graphs

Study These Flashcards

A

used to illustrate trends over time for continuous data
can also be used to compare 2 different variables over time

Q

Graphs

Histograms

Study These Flashcards

A

Shows the distribution (shape) of a numerical variable
Provides a visual, intuitive sense of the data (Mean, range, skew, possible outliers )
data are grouped into ranges than plotted as connected bars
each bar represents a range of data (width of bar proportionate to width of each category, height proportional to frequency/percentage of that cagetory)
bars presented in ascending or descending order
used for data that are at least at the ordinal level of measurement, and most often for plotting continuous data

Q

Graphs

Scatter plots

Study These Flashcards

A

used to plot data points on a horizontal and a vertical axis to show relationships between 2 variables
plotting continuous data
useful when comparing 2 variables in situations when there are so many data points

Q

Graphs

Line graphs

Study These Flashcards

A

Tells you how two variables are associated by drawing a line through a series of places where X and Y intersect
X-axis: discrete variable
Y-axis: continuous variable of interest

Q

Graphs

Box whisker plot

Study These Flashcards

A

displaying variation in a set of data
used in exploratory data analysis
shows shape of distribution, central value & variability
shows 5 number summary
useful for indicating whether a distribution is skewed and whether there are unusual observations (outliers) in the data set
Ideal for comparing distributions because the centre, spread and overall range are immediately apparent

Q

Graphs

Pie charts

Study These Flashcards

A

Great for displaying relative frequencies (parts of a whole) but do not really give you much else
Not great for displaying absolute frequencies
show parts or percentages of a whole
limitations… hard to tell difference, error bars & confidence intervals not shown legend & labels hard to read & align & do not work when comparing data

Q

Graphs

time-series graphs

Study These Flashcards

A

special kind of line graph that shows how something changes over time
X-axis: time, usually as a discrete variable
Y-axis: continuous variable of interest

Q

Importance of y-axis

Study These Flashcards

A

Truncating the y-axis (e.g. not starting at baseline) can exaggerate differences
A more reasonable y-axis range starts ay 0 and show the whole range
But also too broad of a range can minimize differences
There should only be one y-axis

Importance of X-axis

- Truncating an Axis = Restricting range to maximize differences - Expanding an Axis = Using too broad a range to minimize differences - Ignoring conventions (E.g., values should go from small to large) - Comparing non-equivalent data (E.g., two different Y axes on same graph)