Measurement and Graphical Representations of Data Flashcards

1
Q

What do descriptive statistics tell us?

A

What type of variable we have and what are their values in our sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

When we want to study a _________, we ____ what happens in the __________ by studying a ______________

A
  1. characteristic
  2. infer
  3. population
  4. representative sample
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is categorical data?

A

Qualitative data that can be divided into groups, usually based on the limited and fixed number of possible values it can have (e.g. colours)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the two main types of categorical data?

A

Nominal and Ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a nominal variable?

A

Categorical data with no inherent order (e.g. ethnicity). It cannot be a number or measured but may be coded for ease of analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is an ordinal variable?

A

Categorical data that can be ranked, although not necessarily easily spaced (e.g. SES status). Numbers may represent an order but have no mathematical meaning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is numerical data?

A

Quantitative data with values that are always expressed in number form

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are the three main types of numberical data?

A

Interval, discrete, and continuous data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is an interval variable?

A

Data which is ordinal but with equidistant and meaningful spaces, usually with 5 or more categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is discrete data?

A

A countable variable that involves a specific, limited number of possible integer values (e.g. number of kids, shoe size). It may have decimals to include halves if logical.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is continuous data?

A

A variable that can be measured. It is not fixed and can have an infinite number of possible values in a prespecified interval (e.g. height, weight, age)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the least informative type of variable?

A

Categorical nominal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the most informative type of variable?

A

Numerical continuous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

If a variable has four equidistant categories, what type of variable is it?

A

Categorical ordinal - there is not enough information to approximate the underlying variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

If a variable has five equidistant categories with mathematical meaning, what type of variable is it?

A

Numerical interval - we can treat it as continuous as we can sufficiently estimate the underlying continuum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

If a variable has ordered values and we know that the difference between two values is meaningful, what type of data is this?

A

Numerical interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

A measure of agreement asks participants to rate their agreement 1) strongly disagree, 2) disagree, 3) agree, 4) strongly agree. What type of variable is this?

A

Categorical ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

A survery on fruit intake asks participants to rate how often then eat fruit per week: 1-2 days per week, 3-4 days, 5-6 days, or everyday. What type of variable is this?

A

Numerical interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

A survery asks participants to rate their agreement with 1) I am not sure, 2) I agree to some extent, 3) Depends on the occasion, 4) I am not informed. What type of variable is this?

A

Categorical nominal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

A survery asks participants to rate quality of communication on a scale where 1=very poor and 10=very good. What type of variable is this?

A

Numerical interval/continuous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is a valid percentage?

A

The frequency of a category amongst those who responded (excluding those who did not)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What are the best descriptive indices for categorical data?

A

Frequencies and percentages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What are the best types of graphical representation for categorical data?

A

Bar charts or pie charts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is the best use of pie charts in research?

A

For nominal data with 2 or more categories

25
What goes on the X and Y axis of a bar chart?
X=category Y=count/frequency
26
What are the two main measures used to describe numerical data?
Location (central tendency) and dispersion (variability)
27
What is the variance used for?
To understand how far values are from the mean (the average squared distance)
28
What does the standard deviation measure?
How spread out a group of numbers are from the mean
29
If you have the variance, how do you calculate the standard deviation?
Take the square root
30
If you have the standard deviation, how do you calculate the variance?
Square it
31
Why do you divide by n-1 when working out the variance or SD when using a sample?
To obtain an unbiased estimate for the population
32
What are examples of measures of location and central tendency?
Mean, median, and mode
33
What are examples of measures of dispersion?
Standard deviation, minimum and maximum, range and IQR
34
What are the best types of graphical representation for numerical data?
Histograms and box plots
35
What do the bins represent in a histogram?
Intervals, not values
36
What can you add to a histogram to help visualise the spread compared to what may be expected?
A normal distribution curve
37
What is the normal curve?
A bell shaped symmetrical curve around the mean
38
In a normal distribution, what percentage of values are lower than the mean?
50%
39
In what type of distribution does the mean equal the median and the mode?
The normal distribution
40
What is the name for data that is not normal?
Skewed/non-symmetrical
41
If the median and mode are smaller than the mean, what is the distribution?
Positively skewed
42
If data is positively skewed, the mean is ________ than the median and the mode.
Greater
43
If the median and mode are greater than the mean, what is the distribution?
Negatively skewed
44
If data is negatively skewed, the mean is _______ than the median and the mode.
Smaller
45
What are the best descriptive indices to use for normal (symmetrical) numerical data?
Mean and standard deviation
46
What are the best descriptive indices to use for skewed (non-symmetrical) numerical data?
Median, minimum and maximum, and interquartile range
47
What is the best measure of dispersion for normal data?
Standard deviation
48
What is the best measure of dispersion for skewed data?
Minimum and maximum, and interquartile range
49
What is the best measure of central tendency for normal data?
Mean
50
What is the best measure of central tendency for skewed data?
Median
51
What are the best descriptive indices for discrete numerical data?
Median, mode, min-max, and IQR
52
What is the benefit of using a box plot?
To visualise outliers and depict the distribution of data across groups
53
In a box plot, what percentage of values are greater than Q3?
25%
54
In a box plot, what does the middle line represent?
The median
55
What type of distribution would produce a box plot where (Q2 - Q1) < (Q3 - Q2) ?
A positively skewed distribution
56
What type of distribution would produce a box plot where (Q2 - Q1) = (Q3 - Q2) ?
A normal distribution
57
What type of distribution would produce a box plot where (Q2 - Q1) > (Q3 - Q2) ?
A negatively skewed distribution
58
If the mean, median, and mode are all close together, what can we infer from the data?
That it is normally distributed
59
What are usually the best descriptive indices for numerical interval data?
Median and range (assuming it is skewed)