data analysis :descriptive statistics Flashcards

1
Q

define descriptive statistics

A

the use of graphs , tables and summary statistics to identify and analyse sets of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

define measures of central tendency

A

the general term of any measure of the average value in a set of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what are the 3 types of measures of central tendency ?

A

mean , median and mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

how is the mean calculated ?

A

calculated by adding up all the scores or values in a data set and dividing this figure by the total number of scores there are

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

work out the mean for the following data set of scores :
5 , 7 , 7, 9 , 10 , 11 , 12 , 14 , 15 ,17

A

total is 107
107/10 the number of scores
mean value of 10.7

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

why is the mean the most sensitive of the measures of central tendency and what does that mean ?

A

as it includes all of the scores/values in the data set within the calculation
–> this means it is more representative of the data as a whole

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is a limitation for mean and give an example of how this could happen ?

A

it is easily distorted by extreme values
example : if we replace 17 in the data above with the number 98 –> the mean becomes 18.8 which doesn’t really seem to represent the data overall

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

define mean

A

the arithmetic average calculated by adding up
all the values in a set of data and dividing by the number
of values there are

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

define median

A

the central value in a set of data when values
are arranged from lowest to highest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

how is the median calculated ?

A

the middle value in a data set when scores are arranged from lowest to highest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

when is the median easily identified ?

A

in the odd number of scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

how is the median identified in the even number of scores ?

A

the median is halfway between the 2 middle scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is a strengths for the median ?

A
  • the extreme scores does not effect the result
  • it is so easy to calculate
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what is a limitation for the median ?

A

it is less sensitive than the mean as not all scores are included in the final calculation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what does ‘sensitive’ refer to ?

A

refers to how easily a measure is influenced by data that’s unusual or doesn’t fit the rest

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

define mode

A

the most frequently occurring value in a set of
data

17
Q

how to calculate the mode ?

A

the most frequently occurring score/value within a data set

18
Q

in some sets of data what may there be ?

A

2 modes - bimodal
OR
no mode if all the scores are different

19
Q

what is limitation of the mode and give an example ?

A

it is a very crude measure
the mode can
- sometimes be quite different from the mean and the median

20
Q

when would mode be the only method that can be used ?

A

example :
if you asked your class to list their favourite dessert , the only way to identify the most ‘typical’ or average would be to select the modal group

21
Q

when deciding what method of central tendency should be used what should be considered ?

A

whether there are any extreme scores

22
Q

what would be best to consider if there is no extreme values and why ?

A

the mean is the best option as it is
the most sensitive measure of the three

23
Q

what would be best to consider if there is extreme values and why ?

A

the median is most suitable as the mean
would become distorted

24
Q

what would be never the best option and when would it be appropriate ?

A

the mode - except if the data are in categories

25
define measures of dispersion
the general term for any measure of the spread or variation in a set of scores
26
what are the 2 examples of measures of dispersion ?
range and standard deviation
27
define the range
a simple calculation of the dispersion in a set of scores which is worked out by subtracting the lowest score from the highest score and adding 1 as a mathematical correction
28
how is the range calculated ?
a simple calculation of the spread of scores and is worked out by taking the lowest value from the highest value and (usually) adding 1
29
why is 1 added when calculating the range and give an example ?
t allows for the fact that raw scores are often rounded up (or down) when they are recorded within research - someone may complete a simple task in 45 seconds - however, it is unlikely they took exactly 45 seconds to complete this task (in fact it may have taken them anywhere between 44.5 and 45.5 seconds), so the addition of 1 accounts for this margin of error
30
what is a strength for the range ?
it is easy to calculate
31
what s a limitation for the range and give an example ?
- it only takes into account for the 2 extreme values --> this may be unrepresentative of the data set as whole EXAMPLE : a student was ill during a test and score 0 and the highest value was a 100 due to the student being given the paper as a homework - this illustrates the problem with range and it may not give a fair representation of the general spread of scores as in this example most students achieved around half marks in the test
32
define standard deviation
a sophisticated measure of dispersion in a set of scores. It tells us how much scores deviate from the mean by calculating the difference between the mean and each score. All the differences are added up and divided by the number of scores. This gives the variance. The standard deviation is the square root of the variance
33
what is the SD considered as ?
much more sophisticated measure of dispersion
34
what does the SD tell us ?
is a single value that tells us how far scores deviate (move away from) the mean
35
the larger the SD
the greater the dispersion or spread within a set of data
36
in a situation where we are talking about a particular condition within an experiment a large SD suggests that ..
that not all participants were affected by the IV in the same way because the data are quite widely spread - may be few anomalous results
37
what may a low SD value reflect ?
the fact that the data are tightly clustered around the mean, --> might imply that all participants responded in a fairly similar way
38
what is a strength of SD ?
much more precise measure of dispersion than the range as it includes all values within the final calculation
39
what is a limitation of SD ?
like the mean – it can be distorted by a single extreme value