Data Analysis and Graphs Flashcards
What are the 3 types of data?
Nominal data
Ordinal data
Interval data
Define nominal data
Data represented in the form of categories
E.g. how many students in the 6th form drive to school
Nominal data is discrete: one item can only appear in one category
Define ordinal data
Data which is ordered in some way
E.g. ask everyone in the class how much they like the 6th form on a scale of 1-10
Does not have equal intervals between each unit
It would not make sense to say that someone who rated the 6th form an 8 enjoys it twice as much as someone who rated it a 4
What is a disadvantage of using ordinal data?
Lacks precision as it is based on subjective opinion
- psychology questionnaires do not measure something ‘real’, unlike a measurement of height, for example
- it can be described as ‘unsafe’ data, which is why it is converted into ranks before a statistical test is used on it
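The conversion of ordinal scores into ranks can be sketched as follows. This is a minimal illustration, not a prescribed method: the function name `rank_scores` and the ratings are hypothetical, and giving tied scores the average of their ranks is an assumed (though common) convention that the cards themselves do not specify.

```python
def rank_scores(scores):
    """Return a rank for each score (1 = lowest); ties share the mean rank."""
    ordered = sorted(scores)
    rank_of = {}
    for value in set(ordered):
        first = ordered.index(value) + 1          # first 1-based position of this score
        count = ordered.count(value)              # how many times the score occurs
        rank_of[value] = first + (count - 1) / 2  # mean of the positions it occupies
    return [rank_of[s] for s in scores]

ratings = [8, 4, 6, 8, 10]  # hypothetical 1-10 ratings of the 6th form
ranks = rank_scores(ratings)
print(ranks)  # [3.5, 1.0, 2.0, 3.5, 5.0]
```

The two ratings of 8 occupy positions 3 and 4 in the ordered list, so each receives the rank 3.5.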
Define interval data
Based on numerical scales that include units of equal, precisely defined size
E.g. units of measurement for height, time and temperature
Define discrete data
The units of measurement cannot be split up
Can only be exact measures
Define continuous data
Measured using a scale of measurement
E.g. height and distance
Distance has meaning at all points between the numbers given, e.g. we can travel a distance of 1.2, 1.85 or even 1.632 miles
What are measures of central tendency?
They inform us about central (or middle) values for a set of data
The average can be calculated in different ways, each one appropriate for a different situation
E.g. mean, median, mode
Define mean
Calculated by adding up the data items and dividing by the number of data items
Define median
The middle value in an ordered list
Define mode
The value that is the most common data item
If two categories or data items share the highest frequency, the data has two modes
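The three definitions above can be checked with Python's standard `statistics` module. This is a small sketch; the list `scores` is made-up data chosen so that two values tie for most common.

```python
import statistics

scores = [2, 3, 3, 5, 7, 7, 8]  # hypothetical data set with two modes

mean_value = statistics.mean(scores)      # sum of the items / number of items
median_value = statistics.median(scores)  # middle value of the ordered list
modes = statistics.multimode(scores)      # every value tied for most common

print(mean_value, median_value, modes)
```

Here the items add up to 35, so the mean is 35 / 7 = 5; the middle (4th) value is 5; and both 3 and 7 appear twice, so the data has two modes.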
What are some advantages of the mean?
- it is a very sensitive statistic because it takes account of the exact distance between all the values of all the data
- it is representative of all the data
What are the disadvantages of the mean?
- if one of the values is extremely high or low (anomalous), the overall mean can be very distorted and thus misrepresent the data
- it cannot be used with nominal data
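The distortion caused by an anomalous value can be seen in a short sketch. Both lists are invented for illustration; they differ only in their last item.

```python
import statistics

typical = [4, 5, 5, 6, 7]        # hypothetical scores
with_outlier = [4, 5, 5, 6, 70]  # same scores, but one anomalous value

mean_typical = statistics.mean(typical)            # 27 / 5
mean_outlier = statistics.mean(with_outlier)       # 90 / 5
median_typical = statistics.median(typical)        # middle value of the ordered list
median_outlier = statistics.median(with_outlier)   # middle value is unchanged

print(mean_typical, mean_outlier)      # the mean jumps from 5.4 to 18
print(median_typical, median_outlier)  # the median stays at 5
```

A mean of 18 is higher than four of the five scores, so it no longer represents the data well.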
What are the advantages of using the median?
- it is not affected by extreme scores so can be useful under such circumstances
- it is easy to calculate
- it is not distorted by any anomalous values
What are the disadvantages of using the median?
It is less sensitive than the mean because the exact values are not reflected in the median
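As a rough illustration of this lower sensitivity, two quite different data sets can share the same median while their means differ. The lists `set_a` and `set_b` are hypothetical.

```python
import statistics

set_a = [1, 2, 5, 6, 6]   # hypothetical scores, mostly low
set_b = [5, 5, 5, 9, 16]  # hypothetical scores, mostly high

median_a = statistics.median(set_a)  # middle value of the ordered list
median_b = statistics.median(set_b)
mean_a = statistics.mean(set_a)      # 20 / 5
mean_b = statistics.mean(set_b)      # 40 / 5

print(median_a, median_b)  # both medians are 5
print(mean_a, mean_b)      # the means are 4 and 8
```

The median only reflects which value sits in the middle, so it does not register how far the other values lie from it.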