lec 5 variables and scales Flashcards
what is a variable
a condition/ characteristic that can change or have different values
defining characteristics
- attribute that describes a person/ place/ thing
- value can vary betw/ diff entity’s
*
qualitative vs quantitative variables
qualitative: values that are names/ labels
quantitative: numeric variables that measure quantity
discrete vs continuous variables
continuous variable: a variable that can have any value bet/ it’s minimum and max values
discrete: variable that can’t have any value betw/ min and max
univariate vs bivariate data
Univariate data: when a study consists of only one variabe
Bivariate data: when a study examines the relationship bet/ 2 variabels
what is a nominal scale
- lowest statistical measurement level
- this scale is given to items that are divided into categoris without any order or structure
- e.g.
- gender
- eye colour
- blood type
- e.g.
what is the ordinal scale
consists of variables that have an inherent order to the relationship among diff categories
- a ranking of responses that may have diff meaning among individuals
- allows gross order but not the relative distance between them as the distance is not equal
- properties of ordinal scale:
- 1)Identity: quality being measured
- 2) Magnitude: amount of the quality being measured gives a quantitative distance betw/
*
what is the interval scale
variables that have a constant and equal distances between values but the zero point is arbitrary
properties:
- identity
- magnitude
- equal distance: shows how the difference bet/ points
e.g. IQ score, pain scale w/ no,
what is a ratio scale
top level of measurement with all the properties of abstract an abstract number system but with an absolute zero
properties
- identity
- magnitude
- equal distance
- absolute zero: allows how many times greater one case is from another
- allows use of all mathematical operations
- e.g.
- wieght,
- pulse rate
- respiratory rate
- e.g.
what is a measure of central tendency / central location
a single value that attempts to describe a set of data by identifying the central position within the data set
- mean
- median
- mode
describe the mean §
- most familiar measure of central tendancy
- most common value in the data set even though its not one of the values=> min error
- used wi/ discrete and continous data, latter most common
- sum of all the values divided by no of values in data set to min error
- includes every value of data set
- only central tendency w/ the sum of deviations of each value from mean = 0
- sample mean = X bar
- populaiton mean = µ
what is the main disadvantage of the mean
very susceptible to outliers (values unusual compared to data set by small/ narge numerical value)
mean can be skewed by these values
if so the median is a better measure of central tendency
when not to use the mean and use the median instead
presence of outliers
_skewed distributio_n- the mean moves away from the centre but the median stays central and is least influenced
- in normal distribution: mean= median=mode
what is the median
the middle score for a set of data that has been arranged in order of magnitude
- least affected by outliers and skewed data
- order the values and find te middle, if even no. find mean of the two
what is the mode
most frequent score in the data set.
- highest bar on histogram
- used for categorical data when the most popular option is sought after
*
problems with the mode
-
non unique,
- causes problems when 2 values are equally popular
- even more problematic for continous data as a finding an exact mode is unlikley=> mode is rarely used w/ continous data
- if the mode is far from the rest of the data in the set then it’s inaccurate