Week 14: hypo testing II Flashcards
descriptive stats definition
a way of summarizing data to convey main information
—instead of presenting raw scores, researchers present a few summary measures that capture their data’s key characteristics
descriptive stats can be
- frequencies and percentages
- measures of location (central tendency)
- measures of variability
- measures of individual location
frequencies and percentages
- preferred when researchers want to convey how often particular phenomena occurred (nominal measures)
- divide # in a category by # in group and x100
- proportion is an alternative to percentage, just dont multiply by 100
measures of location or central tendency
single values that describe an entire set of data
- include two broad categories
- –central location
- —–mean, median, mode
- –fractiles
mean
average
- the sum of n divided by n (the sample size)
- most appropriate for interval or ratio levels
- most reflective of data when the distribution of scores fits a normal distribution
- –outliers make it less representative
- x bar= sigma x/n
median (mdn)
- the midpoint or center of the data or set of numbers
- computing the median requires ordinal, interval, and ratio levels (must be arranged from low to high)
- if even # the average the 2 middle
- if data is not normally distributed the median may e preferred over the mean
the mode
- based on frequency info, primary measures of central tendency for nominal measures
- –category, response, or score that happens most often
- relatively uninformative but is the only thing you can do with nominal level
- bimodal–when two sets have the same number of frequency (2 humps like a camel) normal is unimodal
fractiles
statistical procedures are called fractiles when they divide a set of data into two or more nearly equal parts
- identify the proportion of observations above and below them (median divides in half)
- can be quartiles, quintiles, deciles, percentiles
measures of individual location
used to specify the location of one participant in relation to a group of participants, they include: rank, percentile rank, and standard scores
rank
orders a group of participants in terms of their performance on some measure from low to high or high to low
percentile rank
shows the relative position for an individual within a group of participants
standard score
- aka z score
- raw scores that are converted to standard deviation units, they are useful as indicators of individual location when data are normally distributes
- z=(x-mean)/standard deviation
measures of variability
the degree of dispersion in a set of data (aka spread)
- measures:
- –range
- –variance
- –standard deviation
range
the largest observed value minus the smallest
- useful when comparing variations between two sets of data, however tells nothing about variability of scores falling between
- nominal data
variance
considers the dispersion of individual values around the mean
- can be calculated by either interval or ratio level data
- to calculate
1) list all scores in a column
2) compute the mean for the set of scores
3) obtain the difference between each score and the mean
4) square the difference between each score and the mean
5) add the squared differences together–sum the squares
6) divide the sum of squared by the number of participants minus one (n-1) - sum of x minus xbar squared all divided by n-1