PSY1022 WEEK 11 DISC 5 Flashcards
DESCRIPTIVE STATISTICS
Techniques that help describe a set of data. eg. Graphs, tables, calculating an average score.
Creating a meaningful summary, so patterns and trends in the raw data can be seen.
INFERENTIAL STATISTICS
Methods that use the limited information from samples to answer general questions about populations.
- help researchers determine when it is appropriate to generalize from a sample to a population
STATISTIC
A summary value that describes a sample.
- eg. average age of students in a sample.
Two purposes:
- They describe or summarize the entire set of scores in the sample.
- They provide information about the corresponding summary values for the entire population.
PARAMETER
A summary value that describes a population.
- eg. average age of students in the population.
- symbolised by Greek letters.
FREQUENCY DISTRIBUTIONS
A method of simplifying and organising a set of scores by grouping them into an an organised display that shows the entire set.
- consists of a tabulation of the number of individuals in each category on the scale of measurement (X and f)
- displays set of categories and and number of individuals with scores in each category.
- advantage: allows researcher to view the entire set of scores
- disadvantage: can be tedious with large sets of data.
FREQUENCY DISTRIBUTION GRAPH
Scale of measurement, or categories, on X axis.
Frequencies on Y axis.
- histogram (boxy graph thing)
- polygon (line graph thing)
- when not numerical = bar graph. Similar to histogram, but bars don’t touch.
- first step in examining a set of data. Rarely used in published reports.
FREQUENCY DISTRIBUTION TABLE
Two columns of information
- first is scale of measurement, or categories. Labelled X.
- second is frequency or number of individuals
CENTRAL TENDENCY
A statistical measure that identifies a single score that defines the center of a distribution. The goal of central tendency is to identify the value that is most typical or most / best representative of the entire group.
- mean, median, mode
MEAN
Add all scores, divide by number of individuals.
Represented by M in research papers.
Mean for population represented by µ (mu).
- can be skewed by a few extreme scores
- can’t use with nominal scale
- ordinal is generally inappropriate.
MEDIAN
The score that divides a distribution in half.
Good when there are extremes.
Is the “middle” number. Write scores in order, pick the middle one.
If two in the middle add them together and divide by 2.
Also called 50th percentile.
- can be used for ordinal, ratio, and interval
- good alternative to mean when there are a few extreme scores.
MODE
Is the score or category with the greatest frequency. In a frequency distribution graph, the mode identifies the location of the peak (highest point) in the distribution.
- possible to be bimodal or multimodal.
- can be used for NOIR
VARIABILITY
Descriptive: describes the spread of scores in a distribution.
Inferential: provides a measure of how accurately any individual score or sample represents the entire population.
A primary value to describe a distribution of scores.
Small = clustered together (good representation)
Large - spread out (distorted picture)
Measured by range or standard deviation.
STANDARD DEVIATION
Uses the mean of the distribution as a reference point and measures variability by measuring the standard (average) distance between each score and the mean.
- averaged squared distance from the mean.
- represented by s or SD
- half will be positive and half with be negative
VARIANCE
Average squared distance from the mean.
= s^2
Measures variability.
CALCULATE STANDARD DEVIATION
- For each score, measure distance from mean (score = 84, mean = 80, deviation = 4)
- Find variance by summing all the squared distances
- find average squared distance (divide by n-1, not n).
- square root to find standard deviation
DEGREES OF FREEDOM
n-1
df
produces a variance for the sample that is an accurate and unbiased representation of the population variance