Descriptive & Inferential Stats Flashcards
What are the main four steps in the data collection process?
- Constructing a data collection form
- Establishing a coding strategy
- Collecting the data
- Entering data onto the collection form
The data entry process is vulnerable to what?
Human error
When coding data, what main things should you remember?
Use single digits
Use codes that are simple and unambiguous
Use codes that are explicit and discrete
Name some data collection rules:
- Get permission from your institutional review board
- Decide what type of data you will need to collect
- Consider where will the data come from?
- Make a duplicate of original data and keep separate
- Ensure whoever is collecting data is well trained
- Cultivate sources for finding participants
- Don’t throw away your original data!
What is Qualtrics and SPSS used for?
Qualtrics - for creating online questionnaires
SPSS - used to enter data collected from data collection forms and analyse the data using a wide range of statistical methods
What are descriptive statistics?
Descriptive statistics include measure of central tendency, variability and distribution, and association, presented both numerically and visually. It assists in simplifying the data so you can analyse it.
What are the three main statistics for central tendency?
Mean
Median
Mode
What is central tendency?
Central tendency looks at the middle of the data and tries to capture a middle road picture of the data set.
3 main types - mean (sum of a set of scores divided by the number of score), median (the score of point in a distribution above which one-half of the scores lie) and mode (the score that occurs the most frequently)
What is one major issue with the mean value? (with respect to scores…)
It can be influenced by extreme scores or outliers
When should we use the Median?
Relatively few scores fall at the high or low end of the distribution, when the distribution is not normal. You still include the extreme scores…
Use with ordinal data eg. rank in class, birth order, income
When should we use the Mode?
When the data is measured in a nominal (sometimes ordinal) scale. eg eye colour party affiliation
When should we use the Mean?
For interval and ratio data eg. speed of response, age in years
What is variability?
Variability refers to the dispersion or spread of scores in the data. Some measures of variability are: range, inter-quartile range, standard deviation
What is range?
Range is the simplest and crudest measure of variability - it is effected significantly by extreme scores
What is interquartile range?
Quartiles divide set into four equals based on three values (the middle value being the median). The interquartile range is the upper quartile minus the lower quartile.
What is standard deviation?
The average amount that each individual scores deviate from the mean. SD is good when you have a normal distribution
Researchers like to examine the entire set of scores at one time.. how can they do this?
By looking at the data’s distribution
What is a histogram?
Graph showing frequency distribution (Y = frequency, X = score)
What is a normal distribution?
A normal distribution has the following properties: it has a bell shape, the mean and median are equal, and 68% of the data falls within 1 standard deviation.
What is something (not average…) that the normal distribution can tell us?
It tells us a lot about people who deviate from the average cluster of people (whether low or high end).
You can accurately determine what percentage of the data is below or above any value.
Name some ways data can deviate from the norm?
Skew
Kurtosis
Modality
Explain skew
skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean