descriptive and inferential statistics Flashcards
What are examples of forms of raw data?
a. Questionnaires
b. Interviews
c. Objective measures
Explain descriptive vs inferential statistics
Descriptive statistics
These are basics measures
They average scores on variables and how they differ
iii. Cannot conclude statistical differences
Inferential statistics
These help make decisions about the Null and research hypotheses
They make Generalizations from the sample population
What is central tendency
used for summarizing large amounts of data into one typical or average value
Uses mean median and mode
A summary of measure that attempts to describe a whole set of data with a singe value that represents the middle or centre of its distribution
Explain what the mean its level of measurement and when it is used
Mean measures the average score for each variable.
- Level of measurement = interval and ratio
- Used when: you can, and the data fit
Explain what the median its level of measurement and when it is used
Median is the mid point of data
a. Level of measurement: ordinal
b. Used when: data include extreme scores
Explain what mode is, its level of measurement and when it is used
Mode is the most frequent score
a. Level of measurement: nominal
b. Used when: data are categorical
What are the indices of variability?
a. Range
b. Standard deviation
c. Skewness
What is the importance of descriptives?
Provide a first glance at the data
i. Describes the variability of the sample
b. Allows other researchers to understand the data
- All variables are assumed to follow a ____ distribution
normal
What is the meaning of z-scores?
the number of standard deviations a particular score is away from the sample mean
a. Different scores represent different locations on the x-axis
i. The location on the axis is associated with a certain %
b. Z-scores can be used to predict:
i. Percentage of scores both above and below a particular score
ii. The probability that a particular score will occur in a distribution
c. Z-scores are a good method to understand how scores relate to one another
i. Typically used at population level, but can be useful in samples however not always practical.
if you want to compare it to the “average” person’s weight, looking at a vast table of data can be overwhelming (especially if some weights are recorded in kilograms). A z-score can tell you where that person’s weight is compared to the average population’s mean weight
…
let’s say you have a test score of 190. The test has a mean (μ) of 150 and a standard deviation (σ) of 25. Assuming a normal distribution, your z score would be:
z = (x – μ) / σ
= (190 – 150) / 25 = 1.6.
The z score tells you how many standard deviations from the mean your score is. In this example, your score is 1.6 standard deviations above the mean
way to compare results to a “normal” population
If a z-score has a p value less than 0.05, it is deemed _ _ than the comparison
Significantly different
Reject the null
Want it to be less than .05
What is a type 1 error?
Error of optimism, resulting from rejecting the null hypothesis when it is true.
a. The probability of making a type 1 error:
- Set by researcher
(Ex. 0.01% chance of rejecting the null when it is true)
What is a type 2 error and how can risk of this be reduced?
Error of pessimism, resulting from accepting the null when it should be rejected
- Reduced by increasing sample size
- Reduced by increasing treatment effects
List the 8 steps of using a significance test:
a. State the null hypothesis
b. Establish the significance level
c. Select the appropriate test statistic
d. Compute the test statistic (obtained value)
e. Determine the value needed to reject the null (critical value)
i. This depends on:
1. Level of significance chosen (e.g. p = 0.5)
2. Degrees of freedom (based on sample size)
f. Compare the obtained value to the critical value
g. If the obtained value is greater than the critical value, reject the null
h. If the obtained value is less than or equal to the critical value, accept the null
Explain non-parametric
(Remember the measures)
a. Data are nominal or ordinal
b. Population is not normal
c. Useful in frequency data or proportions
Tests:
Chi square
Mann Whitney wilcoxan u (compares 2 samples or groups)