Statistics and Data Analysis in Quantitative Study Designs Flashcards

1
Q

What are the levels of measurement/types of data?

A

Nominal, ordinal, interval, ratio
Each level has an additional characteristic, and contains all of the characteristics of the previous level.
Nominal and ordinal contain non-parametric data.
Interval and ratio contain parametric data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Nominal

A

Provides information about difference, but not much more.
Used to name identify, or classify into categories.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Ordinal

A

Shows direction of difference, but we don’t know amount of difference.
Numbers indicate rank or order.
Allows for greater than or less than.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Interval

A

Intervals or distances between numbers are known, but it’s not known how far any of the numbers are from zero.
Equality of units, but no true zero.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Ratio

A

Each number can be thought of as a distance measured from zero.
There is an absolute zero point, which represents the absence of the variable being measured.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Statistics

A

Objective means of interpreting data
Inform people about reliability and meaningfulness.
Measures of central tendency and variability are the most fundamental components of most statistical techniques.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Central Tendency

A

Attempts to describe a set of values by placing focus on the central position in the data set.
A single score.
Measures of central tendency include mean, median, mode.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Mean

A

The sum of all scores in a data set, divided by the number of scores in the data set.
The average.
The most common measure of central tendency.
The mean can be effected by outliers (skews data depending on if the outlier is higher or lower than the rest of the data).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Median

A

The number occurring at the midpoint of the data set.
If there is an even number in a data set, take the mean of the two mid-point scores.
Not affected by skewed data.
Drawback is that median may not represent all data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Mode

A

The most frequently occurring number in a data set.
Mode is the most popular option (see frequency of a number)
Drawback is that 2 modes or no two of the same responses (can get 2 modes)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Variability

A

Best estimate of the spread of scores.
Indicates how spread out the scores are or how close they are to the mean.
If variance = 0, all values in the data set are the same.
If variance is low, the values are close to the mean and the range is small.
If variance is high, the range of values is vast and some of the values are far from the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Standard Deviation

A

Is the square root of the variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Normal Distribution

A

Many statistical tests are based on the assumption of a normal distribution.
Distributions provide a visual way to see scores disperse from the mean.
Normal distributions are symmetrical around the mean.
Graph of distributions found in notes on statistics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

t-Test

A

Comparing means.
If comparison is focused on two groups/samples, the appropriate stat is a t-test.
Aims to understand the difference between the means of those two groups.
Groups may be different or same people measured twice.
Includes independent t-test and dependent t-test.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Independent t-test

A

Used when the study has two groups of different participants
Can be used for experimental and quasi-experimental designs.
Compare two independent samples.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Dependent t-test

A

When two data points collected over time or when participants are exposed to the two experimental conditions.
Compare scores from same or matched sample.

17
Q

What happens where there are more than two groups/samples that are different?

A

One way ANOVA (extension of t-test).
Can include repeated measure (repeated measure ANOVA is used when two different time points or more).

18
Q

What happens after finish statistical analysis?

A

We revisit the hypothesis.

19
Q

P-values

A

Probability of error = p-value
Researchers aim to reduce chances of error as much as possible.
P-values of 0.05 means there is 5% room for error or chance (researchers are also 95% confident that the results are genuine and not a chance finding).
Written in research p less than 0.05 = significance or ability to reject the null hypothesis.

20
Q

Type I Error

A

Researchers make the decision that a manipulation or treatment has been successful when it in fact has not been.

21
Q

Type II Error

A

Researchers make the decision that the manipulation has failed when it worked.

22
Q

What are some other types of distribution?

A

Skewness: scores spread out more at one end of the distribution (positive or negative).
Kurtosis: peakedness of distribution.

23
Q

Correlation

A

A measure of the strength of the relationship between two variables (x and y).

24
Q

Pearson product-moment correlation

A

Assumes x and y are normally distributed and are interval or ratio scale scores.
Pearson r

25
Q

Spearman rank-order correlation

A

Used for ordinal scale data or interval or ratio scale data that deviate substantially from normality.

26
Q

Pearson r

A

Pearson r is a relationship independent of number of scores, size of scores, dispersion of scores.
Pearson r is derived from covariance.

27
Q

What information can you get from Pearson r?

A

Direction of relationship: positive correlation means x increases and y increases; negative correlation means x increases and y decreases or vice versa.
Strength of the relationship: closer to line/mean = stronger relationship

28
Q

Correlation Coefficient

A

A measure of the direction or strength of a linear relationship (relationship shown from Pearson r).
Can be positive (closer to 1.0) or negative (closer to -1.0).
Can also be no relationship (closer to 0.0).

29
Q

What is the significance of r in a correlation?

A

Significant correlations do NOT provide the basis for drawing causal conclusions.
Causation depends on methodology, not analysis.

30
Q

Null Hypothesis

A

The observed correlation is due to error.

31
Q

Alternative Hypothesis

A

There will be a significant correlation between x and y