Data Handling and analysis Flashcards
1
Q
Levels of Measurement
A
- Nominal Data
o Definition: Data in categories (e.g., gender, blood type).
o Strength: Easy to generate from closed questions.
o Limitation: Lacks depth; cannot express relative magnitude. - Ordinal Data
o Definition: Data that can be ordered, but intervals between values are not equal (e.g., ranks, rating scales).
o Strength: Provides some order of magnitude.
o Limitation: Differences between scores are subjective. - Interval Data
o Definition: Data with equal intervals between values but no true zero (e.g., temperature in Celsius).
o Strength: Precise, allows for more detailed analysis.
o Limitation: Cannot establish a true absence of the variable. - Ratio Data
o Definition: Data with equal intervals and a true zero point (e.g., height, weight).
o Strength: Allows for the comparison of magnitudes.
o Limitation: Not as common in psychological research.
2
Q
Descriptive Statistics
A
- Mean
o Definition: Sum of all values divided by the number of values.
o Strength: Uses all data points, providing a precise measure.
o Limitation: Affected by extreme scores (outliers). - Median
o Definition: Middle value when data is arranged in order.
o Strength: Unaffected by outliers.
o Limitation: Doesn’t consider the magnitude of all values. - Mode
o Definition: Most frequently occurring value.
o Strength: Useful for categorical data.
o Limitation: Can be uninformative if multiple modes or no mode exist.
3
Q
Measures of Dispersion
A
- Range
o Definition: Difference between the highest and lowest scores.
o Strength: Simple and quick to calculate.
o Limitation: Affected by outliers, doesn’t represent the distribution. - Standard Deviation (SD)
o Definition: A measure of the average distance from the mean.
o Strength: Provides a more precise measure of variability.
Limitation: Can be complex to calculate and interpret for non-experts.
4
Q
Types of Distributions
A
- Normal Distribution
o Definition: A symmetrical, bell-shaped curve where most scores cluster around the mean.
o Features: Mean = Median = Mode, and 68% of scores fall within 1 SD of the mean. - Skewed Distribution
o Positive Skew: Tail extends to the right (mean > median > mode).
o Negative Skew: Tail extends to the left (mean < median < mode). - Kurtosis
o Leptokurtic: Sharp peak (high kurtosis).
Platykurtic: Flat distribution (low kurtosis).
5
Q
Inferential Statistics
A
- Parametric Tests
o Assumptions: Normally distributed data, interval/ratio level, equal variances.
o Examples: t-test, ANOVA. - Non-Parametric Tests
o Used when data doesn’t meet parametric assumptions.
o Examples: Mann-Whitney U, Wilcoxon, Chi-Square. - Choosing Tests
o Correlation: Pearson’s r (parametric), Spearman’s rho (non-parametric).
o Difference between Groups: t-test (parametric), Mann-Whitney U (non-parametric).
6
Q
Significance and Probability
A
- p-value
o The probability of obtaining a result if the null hypothesis is true.
o Significance Level: Typically set at 0.05 (5%). If p < 0.05, results are statistically significant. - Type I Error
o Rejecting the null hypothesis when it’s true (false positive). - Type II Error
o Failing to reject the null hypothesis when it’s false (false negative).
7
Q
Effect Size
A
- Definition: A measure of the strength or magnitude of the effect or relationship found in a study.
- Cohen’s d
o Measures the difference between two means.
o Small Effect: 0.2
o Medium Effect: 0.5
o Large Effect: 0.8 or above. - Importance: Even if a result is statistically significant, the effect size tells us how practically significant it is.
8
Q
Correlations
A
- Positive Correlation
o As one variable increases, so does the other. - Negative Correlation
o As one variable increases, the other decreases. - No Correlation
o No relationship between variables. - Correlation Coefficient
o A statistical value (ranging from -1 to +1) that indicates the strength and direction of a relationship.
9
Q
Presentation of Data
A
- Bar Chart
o Used for discrete data, where categories are separate (e.g., gender). - Histogram
o Used for continuous data, where bars touch to show the distribution of scores. - Scatterplot
o Shows the relationship between two continuous variables (used in correlation studies). - Tables
o Used to present raw data and descriptive statistics clearly.
10
Q
Significance Testing and Statistical Tests
A
- Chi-Square Test
o Used for nominal data to assess differences between groups or associations between variables. - Mann-Whitney U Test
o A non-parametric test for comparing differences between two independent groups (ordinal data). - Wilcoxon Signed-Rank Test
o A non-parametric test for comparing differences between two related groups.
11
Q
A