The principles of statistics, the assumptions on which statistical tests are based, sampling, and the preparation of data for analysis Flashcards

1
Q

Type 1 error

A

The incorrect rejection of a true null hypothesis (a false positive)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Type 11 error

A

Incorrectly retaining a false null hypothesis (a false negative). Low power means high type 2 error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How does power relate to type 11 error

A

Power = 1-B, where B = the probability of making a type 11 error

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

List 4 factors which affect power

A
Effect size (large effect size increases power)
Alpha (decreasing alpha decreases power, increasing alpha increased power but also increases chance of making a type 1 error)
Variance of the distribution (decreasing the variance of the distributions increases power)
One tailed test increases power
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What do you need to calculate power

A

Effect size
Sample size
Alpha

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Assumptions for independent measures designs

A

Normally distributed variables (skewness and kurtosis, Shapiro-Wilks tests for both)
Linear relationship between variables
No multicollinearity or singularity
Homogeneity of variance (Tested using Levene’s test of homogeneity. If significant, the assumption has been violated). When data violates this it is heteroscedastic. Heteroscedastic=bad.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Assumptions for repeated measures designs

A

Normally distributed variables (skewness and kurtosis, Shapiro-Wilks tests for both)
Linear relationship between variables
No multicollinearity or singularity
Sphericity (only univariate ANOVA, not RM MANOVA)
•Tested using Maulchly’s test. Assumes that all pairs of levels of the within-subjects variable have equivalent correlations. In RM ANOVA, violations of sphericity can increase type 1 error rate. It is only relevant when there are more than two levels. If violated, use a correction, which are provided in the output in SPSS. Two available corrections are the greenhouse geisser (GG) and the huynh-feldt (HF).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Nominal data

A

Discrete data, categorical variables (qualitative)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Ordinal

A

Quantities that have a natural ordering but the intervals between each value are not equal. (e.g. when you rank things in order) (qualitative)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Interval

A

Like ordinal except the points on the scale do have equal intervals. For example temperature, and Likhert scales with numbers on them. (quantitative)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Ratio

A

Like interval but with a neutral 0 point. (distance, time)

quantitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Sampling error

A

The sampling error is the difference between a sample statistic used to estimate a population parameter and the actual but unknown value of the parameter. The smaller the sample, the greater the sampling error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Random error

A

Random errors in experimental measurements are caused by unknown and unpredictable changes in the experiment. These changes may occur in the measuring instruments or in the environmental conditions. When averaged, random error should become 0.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Probability sampling

A

The sampling technique in which every element of the population has an equal, non-zero probability of being selected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Categorical variable

A

Categorical variables contain a finite number of categories or distinct groups. Categorical data might not have a logical order. For example, categorical predictors include gender, material type, and payment method.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Discrete variable

A

Discrete variables are numeric variables that have a countable number of values between any two values. A discrete variable is always numeric. For example, the number of customer complaints or the number of flaws or defects. If a discrete variable has enough levels, you can treat it as a continuous variable in a statistical analysis. If not, you may consider treating it as a categorical variable.

17
Q

Non probability sampling

A

In non-probability sampling, not all members of the population have a chance of participating in the study unlike probability sampling, where each member of the population has a known chance of being selected. Non-probability sampling is used in studies where it is not possible to draw random probability sampling due to time or cost considerations.

  • convenience sampling (i.e. students)
  • quota sampling (recruiting 250 males and 250 females)
  • snowball sampling
18
Q

Volunteer sampling

A

Non-probability sampling. Participants self-select into the survey

19
Q

Quota sampling

A

Non-probability sampling. Research decides how many people to recruit who possess particular characteristics, such as gender or ethnicity.

20
Q

Random sampling

A

Each item from the population has an equal, non-zero chance of being selected for the sample.
Can be achieved using a lottery system where all of population given a number and numbers are then randomly selected. Can be done using computer software for random selection

21
Q

Descriptive statistics

A

A methods for organising, displaying and describing data. Includes measures of central tendency and measures of dispersion

22
Q

Inferential statistics

A

The estimation of parameters and the testing of statistical hypotheses

23
Q

Systematic error

A

Systematic errors are errors that are not determined by chance but are introduced by an inaccuracy (involving either the observation or measurement process) inherent to the system. Systematic error may also refer to an error with a non-zero mean, the effect of which is not reduced when observations are averaged.

24
Q

Positive skew

A

Tail to the right, in the positive direction. Bulge to the left of the graph.
The median and mode will likely be to the left of the mean, i.e. have a smaller or more negative value than the mean.

25
Q

Negative skew

A

Tail to the left, in the negative direction. Bulge to the right of the graph.
The median and mode will likely be to the right of the mean, i.e. have a larger or more positive value than the mean.