Topic 5: Two-Sample Hypothesis Testing Flashcards

1
Q

Key features of Between subject designs (3)

A
  1. Each participant participates in one and only one group
  2. Comparisions made between the groups
  3. Random assignment used to assign participants to groups (if true experiment)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Two group designs do not always have to…… for example …..

A
  • compare an experimental group to a control
  • fpr example, it can compare two different amounts of an IV (comparision of length of prision centre such as 2 years and 4 years) or compare an “apple vs orange” such as how good people do depending on AM/PM lecture
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Ex-Post-Factor (2)

A
  • IV not manipulated (already occured)
  • Does not allow cause and effect claims
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

One sample T test vs Independent sample t-test

A
  • 1 sample: you find the sample distribution and standarize it
  • Independent sample: look at the difference between the sample means and compare that diff to 0. *Standaridize the difference
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How do we know if variances are equal?

A

Not equal if one variance is more than 3x the other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Three necessary conditions to use a t-test for independent samples:

A
  1. The samples must be independent
  2. Each population should have a normal distribution. However, this assumption is frequently violated with little harm as long as n>25
  3. Homogeneity of variance: Both groups must be sampled from populations with similar varince, if not do not pool variance, use adjusted df
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Two-Sample t-test for the difference between Means steps:

A
  1. State the claim mathematically. Identify the Null and alternative hypotheses
  2. Specify the level of significance (identify alpha)
  3. Identify the degrees of freedom and sketch the sampling distribution (df= n1+n2 -2 or df= smaller of n1-1 or n2-1)
  4. Describe the critical values (t-table)
  5. Determine the rejection region(s)
  6. Find the standaridized test statistic with formula below
  7. Make a desicion to reject or fail to reject null
  8. interpret the desicion in the context of the original clai
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Power (2)

A
  • Probability of rejecting a false H0
    probability that you will find difference thats really there
  • 1-beta
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Beta

A
  • The probability of making a type 2 error
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

1-B

A

Chance of finding an effect that exist

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Influences on power+the disadvantages of each ways (5):

A
  • Increase alpha value increases the ability to find an effect: raising alpha levbel increases our probability of a type 1 error
  • One tail test increase power: not appropriate for a non-directional hypothesis, goes against convention as many do 2 tail test without specifying
  • Decrease population variance (σ) which is impacted by population SD and sample size: Results are harder to generalize
  • Increase sample size increases power as the sampling distribution for means are skinnier: may be hard/expensive to get alot of ppl
  • Make your manipulation strong aka effect size: may not be ethical
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Effect size + formula (2)

A
  • Magnitude of true difference between null and alternative hypotheses (u1-u2)
  • d= l (u1-u2)/σ l
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Large effect size means that

A

your null and HA population dont overlap very much

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Delta

A

Value used in referring to power tables that combines effect size and sample size

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

variance and SD relationship

A

SD is the value when variance is sqaure rooted
SD^2= Variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Standard deviance symbol

A

σ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Variance symbol

A

σ2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What does this mean: Estimated power for delta= 1.5, alpha=0.05 and is roughly 0.32?

A

If the study were to run repeatedly, 32% of the time, the result would be statistically significant

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Steps to find out what sample size we need to give us the power to detect a difference (3):

A
  1. Start with anticipated effect size (d)
  2. Determine delta required for desired level of power
  3. Calaculate n required for that value of delta
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Importance of power when evaluating study results

A
  • When a result is statistically significant: Effect size can tell us whether the result is practically significant
  • When a result is not statistically significant: May not be that there is no difference, but simply that the study lacked the power to find it
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Why does larger sample make distribution narrower when talking about impact of sample size on power

A

Since the standard deviation of the sampling distribution decreases with increasing n, the curve has a narrower, taller graph as more probability is squeezed toward the middle.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Matched pair (2)

A
  • When participants are measured and equated on some variable and then assigned to each group based on that
  • can be experimental or quasi-experimental (non-randomization)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Natural pairs (4)

What it is+ wjat kind of experiment+ ex+ advantage/disadvantage

A
  • When participants are matched based on some biological or social relationship
  • Typically quasi-experimental
  • Ex: using siblings: may differ in personality but similar genectics and childhood experience.
  • The primary advantage of the natural pairs design is that it uses a natural characteristic of the participants to reduce sources of error. The primary limitation of this design is often the availability of participants. The researcher must locate suitable pairs of participants (such as identical twins) and must obtain consent from both participants
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Repeated Measures

A

Where the same participants are exposed to and measured in both conditions

25
Q

Matched pair vs Natural pair

A

In a natural pairs design, scores were paired for some natural reason. In a matched pairs design, scores are
paired because the experimenter decides to match them on some variable.

26
Q

Dependent (Paired) Samples design

A
  • When we compare either the same participants or participants who have some type of predetermined relationship.
  • Subjects in one group do provide information about subjects in other groups. The groups contain either the same set of subjects or different subjects that the analysts have paired meaningfully.
27
Q

Within-subjects designs ex (3):

A
  • matched pairs
  • Natural pairs
  • repeated measure
28
Q

Dependent (paired): advantages (3)

A
  • Greater control over the equality of groups
  • more statistical power to find an effect: variability due to individual differeces decreased/nonexistent
  • can often have smaller sample sizes
29
Q

Which method to use if you dont have alot of people or you have alot of variability in a population?

A

Dependent (paired)

30
Q

đ

A

the mean of the diffferences between each data pair in a paired t-test

31
Q

d

A

The difference between each data pair:
d= x1-x2

32
Q
A

The SD of the differences between the each data pair

33
Q

Assumptions of the Paired t-test

A
  1. The samples must be dependent (paired)
  2. Both populations must be normally distributed
34
Q

What are we comparing for a paired t-test?

A

We are comparing our average difference to a null hypothesis that the average difference between out two groups is 0.

35
Q

t-test for the difference between paired means steps (9)

A
  1. State the claim mathematucally. Identify the null and alternative hypotheses
  2. Specify the level of signifcance
  3. Identify the degrees of freedom and sketch the sampling distribution [df= (number of pairs: n) -1]
  4. Determine the critical values (Use t-distribution)
  5. Determine the rejection regions(s)
  6. Calculate d bar and Sd
  7. Find the standardied test statistic of the t value with formula.
  8. Make a desicion to reject or fail to reject the null hypothesis. (depends if t-observed is in the rejection region or not)
  9. Interpret the desicion in the context of the original claim
36
Q

We will use t-tests exclusively for tests of ——- and the z-test for tests of —–.

A
  • tests of either independent or dependent group means
  • proportions
37
Q

t-tests can be used for comparing means of —— level variables

A
  • interval and ratio
38
Q

statistically significant

A

Is the difference large enough that we can reject the null hypothesis of no difference?

39
Q

Z-tests can be used to compare proportions of —– level variables.

A

nominal and ordinal

40
Q

When to use t and z test, 2 components:

A

The t-test is used when the population variance is unknown, or the sample size is small (n < 30). At the same time, the z-test is applied when the population variance (σ2) is known and the sample size is large (n > 30).

41
Q

As the number of samples and/or the size of the samples increase, the standard error becomes….

A

smaller and smaller indicating less variation around the population mean.

42
Q

The smaller the standard error, the more — the estimate of the population mean.

A

precise

43
Q

The central limit theorem states that

A

the sampling distribution of the mean will always be normally distributed, as long as the sample size is large enough. Regardless of whether the population has a normal, Poisson, binomial, or any other distribution, the sampling distribution of the mean will be normal.

44
Q

The probability associated with a difference between two means can be determined once …..

A

the difference is standardized as a t-statistic. A calculated t-value with an associated probability in the “rejection region” in either of the two tails of the distribution would provide evidence that there is a difference between the two means.

45
Q

If we expect there will be a difference between the two means but are not predicting that one will be larger than the other, this would constitute a —

A

two-tailed test

46
Q

As you learned previously, the t-distribution is —- whose shapes depend on —-

A
  • a family of distributions
  • their degrees of freedom (df)
47
Q

degrees of freedom

A

refer to the number of values in a calculated statistic that are free to vary (do not have a fixed value).

48
Q

Explain why its df-1 for degrees of freedom

A

If you are given the first three numbers in a distribution of four numbers as 2, 4, and 5, and then must identify the fourth number so that all four numbers will add to 20, the fourth number must be 9. The first three numbers can vary, but once they are determined, the last number is no longer free to vary. So we say that this distribution has n – 1 degrees of freedom, or df = 3.

49
Q

Both methods, Z and T-test assume a ——-, but the z-tests are most useful when the —-

A
  • normal distribution of the data
  • standard deviation is known.
50
Q

When do we use a pooled estimate of the standard error

A

For independent sample means, if one variance is no more than twice the other, they are considered approximately equal, and a pooled estimate of the standard error can be used.

51
Q
A

sample variances

52
Q

Using a pooled estimate when the variances are “too different” has been shown to lead to

A

unreliable results with the t-test (i.e., low p-values)

53
Q

Figuring Degrees of Freedom – Approximately Equal Variances:

A
54
Q

The p-value —- as the df decrease

A

increases

54
Q

The variance that determine if we use a T test or Z test is the

A

variance for the population providing the samples

55
Q

How to use the p-value method approach for independent sample means

A
  • calculate the p-value of t and compare that to the probability limit set by alpha
  • Ex: the p-value associated with a t-statistic of 10.00 and df = 48 is P < .00001. Since the P-value is less than the alpha limit of .05, the null is rejected.
56
Q

The value for d tells us

A

the mean difference between the two groups is equivalent to x.xx standard deviations

57
Q
A