Week 2 - Lecture 2 Flashcards

1
Q

Hypotheses Testing

define conceptual hypothesis

A
  • outlines the predicted relationship between the independent varible (IV)/predictor variable and the dependent variable (DV)*/outcome variable *
  • eg. Intoxicated participants will be more creative than sober participants
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

define statistical hypothesis

A
  • represents the mathematical relationship **presumed to exist **between **two or more population parameters **
  • eg. creativity scores will be higher among intoxicated participants than among sober participants
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is the Neyman-Pearson Hypothesis Testing Model?

A
  • statistical analysis in psychology is guided by the Neyman-Pearson hypothesis testing model
    In this model there are **two statistical hypotheses offered: **
  • The null hypothesis (H0)
  • The alternative (research) hypothesis (H1)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what does the null hypothesis (H0) represent?

A
  • there is no relationship between the two variables that we are investigating
  • assertion that researcher is trying to reject (something researchers want to say is unlikely to be true)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what does the alternative (research) hypothesis (H1) represent?

A
  • there is a relationship between the** two variables**
  • what the researcher believes to be true
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

how does the null hypothesis vs alternative hypothesis look graphically

A
  • yellow could be drawn anywhere
  • statistically you’re testing the** likelihood** that the sample came from the population as specified in the null
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is meant by “rejecting the null”?

A

the question we are always considering is:
- what is the likelihood my **sample was drawn **from a population with the mean specified under the null hypothesis?
- this is why we must **specify **the **H0 mean **up front
- we then aim to reject it on the basis that the sample mean is so extreme that it is unlikely to have come from the population **specified under the null **

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How “extreme” does a sample mean need to be for us to “reject the null”?

A

By convention, we** reject the null** when the sample mean is so extreme that it would occur when the** null is true** less than 5% of the** time**
- Referred to as** p < .05** (critical region, alpha greek symbol)
- specifies acceptable risk of **rejecting H0 when it is true**
- ie. only 5 out of 100 experiments would have produced the obtained result (or an even more extreme one) by chance (under H0)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what does the critical region look like mapped into the normal standard distribution curve?

A

when you get a Z score in the middle region, we retain H0
- It’s not extreme enough to feel confident that it’s unlikely the sample came from the population *
Once it gets into the rejection region of the two tails, it’s unlikely *we would** get a value this extreme** based on chance alone,
if H0 was correct
- spread our error into two tails, so that** 2.5% of the time we’re going to make an error in either direction **

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is the significance/meaning behind rejecting/retaining the null hypothesis (H0)?

A
  • rejecting H0 simply means that it is unlikely given the current data and we accept the alternative hypothesis for the time being
  • retaining H0 does not mean that it is true - we might just not have found sufficient evidence to reject it at the moment
    **We cannot prove a statement **
  • the best we can do is evaluate the evidence to see whether H0 is unlikely to be true
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Four types of hypotheses

define causal-directional hypotheses

A
  • more A causes more (or less) B
  • eg. consuming alchohol increases creativity
  • experimental data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Four types of hypotheses

define causal non-directional hypotheses

A

A causes B
- eg. consuming alchohol affects creativity
- experimental data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Four types of hypothses

Define non-causal directional hypotheses

A
  • more A is related to more (or less) B
  • Greater alchohol consumption is associated with greater creativity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Four types of hypotheses

Define non-causal non-directional hypotheses

A

A is related to B
- alcohol consumption is associated with creativity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

which type of hypothesis is the following scenario?
- Giving a gift to someone impacts your happiness

A

Causal, non-directional

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

which type of hypothesis is the following scenario?
- setting specific, measurable goals leads to better performance

A

causal, directional

17
Q

which type of hypothesis is the following scenario?
- stress is related to workplace attitudes

A

non-causal, non-directional

18
Q

what is a non-directional hypotheses?

A

Predict that a **difference will exist without specifying the nature or direction of that difference**
- eg. alchohol consumption is associated with creativity
- eg. relatable examples affect enjoyment learning statistics
However, we’re always interested in the direction when it comes to **interpreting any significant effects **

19
Q

define one-tailed vs two-tailed tests

A

When Alpha is 0.05, acceptable chance of making a** Type 1 error **(rejecting H0 when H0 is actually true)
- One-tailed test: Large rejection region just on one side
- Two-tailed test: Smaller rejection regions on both sides
- **allows for differences in either direction (more room for error) **

20
Q

why do we compare a** sample mean** to a distrubution of means in psychology?

A

Discussed** individual scores** of their l**ikelihood of being observed **in the **general population **
- compared one individual score (yours) to a distribution of individual scores (the population)
- But in psychology we’re interested in sample scores, *not *a single *individual’s score *
Statistically, we can’t compare a sample mean with a comparision distribution of individual scores
- instead we use a comparison distribution of means of samples of scores: we compare a sample mean to a *distribution of means *

21
Q

what are sampling distributions?

A
  • sampling distributions are how things would look if you repeatedly took samples (of a given size) from a **population of interest **
  • It’s not representing the raw scores but some **statistic from the sample **
  • Can have a sampling distribution for any statistic, but the most common is the** sampling distribution of the mean **
22
Q

what is natural variability?

A

Gives you the **distribution of means **we can expect when we do this repeated process of drawing samples of the same size and calculating their mean
- mean of the sampling distribution will be the population mean, but the SD shows you there is **variability around that **

23
Q

When will the sampling distribution of the mean be normally distributed?

A
  • the sampling distribution of the mean will be normally distributed if the population of raw scores is normally distributed or if the** sample size is at least 30 (central limit theorem)**
  • don’t need to understand central limit theorem, just know that sample size matters
    • **bigger sample size **= sampling distribution of the mean will be normal
    • *critical assumption for our statistical tests *
  • the mean of the sampling distribution is equal to the mean of the actual population (refer to photo)
24
Q

what is the standard error of the mean?

A
  • Standard error of the mean is the** standard deviation **of the **distribution of sample means **
  • It represents the typical or average distance between a **sample mean **and the mean of the population
  • It is used to* define *and accurately measure sampling error
  • Tells you how much you would expect the sample mean to vary just **based on chance alone **
25
Q

define standard error (SE)

A

how much would you expect the sample mean to differ from the population mean, just based on chance alone, for a sample of this size

26
Q

True or False

We have a standard error of 3. This means

we would expect the **mean IQ **of our sample to differ from the population mean on average by 3 IQ points. So a sample mean IQ of 97 (3 points below the mean) or 103 (3 points above the mean) is within the margin of error we would expect based on sampling error

27
Q

what does the Z score tell you?

A

tells you in SE units how far the sample mean is from the population mean

28
Q

shortcut for rejecting the null

A

critical z score (when doing a two-tailed test, and alpa = 0.05):
- 1.96 is the critical value on either side you need to exceed to reject the **null **hypothesis
- any z-score falling above 1.96 = you can **reject the null **
- any z-score falling **below -1.96 **= you can reject the null
- any **other z-score **= you need to retain the null

29
Q

When we know the mean of a population but do not know the standard deviation for the population the best we can do is…

A
  • Use an approximation of that standard deviation from the standard deviation for the sample
  • This means we end up using a one-sample t-test
30
Q

steps to testing hypotheses about means - sigma unknown

A
  • conduct a one-sample t-test
  • to calculate SE need to substitute an estimate of sigma
31
Q

what is the formula for sigma unknown hypotheses testing?

A

refer to photo

32
Q

When sigma is unknown, the nature of the test changes how?

A
  • then we obtain a t value and compare to a* t distribution *
    WHY?
  • the sampling distribution of the sample variance is** positively skewed**
    -** S^2** will underestimate sigma^2 (esp. for small samples)
  • We’d be more** likely to make a Type 1 error **if we used critical z scores, so we use **critical t instead **
33
Q

explain Student’s t distribution

A
  • Gosset showed that if s^2 is used in place of sigma^2 it would lead to a particular distribution, called a t-distribution
34
Q

What are the similarities in both z and t distributions

A

Both distributions (z & t):
- have a mean of zero
- are symmetrical
- are unimodal

35
Q

what are the differences between a z and t distribution?

A

The t distribution:
- has a flatter distribution
- Has a larger standard deviation
- Depends on df, so t-distribution is a family of distributions

36
Q

To find the correct ‘critical’ t value you need…

A
  • df (degrees of freedom) and alpha
37
Q

Implications for statistical inference

A
  • larger values of t required to reject null hypotheses - depends on df
  • as df approaches infinity, the distribution will approach normality and be equivalent to z
38
Q

Degrees of freedom - what are they?

A
  • **number of scores that are free to vary **
  • df = N = number of parameters estimated
  • For a one sample t-test,** df = N - 1 **
    • because in a one-sample t-test we use s^2 to estimate alpha^2

EXAMPLE:
- take the numbers 6,8 & 10
- sample mean (x-bar) = 8
- **Now you may change any of these numbers as long as the mean stays as 8 **
- How many numbers can you change?
- only 2 are “free” to vary and keep the mean of 8
- 6,8,10 (sample mean = 8)
- 7,?,13 (sample mean = 8)
- ? has to be 4
- so for a one sample t-test, **df = N - 1 **