Correlation and Hypothesis testing - Ryan Ward Flashcards

1
Q

what is correlation?

A

refers to a statistical measure that quantifies the relationship or association between two variables. It indicates the extent to which changes in one variable are systematically related to changes in another variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what can serve as the basis for well-designed experiments

A

correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Brazelton Neonatal Behavioural Assessment Scale

A

women smoking during regency
children did worse on some scales

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Bower (2020) anxiety in women

A

found level of anxiety in pregnant women and likelihood of premature birth or low birth weight of baby
example of correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

positive correlation and how it can be represented

A

as one variable gets bigger the other variable gets bigger can be represented on a scatter plot
tilts upward from Left to Right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

negative correlation and how it is represented

A

as one variable gets bigger the other variable gets smaller
tilts down on scatter plot from Left to Right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

zero correlation and how its represented

A

no consistent relation between variables
scattered points with no patter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Correlation in terms of strength

A

strong = close to the centre line IV predicting DV stronger
weak = something else is probably going on as well as correlation causing dots to be further from the centre line
the stronger the correlation the better the predictability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

way to compute correlation is Pearson R what is this

A

slope of line that minimises difference between line and each point
used by psychologists

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

3 things to know about R

A

variables to be correlated must be measured on the same individuals
variables must be measured on an interval or ratio scale
r can detect only linear relationships

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

linear

A

points that generally fall on a straight line

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

if r = 0 or is low it means

A

it may be that there is no relationship or it may ne that the existing relationship is non-linear

may be because of a restricted range - need a certain amount of spears or variability in scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

curvilinear

A

increase the x results initially in increase in y, then decrease in y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

example of curvilinear

A

Yerkes-Dodson arousal curve

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Yerkes-Dodson arousal curve

A

describes the relationship between arousal and performance. It suggests that there is an optimal level of arousal for achieving peak performance on a task.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Cross-lagged-panel correlation procedure

A

A way of dealing with the directionality problem to a certain extent
It is commonly used in longitudinal research to explore the temporal order of variables and investigate potential causal relationships.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

cross-legged-panel correlation underlying assumption

A

if one variable “causes” the other, it should be more strongly related over time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

cross-legged-panel correlation general strategy

A

obtain several correlations over time then look at size and direction of the correlation coefficients to determine what leads to what

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

inferential statistics

A

iued to decide about the population based on observations of the sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

characteristics of population and the symbols

A

parameteres:
μ mean and σ standard deviation
can’t measure the entire pop so use sample

21
Q

characteristics of sample the symbols for them

A

statistics:
X̄ mean and s standard deviation

22
Q

3 steps for sampling distributions and logic

A

make a guess about the population frequency distribution - hypothesis what pop mean is
take a random sample
decide if sample came from a pop, like the one you guessed in step 1 (usually based on how close sample mean is to the hypothesised pop mean)

23
Q

central limit theorem

A

take enough samples with means always get a normal distribution
when independent random variables are summed or averaged, regardless of their underlying distribution, their sum or average tends to follow a normal distribution as the sample size increases.

24
Q

sampling distribution

A

When we take a single random sample from a population and calculate a statistic, such as the sample mean, we obtain a single value. However, if we were to take multiple random samples of the same size from the population and calculate the statistic for each sample, we would end up with a distribution of those statistics. This distribution is called the sampling distribution.

whether the difference in sample and pop is due to chance or something real based on variability and true difference

25
Q

if the likelihood is very small that the results could have been obtained from the distribution suggested by Ho

A

then reject the null hypothesis in favour of the alternate hypothesis – if p is low reject Ho

26
Q

if the observed mean (X̄) could have reasonably been obtained from the distribution suggested by Ho (due to chance variation) then

A

retain (never accept the null hypothesis) the null hypothesis

27
Q

what is the null hypotheses (Ho)

A

statement that asserts that there is no significant relationship or difference between variables or populations.
a position of no effect, no difference, or no relationship. It assumes that any observed differences or relationships in the data are due to chance or random variation rather than a true underlying effect.

28
Q

what is the alternative hypothesis (H1)

A

statement that contradicts or opposes the null hypothesis (Ho). It represents the possibility of a significant relationship, difference, or effect between variables or populations that is not due to chance or random variation.

29
Q

significance level or alpha (α) level

A

the probability value that defines the boundary between rejecting or retaining the Ho

if p is less than alpha then you reject the Ho

30
Q

what is the significance level usually set at for psychologists

A

0.05 or sometimes 0.01
p < .05

31
Q

Region of rejection

A

The region of rejection represents the range of values of the test statistic that would be unlikely to occur by chance alone if the null hypothesis were true. If the calculated test statistic falls within this region, it suggests that the observed data are inconsistent with the null hypothesis, providing evidence for the alternative hypothesis.

if number is in region of rejection - reject Ho

32
Q

when is a one-tailed test used

A

when we have a directional alternative
eg something improving memory

used when there is evidence or theory to suggest that the treatment will have an effect in one particular direction

33
Q

when is a two-tailed test used

A

when we have a non-directional alternative
eg. something could improve or worsen memory

two directional could do either

34
Q

critical value

A

specific value or set of values that define the boundaries of the region of rejection in hypothesis testing. It is used to make decisions about whether to reject or fail to reject the null hypothesis based on the observed test statistic.

z = critical value

35
Q

type 1 error

A

This is the error of rejecting the null hypothesis when it is true. It represents a false positive result.

36
Q

type 2 error

A

This is the error of failing to reject the null hypothesis when it is false. It represents a false negative result.
p = beta

37
Q

use a two-tailed test unless there is what

A

a “good” (theoretical reason) to use a one-tailed one

38
Q

what are decision errors and what are the two types

A

refer to the incorrect conclusions that can be made when performing a statistical test. There are two types of decision errors: Type I error and Type II error.

39
Q

what does it mean that type | and type || errors are mutually exclusive

A

if you have one you can’t have the other
changes in one type of error have an effect on the other type of error

40
Q

two ways to minimise type || errors

A

reducing beta and increasing power (1-β) so that we can reject the null hypothesis when it is false
increase alpha
increase sample size
use most powerful statistical test
have a good experimental design

41
Q

what effect does increasing alpha have in order to minimise type 2 error

A

increasing alpha also produces a higher probability of a Type 1 error (rejecting the null when it is true)

42
Q

when to set alpha high or low

A

set alpha very low (0.01) when consequences of type 1 error are severe

set alpha high (0.05) when consequences of type 1 errors are not too serious

43
Q

by increasing sample size to minimise type 2 error

A

has less variability
narrower sampling distribution
reduced β (alpha doesn’t change)

44
Q

single sample t-test

A

used to test the null hypothesis for a single-sample experiment when the standard deviation of the population must be estimated

45
Q

two sample t-test

A

used to compare the means of two separate groups and determine if they are significantly different from each other. It helps assess whether there is a significant difference between the means of two populations.

46
Q

degrees of freedom is

A

how many scores in the sample are free to vary - generally all scores except the last one

47
Q

assumptions of the single-sample t-test

A
  1. the random sample compromises interval or ratio scores (directly comparable quantitative rather than ordinal)
  2. the distribution of the individual scores is normal
  3. that standard error of the mean is estimated using ^sx computed from the sample
48
Q

student t-test

A

is a statistical test used to compare the means of two groups and determine if they are significantly different from each other. It is commonly used when the sample sizes are small or when the population standard deviation is unknown.