Psych Stats Exam #4 Flashcards
How is correlational research different from experimental?
1) no manipulation of the IV
2) no random assignment
3) at least 2 DVs measured
Purpose of correlational research
to explore association between variables
Correlation definition
the linear association between variables
What does the correlation coefficient provide?
An indicator of a linear relationship
Visualizing correlation
scatterplots: each point represents two measurements of the same person
Things to look for in a scatterplot
- direction
- scatter/dispersal
- shape
- outliers (see the scatterplot sketch below)
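A minimal sketch of such a scatterplot, assuming numpy and matplotlib are available (the study-hours and exam-score data below are made up for illustration):

```python
import numpy as np
import matplotlib.pyplot as plt

# Hypothetical data: hours studied (X) and exam score (Y) for 30 people
rng = np.random.default_rng(0)
hours_studied = rng.uniform(0, 10, size=30)
exam_score = 60 + 3 * hours_studied + rng.normal(0, 5, size=30)

plt.scatter(hours_studied, exam_score)   # one point = two measurements from the same person
plt.xlabel("Hours studied (X)")
plt.ylabel("Exam score (Y)")
plt.title("Check direction, dispersal, shape, and outliers")
plt.show()
```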
Negative correlation
subjects with high scores on one variable tend to have low scores on the other variable
“when a score of X is above the mean of X, scores of Y will tend to be below the mean of Y” (and vice versa)
Positive Correlation
subjects with high scores on one variable tend to have high scores on the other variable (or low/low)
“when a score of X is above the mean of X, scores of Y will tend to be above the mean of Y” (and below/below)
Correlation coefficient definition (r)
statistic that quantifies the linear relationship between two variables
“a measure of the tendency for paired scores to vary systematically”
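A toy illustration of that definition (data are hypothetical): with sample z-scores, r = Σ(z_x · z_y) / (N − 1), which matches numpy's built-in correlation.

```python
import numpy as np

# Hypothetical paired scores for 5 people
x = np.array([2, 4, 5, 7, 9], dtype=float)
y = np.array([1, 3, 4, 8, 9], dtype=float)

zx = (x - x.mean()) / x.std(ddof=1)   # z-scores using the sample SD
zy = (y - y.mean()) / y.std(ddof=1)
r = np.sum(zx * zy) / (len(x) - 1)    # tendency for paired scores to vary together

print(round(r, 3), round(np.corrcoef(x, y)[0, 1], 3))   # same value both ways
```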
What does the sign of r tell us?
- direction NOT magnitude
R value ranges
from -1 to +1
- the absolute value tells us magnitude
Perfect linear relationship
+1 or -1 (usually don’t exist in nature)
R effect size guidelines
small: 0.1
medium: 0.3
large: 0.5
R as a descriptive statistic
describes effect size
R as an inferential statistic
you can compare it to a critical value to see whether it falls in the rejection region
Null hypothesis of correlation
there is no linear relationship between A and B in the population (ρ = 0)
r for a population
rho (ρ)
degrees of freedom for correlation
df(r) = N-2
- N = number of pairs of observations (20 individual data points = 10 pairs)
Example of correlation write-up
“there is a statistically significant negative correlation - a negative linear relationship - between number of absences and exam score, r(8) = -0.85, p < 0.05. The more classes students miss, the worse they tend to perform on the exam.”
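A hedged sketch of how such a write-up could be produced with scipy.stats.pearsonr; the absence and exam-score numbers below are invented and are not the data behind the card.

```python
import numpy as np
from scipy import stats

# Hypothetical data: 10 students' absences and exam scores
absences = np.array([0, 1, 1, 2, 3, 4, 5, 6, 7, 8])
exam = np.array([95, 92, 88, 90, 80, 78, 70, 65, 60, 55])

r, p = stats.pearsonr(absences, exam)   # correlation and its p-value
df = len(absences) - 2                  # df = N - 2, where N = number of pairs

print(f"r({df}) = {r:.2f}, p = {p:.3f}")
```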
Correlation…
does not equal causation
Factors that influence r
1) truncated range
2) outliers
3) nonlinear relationships
Truncated range
zooming in on one group of people (ex: just high or low scores)
- can alter the correlation: misrepresents the true strength of the relationship by restricting the range of scores
Outliers and small sample sizes
can mask or exaggerate a relationship between variables
- with a small sample size, outliers heavily affect results
- extremity of outlier: very extreme outliers have a larger influence
Pearson’s correlation coefficient
for linear relationships only
used for parametric tests (scale DV)
Examples of nonparametric inferential tests
- chi-squared tests
- Mann-Whitney U test
Spearman’s correlation
used in nonparametric tests
When do we use nonparametric tests?
1) When assumptions of parametric tests are not met (population skewed or relationship nonlinear)
2) small sample sizes (usually under 30)
3) DV is not scale (ordinal and nominal)
Disadvantages of nonparametric tests
1) tend to have low statistical power (higher probability of type II error)
Chi-squared
- used when we only have a nominal variable
“how different are the observed values from the expected values under the null hypothesis?”
What is “O”
observed value
What is “E”
expected value (under the null hypothesis)
What is Σ
sigma: summation
what is χ2
chi-squared: test statistic
Types of Chi-Squared tests:
1) chi-squared test for goodness of fit: one nominal variable, 2+ categories
- df = number of categories - 1
2) chi-squared test for independence: 2 nominal variables
- df = (number of rows - 1) × (number of columns - 1)
- see the code sketch below for both tests
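A minimal sketch of both tests using scipy (all counts below are hypothetical):

```python
import numpy as np
from scipy import stats

# Goodness of fit: one nominal variable with 3 categories (hypothetical counts)
observed = np.array([30, 14, 16])                 # O
chi2_gof, p_gof = stats.chisquare(observed)       # E defaults to equal expected counts
print(f"GOF: chi2({len(observed) - 1}) = {chi2_gof:.2f}, p = {p_gof:.3f}")

# Independence: two nominal variables in a 2x2 contingency table (hypothetical counts)
table = np.array([[20, 10],
                  [15, 25]])
chi2_ind, p_ind, df_ind, expected = stats.chi2_contingency(table)
print(f"Independence: chi2({df_ind}) = {chi2_ind:.2f}, p = {p_ind:.3f}")
```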
Contributors to the misuse of NHST
1) failure to control for bias
2) low statistical power
3) poor quality control
4) p-hacking
5) publication bias
What is the replication crisis?
ongoing methodological crisis in which many psychological findings are difficult to replicate or reproduce
reproducibility
obtaining consistent results using the same original data, methodology, and analysis
Replicability
obtaining consistent results across several studies that aim to answer the same question with different data
Open Science Collaboration
- attempted to reproduce the findings of 100 journal articles
- 270 scientists
- only 39% replicated
Power posing
- only self-reported feelings replicated, no physiological impact
Smiling makes you happier
did not hold up
P value definition
the probability of your observed results (or results more extreme) occurring if the null is true
Why reliance on P-value can be misleading
1) can result in binary thinking: 0.049 is significant but 0.051 is not
2) statistical significance is not necessarily meaningful (need to look at effect size)
Tools to use besides P-values
1) confidence intervals: give a range around the sample mean that estimates the true population mean (see the sketch after this card)
- small interval = better precision
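A quick sketch (sample values hypothetical) of a 95% confidence interval for a population mean using the t distribution:

```python
import numpy as np
from scipy import stats

# Hypothetical sample of 10 scores
sample = np.array([72, 75, 68, 80, 77, 74, 69, 71, 78, 73], dtype=float)
m = sample.mean()
se = stats.sem(sample)   # standard error of the mean

low, high = stats.t.interval(0.95, len(sample) - 1, loc=m, scale=se)
print(f"M = {m:.1f}, 95% CI [{low:.1f}, {high:.1f}]")   # narrower interval = better precision
```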
Significant result but small effect size
something may be there but not meaningful
Not significant result but large effect size
might indicate you missed something (type II error) - might indicate low power
P-hacking (ways to increase power)
1) use a higher alpha
2) use a one-tailed hypothesis instead of two
3) increase sample size
4) somehow reduce variability
5) somehow make the difference between population means bigger
P-hacking (definition)
the misuse of data analysis to find and report statistically significant effects
- data dredging, data snooping, significance chasing
Ways to P-hack
1) trimming data sets (get rid of outliers, zooming in)
2) adjusting values in the data set (what you think participants “mean”)
3) significance chasing: adding a few more participants at a time until the result becomes statistically significant
4) selective reporting: running many analyses but only reporting the ones that showed the desired effect
Debunking published research
very hard - once we see reported evidence, it is hard to change our perceptions
Publication bias
journals tend to publish significant results - may lead researchers to engage in shady research practices
- leads to a biased and incomplete understanding: it is important to know what is NOT different as well
- “file drawer problem”
Best Practices
- publish what you plan to collect and analyze so you don’t adjust it later
- researchers are held accountable
Simple regression
use data to produce an equation for a straight line that captures the trend of the data
- used to make predictions about Y given a particular X score
Multiple Regression
use data to produce an equation for a line including MANY variables
- multiple predictor variables
- can compare strength of different variables on how they jointly affect Y
IV in regression
predictor variable
DV in regression
outcome/criterion variable
Line of best fit
captures the best trend of the data
Simple linear regression equation
ŷ = a + bX
ŷ = predicted score on the outcome
a = intercept
b = slope of the regression line (predicted change in Y for a 1-unit increase in X); the unstandardized regression coefficient
- cannot flip the variables and get the same regression line
Ordinary least square (OLS) estimation
used to draw the line that minimizes the squared errors/residuals (see the sketch below)
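A minimal sketch of simple regression fit by OLS with scipy.stats.linregress (data hypothetical):

```python
import numpy as np
from scipy import stats

# Hypothetical predictor (X) and outcome (Y)
x = np.array([1, 2, 3, 4, 5, 6], dtype=float)
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1, 11.9])

result = stats.linregress(x, y)
a, b = result.intercept, result.slope   # y-hat = a + bX

y_hat = a + b * x                       # predicted scores
residuals = y - y_hat                   # OLS chooses a and b to minimize the squared residuals
print(f"y-hat = {a:.2f} + {b:.2f}X, r = {result.rvalue:.3f}")
```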
standardized beta
a 1 standard deviation increase in (IV) is related to a (beta value) standard deviation change in (DV)
- used in multiple regression (see the sketch below)
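A hedged sketch of standardized betas in multiple regression (all variable names and data are hypothetical): z-scoring every variable before fitting OLS makes the coefficients standardized betas.

```python
import numpy as np

# Hypothetical data: two predictors and one outcome for 100 people
rng = np.random.default_rng(1)
n = 100
study_hours = rng.normal(5, 2, n)
sleep_hours = rng.normal(7, 1, n)
exam = 50 + 4 * study_hours + 2 * sleep_hours + rng.normal(0, 5, n)

def z(v):
    return (v - v.mean()) / v.std(ddof=1)   # convert to z-scores

X = np.column_stack([z(study_hours), z(sleep_hours)])   # standardized predictors
betas, *_ = np.linalg.lstsq(X, z(exam), rcond=None)     # intercept is 0 after z-scoring

print(dict(zip(["study_hours", "sleep_hours"], betas.round(2))))
```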
Write up for multiple regression (beta)
“Controlling for all other measured variables (TV exposure, age, lower grades, parent education, and educational aspirations), exposure to sexual content on TV is still a significant predictor of pregnancy”
Intercorrelated
all variables relate to one another
Regression cannot:
1) establish temporal precedence: we do not know what came first (cannot determine cause and effect)
2) control for variables that aren’t measured (cannot measure all the variables in the world)
How is regression different from correlation?
Correlation: association between 2 variables
Regression: prediction of DV using IV
When to use Mann-Whitney U
test for significant difference between two independent samples (two levels of IV, ordinal/nominal DV)
- parametric partner: independent samples t-test
When to use Wilcoxon signed-rank T-test
Test for significant difference between two paired samples (two levels of IV, nominal/ordinal DV)
-parametric partner: paired samples t-test
When to use Wilcoxon-Wilcox comparison test
Test for significant differences among all pairs of independent samples (three or more levels of IV, and ordinal/nominal DV)
- parametric partner: one-way ANOVA, Tukey HSD tests
When to use Spearman correlation coefficient
Describe the degree of correlation between two variables (nominal/ordinal DV)
- parametric partner: Pearson coefficient (r)
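A hedged sketch of the scipy calls for three of these tests (toy data; the Wilcoxon-Wilcox comparison test has no direct scipy equivalent and is omitted):

```python
import numpy as np
from scipy import stats

# Two independent samples (hypothetical scores)
group_a = np.array([3, 5, 2, 6, 4, 7])
group_b = np.array([8, 9, 6, 10, 7, 9])
u, p_u = stats.mannwhitneyu(group_a, group_b)     # Mann-Whitney U

# Two paired samples (hypothetical pre/post scores)
pre = np.array([4, 6, 5, 7, 3, 8])
post = np.array([6, 7, 7, 9, 4, 9])
t_w, p_w = stats.wilcoxon(pre, post)              # Wilcoxon signed-rank T

rho, p_rho = stats.spearmanr(group_a, group_b)    # Spearman correlation
print(p_u, p_w, round(rho, 2))
```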
When is the mean larger than the median?
Positively skewed data
When is the mean smaller than the median?
Negatively skewed data (see the quick check below)
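A quick numpy check of this rule with simulated data (distribution choices are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(2)
right_skewed = rng.exponential(scale=2.0, size=10_000)   # long right tail
left_skewed = -right_skewed + right_skewed.max()         # mirrored: long left tail

print(np.mean(right_skewed) > np.median(right_skewed))   # True: mean pulled toward the right tail
print(np.mean(left_skewed) < np.median(left_skewed))     # True: mean pulled toward the left tail
```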
Descriptive Statistics
Summarizing a distribution of data with a single number (describing the data rather than drawing inferences from it)
Sample size and rejecting the null
Sample size increase: easier to reject the null
Parameters
a number describing the population
- μ (mu) = mean
- σ (sigma) = standard deviation
Statistics
a number describing the sample
- M = mean
- s = standard deviation
Practical use of power
- can be used to determine the sample size required to detect an effect of a given size (see the sketch below)
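A sketch of that use, assuming statsmodels is installed (the effect size, alpha, and power values are just illustrative):

```python
from statsmodels.stats.power import TTestIndPower

# Solve for the n per group needed to detect a medium effect (d = 0.5)
# with alpha = .05 and power = .80 in an independent-samples t-test.
analysis = TTestIndPower()
n_per_group = analysis.solve_power(effect_size=0.5, alpha=0.05, power=0.80)

print(round(n_per_group))   # roughly 64 participants per group
```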
Type I error
False alarm: you said yes but there is no effect
Type II error
Miss: you missed an effect that was actually there
Statistical Power definition
the probability that we will correctly reject the null when we should (i.e., when it is false)
What is NHST?
Null-hypothesis significance testing
Testing against a null hypothesis (no difference) to see how odd your results are
Robust parametric tests
When an assumption of a parametric test is violated, but the test still operates (mostly) as intended
- The tests we’ve covered this semester are robust to violations of the normality assumption
Spearman vs Pearson Correlation
Pearson:
- parametric: scale DV
Spearman:
- nonparametric: nominal/ordinal DV
If assumptions of a parametric test are met and you use a nonparametric test you are more likely to…
make a Type II error