Summa Week 6 Flashcards

1
Q

What is the GLM?

A

the GLM or the General LInear MOdel is the conceptual framework unifying a large set of statistical methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What can GLM answer?

A

almost any question if the DV is continuous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What must the DV be for the GLM to likely be able to answer it?

A

if it is continuous, and therefore NOT categorical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are familiar statistics within the GLM?

A

t-tests
correlation
ANOVA
regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is bivariate correlation?

A

an association between scores on 2 random variables

e.g. hrs spent on Twitter and # followers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the relation in correlation?

A

a straight line, or “linear correlation” or “linear regression”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is assumed about a correlation?

A

it can be a straight line if a linear correlation or regression, or a curved one referred to as a non-linear regression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

how are correlations detected by the eye?

A

usually by a scatterplot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are scatterplots?

A

a graph that shows the degree and pattern of the relationship between two variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is on the horizontal axis of a scatterplot?

A

a variable that does the predicting (likely arbitrary, and an IV)
e.g. hours of studying, income

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is on the vertical axis of a scatterplot?

A

usually the variable that is predicted (likely a DV)

e.g. grades, happiness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the shape of a positive bivariate distribution or a scatterplot?

A

a line from the bottom left to the top right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the shape of a negative bivariate distribution or a scatterplot?

A

line from top left to bottom right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the shape of no correlation in a scatterplot?

A

there are datapoints scattered all ove r the plot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the shape of a curvilinear distribution or a scatterplot?

A

a bell curve on a scatterplot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the covariance?

A

the correlation coefficient which is a bivariate statistic that measures the degree of linear association between two quantitative variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is covariance?

A

a number that reflets the degree to which two variables vary together

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What represents the correlation coefficient?

A

italicized r

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What does italicized r indicate?

A

the precise degree of linear correlation between two variables, or the correlation coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the range of italicized r?

A

the range for a correlation coefficient is between -1 and 1 [-1,1]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the formula for covariance?

A

COVxy = sum of (stat1 - mean1) (stat2 - mean 2)/ (N - 1)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the formula for a correlation coefficient?

A

r = COVxy / SxSy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Who developed the covariance, or correlation coefficient?

A

Pearson

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is an assumption of random sampling for Pearson’s r?

A

each sample is a random sample from its population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What is an assumption of random sampling for the robustness of Pearson’s r?

A

considered inappropriate to conduct if violated, but some argue ti is robust if violated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is an assumption for Pearson’s r in regards to independence of cases?

A

each case is NOT influenced by other cases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is an assumption for Pearson’s r in regards to the robustness of the independence of cases?

A

NOT robust to violations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What is an assumption for Pearson’s r in regards to normality?

A

the DV is normally distributed in each population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What is an assumption for Pearson’s r in regards to normality’s robustness?

A

robust to violations if the sample size is large

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What is an assumption for Pearson’s r in regards to linearity?

A

the relationship between the two variables in the population is a linear one

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

What is an assumption for Pearson’s r in regards to linearity’s robustness?

A

not robust

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
32
Q

What is the sign for adjusted correlation coefficient?

A

Radj

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
33
Q

What is Radj?

A

when the number of observations is small, the sample correlation will be a BIASED estimate of the population correlation coefficient (not accurate). To correct for this, we can computed the Radj, which is an unbiased estimate of the population correlation coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
34
Q

What is the formula for Radj?

A

Radj = square root of (1 - [(1-r^2) (N-1)/(N-2)]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
35
Q

What is the reduced formula for Radj?

A

Radj = square root of (1 - (N-1/N-2) (1 - r^2))

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
36
Q

What is the symbol for the correlation coefficient in the population?

A

(p) rho

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
37
Q

What is the (p) rho?

A

an unbiased estimate of the correlation coefficient in the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
38
Q

What is the amount that the common hypothesis that we test for a sample correlation between X and Y in the population, denoted by p (rho)?

A

zero

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
39
Q

Why is the p (rho) a meaningful test of the population correlation coefficient?

A

the null hypothesis being tested is really the hypothesis that X and Y are linearly independent.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
40
Q

Rejection of the hypothesis that X and Y are linearly independent leads to…

A

the conclusion that they are not independent, and there is some linear relationship between them

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
41
Q

What is the sampling distribution of r when p = 0?

A

above 0 is ur +, and below is ur -

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
42
Q

When p = 0 and the sample size is relatively large, the sampling distribution of r will be _____ with a standard error of ____

A

normal, sr

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
43
Q

What is the formula for standard error of sr?

A

sr = square root of [(1 - r^2)/(N-2)]

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
44
Q

What can we calculate a t-stat for a correlation coefficient as?

A
df = n - 2
t = r (square root of N-2)/(square root of 1 - r^2)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
45
Q

What factors influence the Pearson r?

A

(L) linearity
(O) outliers
(R) restriction of range
(C) context

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
46
Q

Why does linearity affect the Pearson r?

A

r will underestimate the relationship of a bivariate distribution by departing from linearity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
47
Q

Why do outliers influence the Pearson r?

A

discrepant data points, or outliers, affect the magnitude of r and the direction of the effect depending on the outlier’s location in the scatterplot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
48
Q

Why does restriction of range affect the Pearson r?

A

other things being equal, restricted variation in either X or Y will result in a lower Pearson r and would be obtained were variation greater

49
Q

Why does context influence Pearson r?

A

because of the many factors that influence r, there is no such thing as the correlation between two variables. RAther, the obtained r must be interpreted in full view of the factors that affect it and the particular conditions under which it was obtained

50
Q

How many possible directions of causality are possible when two variables are correlated?

A

3

51
Q

What is the relationship of correlation and causality?

A

1st variable causes 2nd
2nd variable causes 1st
some 3rd variables causes both the 1st and 2nd

52
Q

What issues are there in interpreting the correlation coefficient?

A

statistical significance
r^2
restriction in range
unreliability of measurement

53
Q

What is r^2?

A

the proportion of common variance or the proportion of common variance shared by two variables, used to compare correlations

54
Q

What is the z-distribution to do hypothesis testing and to compute confidence intervals?

A

z = X_ - u/ ox

55
Q

What if we don’t know o and had to estimate it from our sample?

A

if we repeated choose samples of size n and computer variable as

s^2 = sum of (X - X_)^2 / (n-1)

we will create a distribution of variances and the mean of the distribution of s^2

56
Q

What is the formula for the t-score for a sample t-test?

A

t = X_ - ux / sx

57
Q

Why is s^2 typically equal to o^2?

A

the sample variance computed with n - 1 in the denominator (s^2) is an unbiased estimate o fo^2

58
Q

How do we ensure an unbiased estimator?

A

we use the degrees of freedom

one-sample is n - 1, whereas ANOVA is n - 2

59
Q

Who figured the t-distribution?

A

William Sealy Gosset

60
Q

Who wrote Student (1908) The probable error of a mean, Biomethika?

A

William Sealy Gosset

61
Q

True or false: in general t-scores are normally distributed, but the distribution of t-scores is asymmetric and has a mean of 1

A

false.

In general, t-scores are not normally distributed but the distribution of t-scores is symmetric and has a mean of zero

62
Q

Who do t-distributions vary?

A

according to the degrees of freedom

63
Q

Does sample size influence the t-distribution?

A

hell yeah

64
Q

With ___ observations, t-distrubtion approximates z-distrubtion. What’s the answer?

A

n = 30 or more

65
Q

What are degrees of freedom?

A

a mathematical concept that involves the amount of freedom you have to substitute various values in an equation

66
Q

Should we indicate the degrees of freedom when listing the t-value?

A

Yes

67
Q

Should we indicate the degrees of freedom when listing the t-value?? And Why?

A

Yes, because the size of the n affects the integrity of the t-distribution

68
Q

What kind of test do you use when you know the o?

A

use a sample z-test, as well as confidence intervals

69
Q

What kind of test do you use when o is not known?

A

the one-sample t-test, independent samples t-test (2), or the dependent samples t-test (2)

70
Q

What should you know about the CI with the one sample t-test?

A

the CIs about the mean

71
Q

What should you know about the CIs with the independent samples t-test?

A

the CIs about the difference between means

72
Q

What do you need to know about the dependent samples t-test and CIs?

A

CIs about the mean DIFFERENCE

73
Q

Hirsch (1997) examined cortical activity associated with
production of speech in bilingual subjects. She took 7 subjects
who became bilingual relatively late in life (meaning after the
age of about 10), and those who were bilingual early on.
(There are some fields of psychology in which 10 year olds
are considered ʺover the hill.ʺ)
 The first question concerns whether cortical activity
associated with language production takes place in slightly
different regions of Broca’s area when speaking two different
languages. The following data represent the data collected
from 6 subjects who were bilingual early. The dependent
variable is the distance between the centroids of those two
areas What tetst did he use?

A

the one sample t-test

74
Q

How do you find the one sample t-test in SPSS?

A

Analyze - Compare Means - one-sample t-test

75
Q

If the test value is 0, what does that mean in an SPSS one sample t-test output?

A

that the mean is 0, which mean it’s standardized

76
Q

The most interesting part of Hirsch’s study was the fact that
she compared early and late bilingual speakers. The data
below represent the distance between centroids of activation
in Broca’s area when people are thinking in two different
languages. What test would be used?

A

the paired-samples t-test

77
Q

How do I find the t-test for dependent means?

A

Analyze - Compare means - Paired-sample t-test

78
Q

If pairs are not correlated what does that mean?

A

the pairs are not similar enough to warrant a test of a paired samples t-test

79
Q

Everitt (1994) compared two different treatments for
anorexia. One group represented a family therapy
intervention, while the other was a control group. The
dependent variable was the amount of weight gain over the
course of the experiment. He had earlier shown that girls in
the Family Therapy condition gained weight, but that could
simply be because the girls were growing older. Comparison
with a Control Group would help to clarify the results. The
data shown below have been taken from Everitt and are the
weight gains in the two treatment conditions. What test would you use?

A

an independent t-test

80
Q

How do you find an indpendent samples t-test/

A

Analyze - Compare means - independent-samples t-test

81
Q

What test do you need to do to before stating the results of an independent samples t-test?

A

Levene’s test

82
Q

Do you want Levene’s test to be significant or not?

A

No, because an insignificant t-test means that the variances can be rightfully assumed to be equal, and therefore we can and use the same degrees of freedom

83
Q

What do we assume about a one-sample t-test in regards to random sampling?

A

the sample is a random sample from the pop, and robustness is considered inappropriate to conduct if violated, but some argue it is robust if violated

84
Q

What do we assume about a one-sample t-test in regards to independence of observations?

A

cases within the sample don’t influence each other, and it is not robust to violations

85
Q

What do we assume about a one-sample t-test in regards to normality?

A

the DV is normally distributed in the pop, and robust to moderate violations if the sample size is large

86
Q

It is generally considered that psychological characteristics (like personality) and physical characteristics (like height) are not normally distributed. True or false?

A

False. It’s the opposite

87
Q

R^2 is:

A

the proportion of common variance or the proportion of common variance shared by two variables

88
Q

R^2 is also

A

used to compare correlations

89
Q

Is there a restriction in range for r^2?

A

Yes

90
Q

Is r^2 considered a reliable measurement

A

No

91
Q

How do you do correlations in SPSS?

A

Analyze - Correlate - Bivariate - …

92
Q

What does Pearson’s product moment correlation measure?

A

degree of association between two variables

93
Q

What are the conditions that Pearson’s product moment correlation measures degree of association between two variables?

A

where both variables are:

  • LINEARLY correlated
  • measured on a CONTINUOUS scale
  • have a degree of NORMALITY and HOMOGENEITY OF VARIANCE allowing assumptions to be made
94
Q

What is assumed for Pearson’s correlation?

A

bivariate analysis has variables that are:
linearly correlated
continuous scale variables
are normal and have a homogeneity of variance

95
Q

What are Pearson correlation coefficients?

A

when both factors are continuous (i.e. interval or ratio data)

96
Q

What are Point-Biserial correlation coefficients?

A

when one factor is dichotomous (nominal) and one factor is continuous (interval or ratio)

97
Q

What is a Phi coefficient?

A

when both factors are dichotomous (nominal)

98
Q

What are Spearman correlation coefficients?

A

when both factors are rank or ordinal data (i.e. non-normally distributed data)

99
Q

What are Kendall’ tau correlation coefficients?

A

when you have a SMALL data set with a LARGER number of tied ranks; a better estimate of the correlation in the population

100
Q

How do you determine correlation in SPSS?

A

Analyze - Correlate - Bivariate Correlations - enter variables - use Pearson Correlation Coefficient - ensure it is a two-tailed test of significance, and that it is flagged for significant correlations

101
Q

What is the degrees of freedom for a Pearson’s correlation coefficient?

A

N - 2
e.g. Assignment 3 has a sample of 50 professors, therefore the correlation results would be:
r(48) = X.XX, p = .XXX

102
Q

What line is the correlation coefficient in SPSS output?

A

on the line Pearson correlation according to the specific variable (will be the same for each intersection between the two same variables, and be 1 for the same variable)

103
Q

What is important to note in SPSS output?

A

the asterisk to the right of the correlation indicates that the result is stat significant at the .05 alpha level, two-tailed (*) or the .01 alpha level, 2-tailed (**)

104
Q

How do you find out the power of a Pearson’s correlation in SPSS?

A

use the Sample Power function and enter the correlation coefficient in the Population correlation area, enter the ACTUAL number of cases in the N of Cases area, and also enter another group under population as TEST AGAINST THE CONSTANT with a Population Correlation of 0.00.
the power will show a standard error, be indicated as 95% lower, 95% upper, and the power in the bar will be indicated under this

105
Q

What is an example of a correlation analysis results in APA format?

A

In this study, the relationship between a husband’s flexibility in his gender role and a wife’s degree of marital satisfaction was assessed in a RANDOM SAMPLE of eight married couples from one city. There was a stat significant, positive relationship, r(df) = .76, p < .001, between the two variables: couples with a high score on one varaible tended to have a higher score on the other. The 95% confidence interval for p, the population value of the relationship between male gender role flexibility and female marital satisfaction, ranges from .12 to .95 when expressed in Pearson correlation coefficient units. This WIDE confidence interval indicates UNCERTAINTY about the strengthof the relationshp between gender role flexibility and marital satisfaction in the population.

106
Q

How do you represent power in a correlation analysis?

A

E.g. While the power of the correlation analysis is 0.97, the small sample size in the present study makes it impossible to say how strong the relationshp between these two variables is in the larger population. The study should be replicated with a larger sample size in order better to determine the strength of the relationship.

107
Q

What is an example of showing what a correlational study doesn’t address?

A

E.g. This is a correlational study, so it does not address whether (1) more gender flexible husbands lead to more satisfied wives, or (2) wives who are more satisfied give their husbands the leeway to be more gender flexible, or (3) that a third variable – such as age, education level, or socioeconomic status – could influence both marital satisfaction and gender role flexibility. Future research should attempt to determine the order of the relationship.

108
Q

How to show a correlation matrix?

A

Table (N = x)

Variable 1 2 3 4 5

  1. A -
  2. B .08 -
  3. C .08 .10 -
  4. D -
  5. E -

p < or = .05, two-tailed

109
Q

What is the formula for power with a correlational study?

A

italicized d/power = u - u0/o

e.g. H1 people with psych problems have higher IQs, N = 25, u0 = 100, u1 = 105, o = 15

= 105 - 100/15 = 5/15 = 1/3

delta/Greek d = d square root of n

e.g. 1/3 x square root of 25 = 1.67

delta is converted on power table to become

Greek d 0.05

  1. 60 0.36
  2. 70 0.40

therefore, power is 0.38

110
Q

What do you use to find sample power?

A

Appendix Power: Power as a Function of Greek d/delta and Significance Level/alpha

111
Q

What is the formula for delta?

A

d x square root of n

112
Q

What is the formula for d?

A

= u - u0 / o

113
Q

What is G*Power?

A

software in SPSS that calculates the power from delta

114
Q

What is used in the G*Power calculation?

A

tails
effect size/d
alpha error problem
sample size
[test family (t tests)
Stat test: Means difference from constant (one sample case)
Type of power analysis: Post hoc - compute achieved power - given alpha, sample size, and effect size]

115
Q

What does a power of .38 mean?

H1 people with psych problems have higher IQs, N = 25, u0 = 100, u1 = 105, o = 15

A

If H0 is false and u1 is really 105, only 38% of the time can a clinician expect to find a statistically significant difference between her sampl e mean and that specified by H0. A prob of .38 is rather discouraging because it means that if the true mean really is 105, 62% of the time a clinician will make a Type II error

116
Q

How does sample size affect the probability that the interval will contain the population mean?

A

sample size has NO effect on the probability the interval will contain the population mean

117
Q

True or false: Larger sample sizes decrease the width of confidence intervals but leave the probability that the interval contains the population mean unchanged.

A

True

118
Q

What are the assumptions about data to create confidence intervals?

A

The two populations have the same variance. This assumption is called the assumption of homogeneity of variance.

  1. The populations are normally distributed.
  2. Each value is sampled independently from each other value.