Midterm revision 3 Flashcards

1
Q

What is data analysis?

A

Non inferential description of an empirical distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is an empirical distribution?

A

A set of scores on a variable or set of variables, where one score is for one variable or set of variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Which has scientific priority - data analysis or inferential stats?

A

data analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are quantitative representations?

A

Succinct and accurate descriptions of an empirical distribution’s: location, dispersion and shape

ARE NOT: references to normality, hypothesis speculations, claims regarding population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is location?

A

In a metaphorical sense, it is where the empirical distribution sits on the X axis.

Includes the mean, median and mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the median?

A

The point at/below which 50% of the scores lie

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the mode?

A

The number of peaks

poor information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is dispersion?

A

how spread out the scores are

Includes: variance, SD, and range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is variance?

A

Average squared distance that each score is from the mean.
Natural dispersion counterpart of the mean
Has to be non-negative, not bounded above, bounded below

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is standard deviation?

A

Square root of variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is shape?

A

Shape of the empirical distribution

Includes skewness and kurtosis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is skewness?

A

Captures degree of symmetry (NOT normality)

Symmetrical: 0
Positively skewed: hill falls down to positive end
Negatively skewed: hill falls down to negative end

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is kurtosis?

A

Captures the degree of peakedness or flatness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are transformations?

A

Involve taking a function of a variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the reasons for transforming?

A
  1. To re-express distribution for taste, preference, and convenience (e.g., transforming to a proportion for comparison across different scales)
  2. To bring a distribution into alignment with a theoretical distribution (involves nonlinear transformations that change the shape of a distribution, e.g., normality)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the outcomes of linear transformations?

A

Linear transformations do not change the shape, but change the location and dispersion in predictable ways

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What happens to the mean when a constant is added/deducted to/from it?

A

+/- by the same

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What happens to the mean when a constant is multiplied/divided to all scores?

A

x/div by the same

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What happens to the SD when a constant is added/deducted to/from it?

A

It stays the same

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What happens to the SD when a constant is x/div to all scores?

A

x/div by the same

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the outcome of a nonlinear transformation?

A

It will alter the shape in unpredictable ways

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is a sampling distribution?

A

A frequency distribution of a statistic (repeated with an infinite number of samples)

*theoretical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is standard error?

A

Standard deviation of a sampling distribution

*theoretical

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What are confidence intervals?

A

Upper and lower bounds for the population. Shows where the data from the actual population is likely to fall.

e.g., 95% confidence interval = 95% all samples will contain that value; 95% chance that a sample from the population will have that obtained value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What are the 10 steps in hypothesis testing?

A
  1. Data analysis
  2. Specify hypothesis pair
  3. Specify population distribution
  4. Deduce assumptions
  5. Formulate if/then link
  6. Type 1 Error control
  7. Type 2 Error control
  8. Decision Rule
  9. Employ a procedure to make a decision about hypothesis pair
  10. If H0 rejected, estimate effect size
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What is the purpose of inferential testing?

A

To render a decision as to which of the two (H0 or H1) is in fact that state of nature at the moment the procedure is employed. i.e., to make a correct binary decision regarding the extant state of nature.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What are parametric approaches?

A

Begin with modeling the distribution of X in P by a theoretical density function
Makes an assertion regarding the parent population distribution
Makes assumptions regarding the parent population distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What are the advantages of parametric approaches?

A

you know what is going on regarding the distribution, errors, powers, etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

What are the disadvantages of parametric approaches?

A

They invoke artificiality

30
Q

What are non-parametric approaches?

A

No claims made regarding X in P

31
Q

What are the disadvantages to using parametric approaches?

A
  1. Distributional results are shaky

2. B/c no structure - unclear about what is being tested: means? variance? shape?

32
Q

Do parametric vs non-parametric have greater strength of inference?

A

Parametric have greater strength of inference

33
Q

What is an assumption?

A

An assumption is any potentially false assertion made about the hypothesis pair, save for H0 and H1. An assumption is a side condition, asserted solely for the purpose of deriving a statistical method for testing a hypothesis.

34
Q

What is another term for an assumption?

A

“nuisance conditions”

35
Q

What are the typical parametric assumptions?

A

Normality and homogeneity of variance

36
Q

What is an IF-THEN link?

A

The test statistic component of inferential machinery. Establishes what to expect at the level of the sample when H0 happens to be true.

37
Q

What is the structure of an IF-THEN link?

A

Antecedent: If H0 is true and assumptions are met
Consequent: THEN [Test stat] will pile up as [insert null distribution]

38
Q

What is a test statistic?

A

A statistical test employed to make a decision regarding the hypothesis pair

39
Q

What is the null distribution?

A

The complete set of values of the test statistic, over an infinity of samples, under the condition that H0 is true.

A sampling distribution of the statistic observed under the condition that H0 is, in fact, the extant state of nature

i.e., if H0 was true, and you calculated statobs for an infinite number of samples, the plotted those statistics observed, you would get the null distribution

40
Q

What is statobs?

A

The value obtained from the test statistic from one sample

41
Q

Type 1 error

A

The probability of rejecting H0 when H0 true

42
Q

Type 2 error

A

The probability of retaining H0 when H0 false

43
Q

What is Beta?

A

Type 2 error

44
Q

What is a sound inferential procedure?

A

One which yields correct decisions with high probability. We determine whether a procedure is sound by checking whether: 1) assumptions are reasonable; 2) probability of type 1 error has been made small; 3) probability of type 2 error has been made small

45
Q

What are the two forms of decision rules?

A

Probability-probability

Point-point

46
Q

What is the point-point decision rule?

A

Compares statobs with a critical value

If crit1 < statobs < crit2, then reject H0, else retain

47
Q

What is a critical value?

A

A value used to partition the X axis into sections (acceptance and rejection regions)

48
Q

How do you choose critical values?

A

The basis of choosing critical values, and thence forming a decision rule, rests on considerations of type 1 error control, and a sound inferential procedure

49
Q

How do you control Type 1 error?

A

Choice of critical values - this is done by fixing alpha to a low value.

50
Q

What is the probability-probability rule?

A

Compare Pobs to alpha

If Pobs is greater than or equal to alpha, reject H0, else retain

51
Q

What is Pobs?

A

The probability of drawing, from the null distribution, a value as or more extreme than statobs

It is a nonlinear tranformation of statobs

Smaller Pobs, less in keeping with H0
Larger Pobs, more in keeping with H0

52
Q

How do you control for Type II error?

A
  1. Change alpha

2. Increase sample size

53
Q

How can H0 be false?

A

There are an infinite number of ways that H0 can be false.

For each and every possible departure from H0, there is an associated probability of a type II error

54
Q

What is phiprime*?

A

To consider the control of type II error, we require an effect size: phiprime*

phiprime* quantifies possible degrees of departure from H0

Possible degrees by which H0 may be incorrect

55
Q

What is power?

A

The probability that the procedure will detect a departure from H0 of phiprime*, should such a departure exist when the procedure is employed

56
Q

What is the aim of Type II error control?

A

Maximizing power for each phiprime*, the detection of which is of interest

57
Q

Describe power analysis

A
  1. Nominate one or more phiprime* that (should any of these be the state of nature when the procedure is employed) you are interested in detecting
  2. Calculate power for each
  3. Forward looking: consider in advance of employing a test procedure, how the procedures would do in a probabilistic sense, if nature happens to “serve up”, when the procedure is employed, a departure of phiprime*
58
Q

How do you decide on a phi’* of interest?

A
  1. Knowledge of your research area

2. Cohen’s guidelines

59
Q

What are the two power “games”?

A
  1. calculation of power for a fixed sample size, alpha and phi’*, the detection of which is of interest to us
  2. Calculation of sample size to yield desired power(phi’), for fixed alpha and phi’, the detection of which is of interest
60
Q

What does power game one involve?

calculation of power for a fixed sample size, alpha and phi’*, the detection of which is of interest to us

A
  1. Use Keisan to evaluate relevant area, power’*, under the relevant alternative distribution
61
Q

What is the issue with power game 1?

calculation of power for a fixed sample size, alpha and phi’*, the detection of which is of interest to us

A

Does not ensure the procedure is sound

62
Q

Why would we use power game 1?

A

If our sample size is fixed or we are calculating for another researcher’s analysis

63
Q

What does power game 2 involve?

A

Working backwards

64
Q

How does the null distribution change when you have a directional hypothesis pair?

A

There is an infinity of values associated with the null state of nature

65
Q

How do you control type II error with a directional hypothesis?

A

Because the alternative distribution shifts to the right, use the positive critical value when calculating power

66
Q

How do you fix alpha for a directional hypothesis?

A

Use the boundary null - the last point in keeping with H0

67
Q

What is the bivariate distribution of Y and X?

A

Shows how Y is dependent on X. i.e., how Y varies as a function of X

Represents an infinite number of individuals (p) in the population, each p has an X value and a Y value

68
Q

What is the conditional distribution of Y given X?

A

The distribution of Y conditional on X given that X takes on a particular score of X.

A population of Y scores (all Y values with some particular X value)

of conditional distributions depends on the # of particular values

69
Q

What is omega_sq?

A

Portion of the variance explained out of total variance

70
Q

What are the possibilities with respect to strength for a dichotomous IV and continuous DV?

A

Linearity not defined, and no sense talking about steepness. Therefore we cannot talk about primary and secondary strength separately.

Thus, we blend primary and secondary strength with omega_sq

71
Q

What are the focal factors?

A

The factors of interest