Exam 3 Flashcards

Question 1

Q

What are the Assumptions for Parametric Tests?

Answer

A

Interval or Ratio Data
Population follows Normal Distribution Curve
Homogeneity of variance in the population
Numerical score for each individual

Question 2

Q

When to use Parametric Test?

Answer

A

Normal Distribution

Interval/Ratio scale data
Sample size 30 or more

Question 3

Q

What are the Assumptions for Non-Parametric Tests?

Answer

A

Nominal or Ordinal Data

Not normally distributed

Question 4

Q

What does the P Value mean?

Answer

A

Probability of obtaining a result at least as extreme as the one observed if H0 (the null hypothesis) were true
It is compared with alpha, which is the probability of a Type I error.

Question 5

Q

What are the P Value cut offs (Alphas)?

Answer

A

.005, .01, .05, .10, .25

.05 and .01 most common

Question 6

Q

What is a Parametric Test?

Answer

A

An ordinary hypothesis-testing procedure that requires assumptions about the shape of the population distribution or other population parameters.

Question 7

Q

What are Degrees of Freedom?

Answer

A

(df) the number of scores in a sample that are free to vary
An honesty factor when comparing samples to a population

(When generalizing from a sample to a population, we lose some power due to sample size.)

Question 8

Q

How do we calculate Degrees of Freedom (df)?

Answer

A

df for Pearson’s correlation: n - 2
df for Goodness of Fit Test: k - 1, where k is the number of categories
df for Test of Independence: (k_rows - 1)(k_columns - 1), one k for the rows and one k for the columns
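The three df rules above can be written as small helper functions. This is only a minimal sketch; the function names are made up for illustration and are not from the course material.

```python
def df_pearson(n: int) -> int:
    """Degrees of freedom for Pearson's correlation: n - 2."""
    return n - 2


def df_goodness_of_fit(k: int) -> int:
    """Degrees of freedom for the Goodness of Fit test: categories - 1."""
    return k - 1


def df_independence(rows: int, columns: int) -> int:
    """Degrees of freedom for the Test of Independence: (rows - 1)(columns - 1)."""
    return (rows - 1) * (columns - 1)


print(df_pearson(30))          # 28
print(df_goodness_of_fit(4))   # 3
print(df_independence(2, 3))   # 2
```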

Question 9

Q

What is the Correlation Method?

Answer

A

The technique where two or more variables are measured and the naturally occurring relationship between them is assessed.

Question 10

Q

What are the Characteristics of a correlational Relationship?

Answer

A

Direction: negative or positive, indicated by the + or - sign of the correlation coefficient

Shape/Form: linear is most common

Magnitude/Strength: the strength of the relationship between two variables varies from 0 to ±1

Question 11

Q

Correlation is not a ___________

Answer

A

proportion

Question 12

Q

Squared Correlation (r2) is defined as what?

Answer

A

Coefficient of Determination

Question 13

Q

What are the Assumptions of the Correlation Method?

Answer

A

Causality: the assumption that a correlation indicates a causal relationship between the two variables.
Directionality: inference based on direction of a causal relationship between two variables.
Correlation ‘describes’ a relationship but does NOT demonstrate causation.

Question 14

Q

What is the Experimental Method?

Answer

A

A research technique that establishes the causal relationship between an IV (x) and a DV (y) by randomly assigning participants to experimental groups characterized by differing levels of x, and measuring the average behavior y that results in each group.

Question 15

Q

The experimental method is the only method that allows a researcher to establish a ________ ___ _______ relationship.

Answer

A

cause and effect

the researcher has full control of the experimental environment.

Question 16

Q

What is the strength of the experimental method?

Answer

A

it isolates the relationship between the independent and dependent variable.

Question 17

Q

What are concerns with Correlational Method?

Answer

A

there might be other influences on the variables (third variable) that make it hard to measure how strong the relationship between the two is.

Question 18

Q

What conclusions can be made from the Correlational Method?

Answer

A

Predictions about the likelihood of two variables occurring together.

Question 19

Q

What conclusions can be made from the Experimental Method?

Answer

A

Experiments are generally the most precise studies
have the most conclusive power.
effective in supporting hypothesis about cause and effect

Question 20

Q

What are the components (numerator, both pieces of the denominator) of the Pearson’s r equation?

Answer

A

numerator: co-variability of X and Y
denominator: variability of X and Y separately

Question 21

Q

What information do we gain from r?

Answer

A

r (for a sample) is an estimate of the population correlation coefficient.

Question 22

Q

How do we interpret the r?

Answer

A

A measure of the strength and direction of the linear relationship between two variables, defined as the ‘sample’ co-variance of the variables divided by the product of their (sample) standard deviations.
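The definition above (co-variability of X and Y over the variability of X and Y separately) can be sketched in code. The helper name pearson_r and the sample scores are made up for illustration. Because the (n - 1) terms in the co-variance and the standard deviations cancel, the sketch works directly with sums of products and sums of squares.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson's r: co-variability of X and Y over their separate variability."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Numerator: sum of products of deviations (co-variability of X and Y)
    sp = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Denominator pieces: sum of squared deviations for X and for Y
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    return sp / sqrt(ss_x * ss_y)


# Hypothetical scores for five participants
x_scores = [1, 2, 3, 4, 5]
y_scores = [2, 4, 5, 4, 5]

r = pearson_r(x_scores, y_scores)
print(round(r, 3))        # 0.775
print(round(r ** 2, 2))   # 0.6 (coefficient of determination)
```

Squaring r gives the coefficient of determination, so here about 60% of the variability in Y would be predictable from X.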

Question 23

Q

Can we conclude there is a cause and effect relationship based on a correlation?

Question 24

Q

What is the r2 and why do we use it?

Answer

A

Coefficient of Determination
Measures the proportion of variability in the data explained by the relationship between X and Y.
Example: r2 = 0.64 means that 64% of the variability in the Y scores can be predicted from the relationship with X.

Question 25

Q

What are the different types of correlations and when are they used?

Answer

A

Pearson’s r: interval or ratio variables
Spearman’s rho (ρ): at least 1 ordinal variable
Point-biserial: 1 dichotomous, 1 interval/ratio
Phi (φ) coefficient: 2 dichotomous variables.

Question 26

Q

What is the Third Variable Problem?

Answer

A

A correlation between two variables being dependent on another third variable.

Question 27

Q

What are the Correlation Concepts?

Answer

A

Relationship between two variables
Ratio of change in one variable with respect to another
Can be positive, negative, none or curvilinear

Question 28

Q

What are indicators of Significance of Correlations?

Answer

A

Obtained correlation must exceed the magnitude/strength of the critical value
Computing r alone, only provides a measure of strength
Significance takes into account strength AND number of participants in study.

Question 29

Q

What are the three Correlation Magnitudes/strengths and Directions/powers?

Answer

A

Strong: ±.70 – 1.00
Moderate: ±.30 –.69
Weak: ±.00 –.29

Question 30

Q

What do the components of the chi-square equation indicate?

Answer

A

χ² is the chi-square statistic (χ is the lower-case Greek letter chi)
O is the Observed Frequency
E is the Expected Frequency

Question 31

Q

Describe the components of the Chi-Square Formula

Answer

A

The sum of the squared differences between observed and expected frequencies, each divided by its expected frequency.

χ² = Σ (O - E)² / E
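As a worked illustration of the formula above, here is a minimal sketch that sums (O - E) squared over E across the categories. The function name and the frequencies are hypothetical.

```python
def chi_square(observed, expected):
    """Sum of (O - E)^2 / E across all categories or cells."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Hypothetical data: 60 people choose among 3 options; H0 says no preference,
# so the expected frequency is 20 per category.
observed = [30, 20, 10]
expected = [20, 20, 20]

print(chi_square(observed, expected))   # 10.0
```

The obtained value would then be compared with the critical value for df = k - 1 = 2.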

Question 32

Q

Why and how do we calculate Effect Size with Phi Coefficient?

Answer

A

For a 2x2 matrix, the phi coefficient (Φ) measures the strength (effect size) of the relationship.

Φ = √(χ² / N)
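A quick sketch of the phi calculation, assuming the chi-square value has already been obtained. The numbers are hypothetical.

```python
from math import sqrt


def phi_coefficient(chi_square_value, n_total):
    """Effect size for a 2x2 matrix: square root of chi-square over N."""
    return sqrt(chi_square_value / n_total)


# Hypothetical result: chi-square = 10 with N = 40 participants
print(phi_coefficient(10, 40))   # 0.5
```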

Question 33

Q

What are the two ‘ways’ to calculate Effect Size?

Answer

A

Phi Coefficient

Cramer’s V

Question 34

Q

Why and how do we calculate Effect Size with Cramer’s V

Answer

A

For a larger matrix, Cramer’s V measures the strength (effect size) of the relationship.

Note: with Cramer’s V, df* is the smaller of (rows - 1) and (columns - 1)

V = √(χ² / (N × df*))
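The same idea extends to larger matrices. The sketch below picks the smaller of (rows - 1) and (columns - 1) as df* before taking the square root; the function name and values are hypothetical.

```python
from math import sqrt


def cramers_v(chi_square_value, n_total, rows, columns):
    """Effect size for matrices larger than 2x2."""
    df_smaller = min(rows - 1, columns - 1)   # df* in the card above
    return sqrt(chi_square_value / (n_total * df_smaller))


# Hypothetical result: chi-square = 18, N = 100, 3 rows by 4 columns (df* = 2)
print(round(cramers_v(18, 100, rows=3, columns=4), 2))   # 0.3
```

For a 2x2 matrix df* is 1, so the calculation reduces to the phi coefficient.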

Question 35

Q

What are the three Phi Coefficient Effect Size Interpretations for χ² (“Magnitude and Power”)?

Answer

A

.10 – .29 Small effect
.30 – .49 Medium effect
.50 or greater Large effect

Question 36

Q

How do we report Chi-Square Statistic in APA format?

Answer

A

χ² (df, N = sample_size) = obt_value, significance.

Question 37

Q

Why do we use Goodness of Fit (one nominal variable)?

Answer

A

Uses sample data to test hypotheses about the shape or proportions of a population distribution
Tests the fit of the proportions in the obtained sample with the hypothesized proportions of the population

Question 38

Q

What are the Goodness of Fit Assumptions?

Answer

A

Individuals are classified in each category (grades, exercise frequency)
Observed frequency is tabulated for each measurement category (classification)
Each individual is counted in only one category (no overlap).

Question 39

Q

When do we use a Chi-Square Goodness of Fit Test?

Answer

A

Used to see how well an Observed Frequency distribution fits an expected (or predicted) frequency distribution.
Non-parametric data
Data distribution does not follow the normal curve
Subjects do not have two scores within the same study (repeated measures)
Data must be a categorical/nominal variable (or transformed into nominal data)

Question 40

Q

Why use a Test of Independence?

Answer

A

Tests for evidence of a relationship between two variables
Two nominal variables, each with several categories
Each individual is jointly classified on each variable (e.g., male - tall)
Counts are presented in the cells of a matrix
Design may be experimental or non-experimental
Frequency data is used to test for relationship between the two variables using a two-dimensional frequency matrix.

Question 41

Q

What does a Test of Independence Null hypothesis mean?

Answer

A

the two variables are independent (no relationship)

Question 42

Q

What are the two versions of the Test of Independence?

Answer

A

Single Population: no relationship between two variables in this population
Two Separate Populations: no ‘difference’ between the distribution of variables in the two populations.

Question 43

Q

Reasons you would use a Test of Independence?

Answer

A

Non-parametric data
Data is not interval or ratio-scaled
Data distribution does not follow the normal curve
People cannot have two scores within the same study (repeated measures)
There are two nominal variables, each with several categories

To see whether paired observations on two variables are independent of (different population), or have a relationship to (same population), each other.

Question 44

Q

What is the meaning of Expected Frequency?

Answer

A

Expected frequencies are based on the null hypothesis (H0) prediction of the same ‘proportions’ in each category (population)
Expected frequency of any cell is jointly determined by its column proportion and its row proportion.
Computing Expected Frequencies:

E = (Column total)(Row total) / N total
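The expected-frequency formula above can be applied to every cell of a matrix at once. This is a minimal sketch with a made-up 2x2 table of observed counts; the function name is hypothetical.

```python
def expected_frequencies(observed):
    """Expected frequency per cell: (column total)(row total) / N total."""
    row_totals = [sum(row) for row in observed]
    column_totals = [sum(column) for column in zip(*observed)]
    n_total = sum(row_totals)
    return [[row_total * column_total / n_total for column_total in column_totals]
            for row_total in row_totals]


# Hypothetical observed counts (rows = one variable, columns = the other)
observed = [[10, 20],
            [30, 40]]

print(expected_frequencies(observed))   # [[12.0, 18.0], [28.0, 42.0]]
```

The expected frequencies keep the same row and column totals as the observed table, which is a handy check on the arithmetic.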

Question 45

Q

What is the meaning of Observed Frequency?

Answer

A

Frequencies in the sample are ‘observed’ frequencies for the test

Question 46

Q

When do we use a Cramer’s V or Phi?

Answer

A

Cramer’s V: Larger than a 2x2 Matrix

Phi: 2x2 Matrix

Question 47

Q

If the calculated chi-square value is greater than or equal to the critical chi-square value (from table), what do we do?

Answer

A

Reject the null hypothesis.

Question 48

Q

If I increase the categories what happens to the critical value?

Answer

A

Critical value goes up

Question 49

Q

What are the components of the line of best fit?

Answer

A

Regression is a method of finding an equation describing the best-fitting line for a set of data
Both variables are measured at the interval level (interval data)
Data must be from a random sample
Normal data distribution (or have a large sample)
How to define a “best fitting” straight line when there are many possible straight lines?
The answer: A line that is the best fit for the actual data that minimizes prediction errors

Question 50

Q

Why do we use regression?

Answer

A

The goal for regression is to find the best-fitting straight line for a set of data. For every X value in the data, the linear equation determines a Y value on the line.

Question 51

Q

What information do we know and what information are we trying to predict with a Regression Line?

Answer

A

If we know the equation of the regression line, we can predict values of criterion variable Y, so long as we know X: Ŷ = bX+a
The regression procedure produces a line that minimizes total squared error of prediction
This method is called the least-squared-error solution
The purpose of the regression equation is to predict a value of Y from a value of X
Precision of the estimate is measured by the standard error of estimate (SEoE)
In regression, predicting Y from X also involves error for the same reason as we found in our z-test
Residual: The amount of error we make in predicting Y from X
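To make Ŷ = bX + a concrete, here is a minimal least-squares sketch. The slope and intercept formulas (b = SP / SSx, a = mean of Y minus b times mean of X) are the standard least-squared-error solution rather than something stated on the cards, and the function name and scores are made up.

```python
def fit_regression_line(xs, ys):
    """Return (b, a) for the least-squared-error line Y-hat = bX + a."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sp = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    b = sp / ss_x               # slope
    a = mean_y - b * mean_x     # Y intercept
    return b, a


# Hypothetical predictor (X) and criterion (Y) scores
x_scores = [1, 2, 3, 4, 5]
y_scores = [2, 4, 5, 4, 5]

b, a = fit_regression_line(x_scores, y_scores)
print(round(b, 2), round(a, 2))   # 0.6 2.2

# Predict Y for a new X, then compute the residuals for the original data
print(round(b * 6 + a, 2))        # 5.8
residuals = [y - (b * x + a) for x, y in zip(x_scores, y_scores)]
```

Each residual is the error made in predicting Y from X for one participant; for a least-squares line the residuals sum to zero.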

Question 52

Q

How is Standard Error of Estimate related to correlation?

Answer

A

The relationship between correlation and SEoE:
As the correlation becomes stronger (as r goes from 0 to ±1) SEoE decreases to 0 because there is less error variability in the data
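The relationship above can be checked numerically. This sketch assumes the common textbook formulas SS_residual = (1 - r²) × SSy and SEoE = √(SS_residual / (n - 2)), which are not given on the cards; the SSy and n values are hypothetical.

```python
from math import sqrt


def standard_error_of_estimate(r, ss_y, n):
    """SEoE from the correlation, the variability of Y, and the sample size."""
    ss_residual = (1 - r ** 2) * ss_y   # variability in Y not predicted by X
    return sqrt(ss_residual / (n - 2))


# Hypothetical values: SSy = 100 and n = 12, with r growing stronger
for r in (0.0, 0.3, 0.6, 0.9, 1.0):
    print(r, round(standard_error_of_estimate(r, ss_y=100, n=12), 3))
```

The printed SEoE shrinks as r moves away from 0 and reaches 0 when r is ±1.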

Question 53

Q

What is basic APA FONT/Margin formatting?

Answer

A

Times New Roman font
12 point size
1 inch margins all around

Question 54

Q

What are aspects of an APA Reference page?

Answer

A

Begins on new page after end of Main Body

References centered at top of page
Double-spaced, using hanging indent, alphabetically
Only references cited within text should be listed

Question 55

Q

What goes in an APA formatted Abstract page?

Answer

A

Identify your purpose
Explain the study
Explain your methods
Describe your results 
Conclusion
Keywords

Question 56

Q

When do we use a Wilcoxon Rank-Order Test?

Answer

A

Non-parametric (define) data
Data is not interval or ratio-scaled
Data distribution does not follow the normal curve
You want to know if two groups (such as an experimental group and a control group) differ from each other

Question 57

Q

What are the Assumptions of the Wilcoxon Rank-Order Test?

Answer

A

Data must be converted to ranked (ordinal data) before conducting the test
Data distribution does not follow the normal curve
Observations are independent
Compares medians (Md) rather than means
Null hypothesis is that the two populations have the same median

Question 58

Q

What are the steps to transform the data for a Wilcoxon Rank-Order Test?

Answer

A

Transform the scores to ranks (lowest score is rank 1…)
When two scores are tied, both ranks become the average of the two tied ranks
(2+3)/2 = 2.5; use 2.5 as the rank for both scores.
Add up the total ranks in the group that you expect or hypothesize to have the lower scores, then compare that total to the Critical Values for W table(?).
How to check that you ranked the scores correctly:
n1 + n2 = N (total of both groups)
N(N+1)/2 = Sum of ranks
The sum of all assigned ranks should match N(N+1)/2
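The ranking steps and the sum-of-ranks check can be sketched as follows. The helper name rank_scores and the two groups of scores are made up for illustration; the lookup of the critical value in the W table is not shown.

```python
def rank_scores(scores):
    """Rank scores (1 = lowest); tied scores share the average of their ranks."""
    ordered = sorted(scores)
    rank_of = {}
    for value in set(ordered):
        first_position = ordered.index(value) + 1   # rank of the first tied score
        tie_count = ordered.count(value)
        # Average of the consecutive ranks occupied by the tied scores
        rank_of[value] = first_position + (tie_count - 1) / 2
    return [rank_of[score] for score in scores]


# Hypothetical scores; group A is hypothesized to have the lower scores
group_a = [3, 5, 8]
group_b = [5, 9, 12]

ranks = rank_scores(group_a + group_b)
print(ranks)                           # [1.0, 2.5, 4.0, 2.5, 5.0, 6.0]

rank_sum_a = sum(ranks[:len(group_a)])
print(rank_sum_a)                      # 7.5

# Check: the sum of all ranks should equal N(N + 1) / 2
n_total = len(group_a) + len(group_b)
print(sum(ranks) == n_total * (n_total + 1) / 2)   # True
```

The two scores of 5 would have taken ranks 2 and 3, so both receive 2.5, matching the tie rule above.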

Question 59

Q

What are we looking for with a Wilcoxon Rank-Order Test?

Answer

A

You want to know if two groups (such as an experimental group and a control group) differ from each other, but have nonparametric data.

Question 60

Q

Using the Wilcoxon Rank-Order Test, do you want it to be greater than or less than the critical value?

Answer

A

You want less than critical value

Question 61

Q

Why do you use Power when planning a study?

Answer

A

To help you decide how many participants you need

It is also important to understand power when you read a research article and want to make sense of how practical the results are.

Question 62

Q

What is beta and what type of error is it?

Answer

A

Beta = probability of a Type II error

Beta (β)

Question 63

Q

How is Power related to beta?

Answer

A

Power: the probability of correctly rejecting the null hypothesis (when the null hypothesis isn’t true)

Type II error (β): the probability of failing to reject the null hypothesis (when the null hypothesis is not true)
Power = 1 - β

Question 64

Q

What happens to Power as you increase effect size, sample size or power itself?

Answer

A

As effect size increases, power increases
As sample size increases, power increases
As power increases, beta decreases
One-tailed tests have a less stringent critical value than two-tailed tests, so using a one-tailed test increases power
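These relationships can be explored with a rough power calculation. The sketch below assumes a one-sample z-test with a known standard deviation and an effect in the predicted (positive) direction, which is a simplification chosen for illustration and not something the cards specify. The function name and the input values are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def z_test_power(effect_size, n, alpha=0.05, two_tailed=True):
    """Approximate power of a one-sample z-test (effect_size in SD units)."""
    std_normal = NormalDist()
    shift = effect_size * sqrt(n)   # how far the true mean sits from H0, in standard errors
    if two_tailed:
        z_critical = std_normal.inv_cdf(1 - alpha / 2)
        # Probability of landing in either rejection region
        return (1 - std_normal.cdf(z_critical - shift)) + std_normal.cdf(-z_critical - shift)
    z_critical = std_normal.inv_cdf(1 - alpha)
    return 1 - std_normal.cdf(z_critical - shift)


baseline = z_test_power(effect_size=0.5, n=10)
print("power:", round(baseline, 3), "beta:", round(1 - baseline, 3))
print("larger effect size:", round(z_test_power(0.8, 10), 3))
print("larger sample size:", round(z_test_power(0.5, 30), 3))
print("one-tailed test:", round(z_test_power(0.5, 10, two_tailed=False), 3))
print("stricter alpha (.01):", round(z_test_power(0.5, 10, alpha=0.01), 3))
```

Under these assumptions, a larger effect size, a larger sample, or a one-tailed test each raise power above the baseline, while a stricter alpha lowers it.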

Answer 64

A

If your n is too small, you will increase your chances of making a Type II error (low power)

If your n is too big, you may find statistically significant results, but they may not be practically important

Answer 65

A

Reducing the alpha level (making the test more stringent) reduces power

Using two-tailed (non-directional) reduces power

Answer 66

A

A priori = before you run the analysis

Post hoc = after you run the analysis

Answer 67

A

Parametric Data
Normal Distribution
Interval Data
When there is large enough sample size (at least 30)
Standard deviation is known
Random sampling

Answer 68

A

lot

small

no

Y

Answer 69

A

A variable that is truncated and has limited variability looks like a distorted variable and can give false impressions.

Answer 70

A

Can have fractions or percentages (multiply by totals)
Each observed frequency is generated by a different individual
Should not be performed when the expected frequency of any cell is less than 5

Answer 71

A

Contingency table (or Matrix) - a table in which the distributions of two nominal variables are set up so that you have combined observed frequencies and expected frequencies as well as the row and column totals.

Answer 72

A

Predictor, Criterion

Answer 73

A

Ŷ = bX+a

Answer 74

A

The degree of agreement among raters. It gives a score of how much homogeneity, or consensus, there is in the ratings given by judges.

Answer 75

A

Number of agreements (yes or no) / total number of samples
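The ratio above can be sketched as a simple agreement calculation between two raters. The function name and the ratings are made up for illustration.

```python
def percent_agreement(rater_a, rater_b):
    """Number of agreements divided by the total number of samples."""
    agreements = sum(1 for a, b in zip(rater_a, rater_b) if a == b)
    return agreements / len(rater_a)


# Hypothetical yes/no judgments from two raters on four samples
rater_a = ["yes", "no", "yes", "yes"]
rater_b = ["yes", "no", "no", "yes"]

print(percent_agreement(rater_a, rater_b))   # 0.75
```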

Answer 76

A

Whenever a sample has a restricted range of scores, the correlation will be reduced.

Answer 77

A

The more you play fantasy football, the fewer dates you go on

Answer 78

A

r (N – 2) = .value, p = .significance, r2 = .value

Answer 79

A

1 dichotomous and 1 Interval / Ratio

Answer 80

A

When there is at least one ordinal value

Answer 81

A

When there are Two dichotomous variables

Answer 82

A

Small = .00 - .29
Medium = .30 - .69

Answer 83

A

With parametric tests you are making assumptions about the population parameters; with nonparametric tests you are not assuming the population has any particular distribution.

Answer 84

A

The frequency value that is predicted from the proportions in the null hypothesis and the sample size (n).

The expected frequencies define an ideal, hypothetical sample distribution that would be obtained if the sample proportions were in perfect agreement with the proportions specified in the null hypothesis.

Answer 85

A

The number of individuals from the sample who are classified in a particular category.

Each individual is counted in one and only one category.

Answer 86

A

No Preference/Equal proportions or No Difference from a known population

Answer 87

A

Positively skewed; what affects the distribution?

Answer 88

A

X2 (df, n = ) = #, p < .05

Answer 89

A

You get Regression from Correlation

Answer 90

A

Regression allows you to ‘predict’ a value

Answer 91

A

Y = bX + a

a(y intercept)
b(slope)

Answer 92

A

30.5

.3 * 100 + .5 = 30.5

Answer 93

A

Both variables are measured at the interval level (interval data)
Data must be from a random sample
Normal data distribution (or have a large sample)

Answer 94

A

As the correlation gets closer to +/-1 the SEoE goes down.

The value you are predicting is more accurate

Answer 95

A

Test for Independence involves 2 Variables, Goodness of Fit involves just 1.

Answer 96

A

When it is larger than 2X2, how do you find the df for the formula?
You use the smaller of the two df

Answer 97

A

For 2x2 matrix

Answer 98

A

df = (k-1)(k-1)

Answer 99

A

Allows you to determine how many participants you will need.

Answer 100

A

When you want to compare two groups but they are nonparametric data

Answer 101

A

Wilcoxon rank-order test.

Answer 102

A

Raw scores are converted into ranked scores (i.e., interval/ratio data is transformed into ordinal data) and the medians (not means!) are compared

Answer 103

A

Power increases

Answer 104

A

Power decreases

Answer 105

A

Reducing alpha (making the test MORE stringent) decreases power, thus one-tail has more power

Answer 106

A

Power = 1 - β. “The power of a test is the probability that the test will correctly reject a false null hypothesis”; therefore, decreasing the risk of a Type II error increases the power

Answer 107

A

The two variables are independent (no relationship exists)