Final Exam- Pearson's Correlation Flashcards
Why Screen Data?
- Avoiding erroneous conclusions by checking accuracy of data
- Use SPSS (PASW) frequency procedure
- Avoiding missing data (from entry, participants, equipment, etc.)
- Avoiding extreme values (outliers): values so extreme that they distort results
- Meeting assumptions of particular tests
Stem and Leaf Display
Like a grouped frequency distribution without loss of information
- Stem: the intervals on the left
- Leaf: the digits on the right side; each leaf is one score, so the count of leaves shows the frequency
Why does data go missing?
- Measurement Equipment Fails
- Participants do not complete all trials or all items
- Errors occur during data entry
Missing Data
If missing data are not randomly distributed, there can be systematic problems
What do you do with missing data?
- Analyze difference between groups (those with missing and those without)
- Delete cases and/or items
- Estimate missing values using
- Prior knowledge
- Calculating means using available data
- Use regression analyses to predict values
How do we find missing data?
1. Analyze -> Descriptive Statistics -> Frequencies
2. Analyze -> Descriptive Statistics -> Explore
Replacing Missing Data
1. Transform -> Replace Missing Values
2. Choose to replace with the series mean, the mean (or median) of nearby points, or other imputations
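The cards describe SPSS's Replace Missing Values menu; purely as an illustration (not the SPSS procedure itself), series-mean imputation can be sketched in plain Python with hypothetical data:

```python
# Hypothetical sketch of "series mean" imputation, mirroring one option
# in SPSS's Transform -> Replace Missing Values menu.
scores = [4.0, None, 3.5, 5.0, None, 4.5]  # None marks missing entries

observed = [x for x in scores if x is not None]
series_mean = sum(observed) / len(observed)  # mean of the available data

imputed = [series_mean if x is None else x for x in scores]
print(imputed)  # [4.0, 4.25, 3.5, 5.0, 4.25, 4.5]
```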
Causes For Outliers
- Data-Entry Errors were made by the researcher
- The participant is not a member of the population for which the sample is intended
- The participant is simply different from the remainder of the sample
Why are outliers problematic?
- Can have disproportionate influence on results (many tests take squared deviations from mean)
- Statistical Tests are sensitive to outliers
- Can create Type I and Type II errors
How do we identify outliers in SPSS?
- Explore Menu (under Descriptive Statistics) can give you frequencies, highest and lowest scores, boxplots, and stem and leaf plots.
What should you do with outliers?
1. Conduct analyses with and without the outliers
2. Some outliers are of interest (e.g., they can call attention to a poorly worded question)
Are data normal?
Examine both univariate (individual variables) and multivariate (combination of variables) normality
Ways to assess normality
- Skewness: Degree of symmetry of a distribution around the mean
- Kurtosis: Degree of peakedness of distribution
- When the distribution is normal, the values of both equal zero
- Kolmogorov-Smirnov statistic
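A minimal Python sketch of the moment-based skewness and excess-kurtosis statistics described above (both equal zero for a normal distribution), using made-up data:

```python
import statistics

# Moment-based skewness and excess kurtosis; both equal zero
# for a perfectly normal distribution.
def skewness(xs):
    n, m = len(xs), statistics.fmean(xs)
    m2 = sum((x - m) ** 2 for x in xs) / n
    m3 = sum((x - m) ** 3 for x in xs) / n
    return m3 / m2 ** 1.5

def excess_kurtosis(xs):
    n, m = len(xs), statistics.fmean(xs)
    m2 = sum((x - m) ** 2 for x in xs) / n
    m4 = sum((x - m) ** 4 for x in xs) / n
    return m4 / m2 ** 2 - 3

symmetric = [1, 2, 3, 4, 5]
print(skewness(symmetric))         # 0.0: symmetric around the mean
print(excess_kurtosis(symmetric))  # negative: flatter than normal (platykurtic)
```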
Kolmogorov-Smirnov statistic
Tests the null hypothesis that the population is normally distributed
-Significance of this test indicates non-normal data
Normal distribution
A symmetrical, bell-shaped distribution having half the scores above the mean and half the scores below the mean
- Most of the scores are clustered near the middle of the continuum of observed scores
- Resembles bell shaped curve
Variability
The extent to which scores spread out around the mean
Range
A measure of variability that is computed by subtracting the smallest score from the largest score
Variance
A single number that represents the total amount of variation in a distribution
Standard Deviation
The standard deviation is the square root of the variance. It has important relations to the normal curve.
- Most commonly used measure of dispersion
- Approximately how far on the average a score is from the mean
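These definitions can be checked with Python's standard library (the scores are hypothetical):

```python
import statistics

scores = [2, 4, 4, 4, 5, 5, 7, 9]
var = statistics.pvariance(scores)  # population variance
sd = statistics.pstdev(scores)      # standard deviation = square root of variance
print(var, sd)  # 4 2.0
```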
Skewed Distribution
Most of the scores are clustered on one end of the continuum
- Positively skewed: scores cluster at the lower end of the continuum (higher than zero statistic)
- Negatively skewed: scores cluster at the higher end of the continuum (lower than zero statistic)
Kurtosis
Measure of the degree of peakedness of a distribution
Leptokurtosis
Distribution is too peaked with thin tall (higher than zero statistic)
Platykurtosis
Distribution is too flat with many cases in the tail(s) (lower than zero statistic)
Multimodal shapes
Scores tend to congregate around more than one point
Bimodal shapes
scores are clustered in two places
Trimodal shapes
Scores are clustered in three places
Mode
Most frequently occurring score
Median
Midpoint
- Identifying the value that splits the distribution into two halves, each half having the same number of values.
- Best measure of central tendency when the distribution includes extreme scores because it is less influenced by the extreme scores than is the mean
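A quick illustration of the median's resistance to extreme scores, using made-up data:

```python
import statistics

scores = [10, 12, 13, 14, 90]         # 90 is an extreme score
print(statistics.median(scores))      # 13 -- unaffected by the outlier
print(statistics.mean(scores))        # 27.8 -- pulled upward by 90
```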
Mean
Average
-Most commonly reported measure of central tendency and is determined by dividing the sum of the scores by the number of scores contributing to that sum
Range
Difference between the highest and lowest scores
Interquartile range
The spread of the middle 50% of the scores
- Upper quartile: top 25%
- Lower quartile: bottom 25%
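A sketch of the interquartile range in Python (the `inclusive` quantile method used here is one of several conventions; the data are hypothetical):

```python
import statistics

scores = [1, 2, 3, 4, 5, 6, 7, 8, 9]
q1, q2, q3 = statistics.quantiles(scores, n=4, method="inclusive")
iqr = q3 - q1  # spread of the middle 50% of scores
print(q1, q3, iqr)  # 3.0 7.0 4.0
```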
Box-and-whisker plot
Summarizes the degree of variability with a picture
- “Box”: middle 50% of scores
- “Whiskers”: extend to the highest/lowest score, to 1.5 times the height of the box, or to the 5th and 95th percentiles
- Line in the middle corresponds with median
- Helps identify outliers
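One common boxplot convention flags scores beyond 1.5 times the IQR from the quartiles as outliers; a hypothetical sketch:

```python
import statistics

# Flag scores beyond 1.5 * IQR from the box edges -- one common
# boxplot convention for marking outliers.
scores = [1, 2, 3, 4, 5, 6, 7, 8, 9, 30]
q1, _, q3 = statistics.quantiles(scores, n=4, method="inclusive")
iqr = q3 - q1
low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
outliers = [x for x in scores if x < low or x > high]
print(outliers)  # [30]
```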
Outliers
Scores that lie far away from the data set
- Can lead to underestimating or overestimating a relationship
Why do outliers occur?
- Sabotage
- Misunderstandings
- Extreme thinking
- Data Entry
- Participant is not part of population from which sample is intended
- Participant is different from rest of sample
How can we address violations of normality assumption?
Data transformations
Data transformations
- Application of mathematical procedures to make the data appear more normal
- Several different types of transformation exist. Appropriate one depends on shape of data.
Linearity
Assumption that there is a straight-line relationship between two variables
-Important because most statistical tests only capture linear relationships
How do we assess linearity?
Residuals
Bivariate Scatterplots
Residuals
Examining the differences between the predicted values and the plotted (actual) values
-Also known as prediction errors
Bivariate Scatterplots
Subjective method of assessing linearity
Homogeneity of Variance
- Variance between groups is similar
- Assessed using Levene’s test
- If significant at the 0.05 level, homogeneity of variance cannot be assumed
- Done using General Linear Model menu
Homoscedasticity (with two continuous variables)
Assumption that the variability in scores for one continuous variable is roughly the same at all values of another continuous variable
Heteroscedasticity
- This violation of the assumption of homoscedasticity can be assessed through the examination of the bivariate scatterplots
- This violation will not prove fatal to an analysis
Line Graph
A graph that is frequently used to depict the results of an experiment. The vertical or y axis is known as the ordinate and the horizontal or x axis is known as the abscissa.
Correlational study
Measurement and determination of the relation between two variables
- Used when data on two variables are available, but the variables can only be measured, not manipulated.
- Cannot determine cause-and-effect
- Correlation Coefficient
- Strength: Number
- Direction: Sign
Pearson Product-Moment Correlation Coefficient (r)
- This type of correlation coefficient is calculated when both the X variable and the Y variable are interval or ratio scale measurements and the data appear to be linear
- Other correlation coefficients can be calculated when one or both of the variables are not interval or ratio scale measurements or when the data do not fall on a straight line.
- Involves two ratio or interval variables
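Pearson's r can be computed directly from its definitional formula (covariance divided by the product of the standard deviations); a self-contained sketch with made-up data:

```python
import math

# Pearson's r from its definitional formula:
# covariance over the product of the standard deviations.
def pearson_r(xs, ys):
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

print(pearson_r([1, 2, 3, 4], [2, 4, 6, 8]))  # 1.0: perfect positive
print(pearson_r([1, 2, 3, 4], [8, 6, 4, 2]))  # -1.0: perfect negative
```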
Correlation matrix
Used when we have multiple correlations. Summarizes all correlations
Different types of correlation
- Pearson’s Product-Moment Correlation (Pearson’s r)
- Spearman’s Rho
- Coefficient of Determination
Spearman’s Rho
Calculated for ordinal data
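Spearman's rho can be sketched as a rank-based computation; the classic difference-of-ranks formula below assumes no tied scores (the data are hypothetical):

```python
# Spearman's rho via the classic difference-of-ranks formula,
# valid when there are no tied scores.
def ranks(xs):
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    r = [0] * len(xs)
    for rank, i in enumerate(order, start=1):
        r[i] = rank
    return r

def spearman_rho(xs, ys):
    rx, ry = ranks(xs), ranks(ys)
    n = len(xs)
    d2 = sum((a - b) ** 2 for a, b in zip(rx, ry))
    return 1 - 6 * d2 / (n * (n ** 2 - 1))

print(spearman_rho([10, 20, 30, 40], [1, 3, 2, 4]))  # 0.8
```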
Coefficient of Determination
- Result when r is squared
- Indicates proportion of variability in one variable that is associated with another variable
- Multiply the result by 100 to get the percentage of explained variability (or shared variance)
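A one-line illustration of squaring r and converting to a percentage:

```python
r = 0.50
r_squared = r ** 2                 # coefficient of determination
shared_variance = r_squared * 100  # percentage of explained variability
print(r_squared, shared_variance)  # 0.25 25.0
```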
Strengths of r (effect size)
- 0.10 (or -0.10): small or weak
- 0.30 (or -0.30): medium or moderate
- 0.50 (or -0.50): large or strong
Covariance
An association establishes that A covaries with B (the two variables vary together)
Temporal precedence (directionality problem)
Do we know which one came first in time?
Did A -> B
or Did B -> A
If we cannot tell which came first, we cannot infer causation.
Internal validity (third-variable problem)
Is there a C variable that is associated with both A and B, independently?
- If there is a plausible third variable, we cannot infer causation.
Problems with correlation
- Cause and effect?
- Directionality
- Third Variable Problem
Pie Chart
Graphical representation of the percentage allocated to each alternative as a slice of a circle
Bar Graph
A graph in which the frequency for each category of a qualitative variable is represented as a vertical column. The columns of a bar graph do not touch.
Histogram
A graph in which the frequency for each category of a quantitative variable is represented as a vertical column that touches the adjacent column.
Frequency Polygon
A graph that is constructed by placing a dot in the center of each bar of a histogram and then connecting the dots.
Data Analysis for an Experiment Comparing Means
- Getting to know the data
- Summarizing the data
- Using Confidence Intervals to Confirm what the Data Reveal
Measures of central tendency
Mean, median, mode
-indicate the score that the data tend to center around
Measure of dispersion (variability)
Indicate the breadth, or variability, of the distribution
- Range
- Standard deviation
Standard error of the mean
The standard deviation of the theoretical sampling distribution of the mean
-Our ability to estimate the population mean on the basis of a sample depends on the size of the sample and on the variability in the population from which the sample was drawn, as estimated by the sample standard deviation
Estimated standard error of the mean
Typically, we do not know the standard deviation of the population, so we estimate it using the sample standard deviation (s)
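A sketch of the estimated standard error: divide the sample standard deviation (s, computed with n - 1 in the denominator) by the square root of the sample size (the scores are hypothetical):

```python
import math
import statistics

scores = [4, 6, 8, 10]
s = statistics.stdev(scores)      # sample standard deviation (n - 1 denominator)
sem = s / math.sqrt(len(scores))  # estimated standard error of the mean
print(sem)
```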