Final Exam Flashcards

Question

N (ANOVA)

Answer 1

total number of data points in all groups

Answer 2

number of observations in group i

Answer 3

k - 1, where k = number of groups

Answer 4

N - k, where N = total number of data points and k = number of groups

Answer 5

F with subscripts alpha(number of tails), numerator df = k - 1, denominator df = N - k

Answer 6

the group portion of the variation, expressed as a fraction of the total: R^2 = SSgroups / SStotal. When R^2 is close to 0, the group means are all very similar and most of the variation is within groups. When R^2 is close to 1, most of the variation is explained by the explanatory variable.
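The ANOVA quantities in the answers above (N, the group sizes, the degrees of freedom k - 1 and N - k, and R^2) can be sketched in a few lines of Python. The data values are hypothetical, invented only for illustration, and the F-ratio line uses the standard definition MSgroups / MSerror, which the cards themselves do not spell out.

```python
# Hypothetical data: three groups of measurements (values invented for illustration).
groups = [
    [4.0, 5.0, 6.0],
    [7.0, 8.0, 9.0],
    [1.0, 2.0, 3.0],
]

k = len(groups)                      # number of groups
N = sum(len(g) for g in groups)      # total number of data points in all groups
grand_mean = sum(sum(g) for g in groups) / N

# Variation among groups: n_i * (group mean - grand mean)^2, summed over groups.
ss_groups = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
# Variation within groups: squared deviations of each point from its own group mean.
ss_error = sum((y - sum(g) / len(g)) ** 2 for g in groups for y in g)
ss_total = ss_groups + ss_error

df_groups = k - 1                    # numerator degrees of freedom
df_error = N - k                     # denominator degrees of freedom

# Standard F-ratio (an assumption here: the cards only name the critical value).
f_ratio = (ss_groups / df_groups) / (ss_error / df_error)
r_squared = ss_groups / ss_total     # fraction of variation explained by groups

print(N, k, df_groups, df_error, f_ratio, r_squared)
```

With these made-up numbers the group means are far apart relative to the scatter within each group, so R^2 comes out close to 1.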

Answer 7

the probability of obtaining a test statistic as large as or larger than (as extreme as or more extreme than) the observed value, assuming Ho is true

Answer 8

the method used to predict values of one numerical variable from values of another.

Answer 9

Draws a straight line through the data to predict the response variable from the explanatory variable (One type of study design)

Answer 10

indicates the rate of change

Answer 11

Measures aspects of the linear relationship between two numerical variables

Answer 12

Regression: fits a line through the data to predict one variable from another and to measure how steeply one variable changes with changes in the other. Correlation: measures the strength of association between two variables, reflecting the amount of scatter in the data.

Answer 13

The relationship between the two variables is linear

Answer 14

Has the smallest deviations in Y (vertical axis, response var) between the data points and the regression line

Answer 15

the line for which the sum of all the squared deviations in Y is the smallest.

Answer 16

Y = a + bX, where Y is the response variable, X is the explanatory variable, a is the Y-intercept, and b is the slope of the regression line. If b is positive, larger values of X predict larger values of Y; if b is negative, larger values of X predict smaller values of Y.
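As a rough illustration of the least-squares line Y = a + bX described above, the sketch below computes b and a from the usual sums of products and squares. The x and y values are hypothetical, and the variable names are chosen only for this example.

```python
# Hypothetical paired observations (invented for illustration).
x = [1.0, 2.0, 3.0, 4.0, 5.0]   # explanatory variable
y = [2.0, 4.0, 5.0, 4.0, 5.0]   # response variable

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n

# Sum of products of deviations and sum of squared deviations in X.
sum_products = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
sum_squares_x = sum((xi - x_bar) ** 2 for xi in x)

b = sum_products / sum_squares_x   # slope: change in Y per unit of X
a = y_bar - b * x_bar              # intercept: the line passes through (x_bar, y_bar)

print(f"Y = {a:.2f} + {b:.2f} X")
```

Because b is positive for these numbers, larger X values predict larger Y values.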

Answer 17

rate of change in Y per unit of X

Answer 18

a and b are sample estimates; alpha and beta are the population parameters

Answer 19

points on the line that correspond to specific values of X. The predicted value of Y from a regression line (Y-hat) estimates the mean value of Y for all individuals having a given value of X.

Answer 20

Plug the X value into the equation to find the Y-hat

Answer 21

Measure the scatter of points above and below the least-squares regression line. Crucial for evaluating the fit of the line to the data.

Answer 22

Quantifies the spread of the scatter of points above and below the line.

Answer 23

measure the precision of the predicted mean Y for each value of X

Answer 24

measure the precision of the predicted single Y-value for each X.
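The four answers above (scatter around the line, its spread, and the precision of a predicted mean Y versus a single predicted Y) can be tied together in one sketch. It assumes the standard textbook formulas: residual = Y - Y-hat, MSresidual = SSresidual / (n - 2), and the two standard errors shown in the comments. The data and the chosen value x0 are hypothetical.

```python
import math

# Hypothetical paired observations (invented for illustration).
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.0, 4.0, 5.0, 4.0, 5.0]

n = len(x)
x_bar = sum(x) / n
y_bar = sum(y) / n
sum_squares_x = sum((xi - x_bar) ** 2 for xi in x)
b = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / sum_squares_x
a = y_bar - b * x_bar

# Residuals: vertical deviations of each point from the fitted line.
residuals = [yi - (a + b * xi) for xi, yi in zip(x, y)]

# Residual mean square: spread of the points above and below the line.
ms_residual = sum(r ** 2 for r in residuals) / (n - 2)

x0 = 3.0  # hypothetical X value at which to assess precision
# Standard error of the predicted MEAN Y at x0.
se_mean = math.sqrt(ms_residual * (1 / n + (x0 - x_bar) ** 2 / sum_squares_x))
# Standard error of a SINGLE predicted Y at x0 (adds individual scatter).
se_single = math.sqrt(ms_residual * (1 + 1 / n + (x0 - x_bar) ** 2 / sum_squares_x))

print(ms_residual, se_mean, se_single)
```

The standard error for a single Y is always larger than that for the mean Y, which is why intervals for individual predictions are wider than those for the predicted mean.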

Answer 25

Attempting to predict the Y value for X values beyond the range of the data
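One way to guard against this when plugging X into the regression equation is to refuse any X outside the observed range. The function below is a hypothetical helper written for this sketch, not part of any library, and the coefficients and range passed to it are made-up values.

```python
def predict(a, b, x, x_min, x_max):
    """Return Y-hat = a + b * x, refusing X values outside the observed range."""
    if not x_min <= x <= x_max:
        raise ValueError(
            f"X = {x} is outside the observed range [{x_min}, {x_max}]"
        )
    return a + b * x


# Hypothetical fitted line Y = 2.2 + 0.6 X, with data observed for X in [1, 5].
y_hat = predict(2.2, 0.6, 3.0, x_min=1.0, x_max=5.0)
print(y_hat)
```

Calling `predict(2.2, 0.6, 10.0, x_min=1.0, x_max=5.0)` raises `ValueError` instead of returning an unsupported prediction.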

Answer 26

Compares each observation in the sample with its quantile expected from the standard normal distribution. Points should fall roughly along a straight line if the data come from a normal distribution.
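The coordinates of such a plot can be sketched with Python's standard library. The plotting position (i - 0.5) / n used here is one common convention and is an assumption of this example, as are the data values.

```python
from statistics import NormalDist

# Hypothetical sample (invented for illustration).
data = [2.3, 1.9, 3.1, 2.8, 2.5]

observed = sorted(data)
n = len(observed)

# Expected standard normal quantile for the i-th smallest observation,
# using the plotting position (i - 0.5) / n (one convention among several).
standard_normal = NormalDist(mu=0.0, sigma=1.0)
expected = [standard_normal.inv_cdf((i - 0.5) / n) for i in range(1, n + 1)]

# Each pair is one point on the plot: (expected quantile, observed value).
points = list(zip(expected, observed))
for q, value in points:
    print(f"{q:6.3f}  {value}")
```

If the sample came from a normal distribution, plotting these pairs should give points lying roughly along a straight line.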

Answer 27

1. Ignore the violation of assumptions: methods that compare means work well when the normality assumption is violated, especially when the sample size is large and the violations are not too drastic.
2. Transform the data: often effective.
3. Use a non-parametric method.
4. Use a permutation test: uses a computer to generate a null distribution for a test statistic.
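The fourth option above, the permutation test, can be sketched as follows. This is a minimal version that assumes the test statistic is the absolute difference between two group means; the function name, the data, and the number of permutations are all choices made for this example.

```python
import random


def permutation_test(group_a, group_b, n_permutations=5000, seed=0):
    """Approximate two-sided P-value for a difference in means by random shuffling."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    n_a = len(group_a)
    observed = abs(sum(group_a) / n_a - sum(group_b) / len(group_b))

    pooled = list(group_a) + list(group_b)
    n_b = len(pooled) - n_a
    as_extreme = 0
    for _ in range(n_permutations):
        # Shuffling the pooled values reassigns them to groups at random,
        # which is what the null hypothesis of "no group difference" implies.
        rng.shuffle(pooled)
        diff = abs(sum(pooled[:n_a]) / n_a - sum(pooled[n_a:]) / n_b)
        if diff >= observed - 1e-12:  # small tolerance for floating-point ties
            as_extreme += 1
    return as_extreme / n_permutations


# Hypothetical measurements for two groups (invented for illustration).
group_a = [10.1, 11.2, 9.8, 10.5, 11.0, 10.7]
group_b = [5.2, 4.8, 6.1, 5.5, 5.0, 5.9]
p_value = permutation_test(group_a, group_b)
print(p_value)
```

The shuffled differences form the null distribution, and the P-value is the fraction of them at least as extreme as the observed difference.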

Answer 28

evaluates the goodness of fit of a normal distribution to a set of data randomly sampled from a population

Answer 29

A statistical procedure is robust if the answer it gives is not sensitive to violations of the assumptions of the method

Answer 30

changes each measurement by the same mathematical formula.

Answer 31

- Used for ratios or products of variables
- Used when the frequency distribution is skewed to the right
- Used when the group with the larger mean also has the larger standard deviation
- Used when the data span several orders of magnitude

Answer 32

Used almost exclusively on data that are proportions

Answer 33

Used on count data.
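Assuming the last three answers describe the log, arcsine, and square-root transformations respectively (the card fronts are not shown, so this mapping is an inference), each one applies the same formula to every measurement. The sketch below uses common textbook forms; the + 0.5 in the square-root version is one convention, and plain sqrt(Y) is also used.

```python
import math


def log_transform(y):
    """Natural log; requires y > 0."""
    return math.log(y)


def arcsine_transform(p):
    """Arcsine of the square root; p must be a proportion between 0 and 1."""
    return math.asin(math.sqrt(p))


def sqrt_transform(count):
    """Square root of (count + 1/2); the + 1/2 is one textbook convention."""
    return math.sqrt(count + 0.5)


# Hypothetical measurements (invented for illustration).
ratios = [0.5, 2.0, 40.0, 1500.0]
proportions = [0.05, 0.5, 0.95]
counts = [0, 3, 12]

print([round(log_transform(y), 3) for y in ratios])
print([round(arcsine_transform(p), 3) for p in proportions])
print([round(sqrt_transform(c), 3) for c in counts])
```

The same function is applied to every data point in every group before the analysis is run on the transformed values.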

(58 cards)