Market Research Flashcards

Question

5 steps in hypothesis testing

Answer 1

1. State the hypothesis 2. Choose the appropriate test based on the problem 3. Develop a decision rule 4. Calculate the value of the test statistic/p-value 5. State the conclusion: reject or fail to reject the null hypothesis

Answer 2

A standard for rejecting or failing to reject the null hypothesis. | P-value, Significance Level

Answer 3

Pearson Chi-Square & Value = Test statistic | Pearson Chi-Square & Asymptotic Significance = P-Value

Answer 4

With 95% confidence we can/cannot reject the null hypothesis that there is no relationship between X and Y

Answer 5

Type 1: False Positive | Type 2: False Negative

Answer 6

Type 2: False Negative | Type 1: False Positive

Answer 7

Chi-square Test: use when you want to examine the relationship between two nominal/ordinal variables, or to compare the proportions (nominal/ordinal) of different groups
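
A minimal sketch of the Pearson chi-square statistic for a contingency table, using only plain Python. The 2x2 table of counts is made up for illustration, and the critical value 3.841 is the standard chi-square cutoff for 1 degree of freedom at the 0.05 level.

```python
def chi_square_statistic(observed):
    """Return (Pearson chi-square statistic, degrees of freedom) for a table of counts."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(observed):
        for j, count in enumerate(row):
            # Expected count if the two variables were unrelated
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (count - expected) ** 2 / expected
    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    return statistic, df


# Hypothetical counts: rows = two groups, columns = bought / did not buy
table = [[30, 20], [20, 30]]
stat, df = chi_square_statistic(table)

CRITICAL_DF1_ALPHA_05 = 3.841  # chi-square critical value, df = 1, alpha = 0.05
reject_null = stat > CRITICAL_DF1_ALPHA_05
print(stat, df, reject_null)
```

Comparing the statistic to a critical value is equivalent to comparing the p-value to the significance level; statistical software usually reports the p-value directly.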

Answer 8

Problem Type: Compare the mean of an (interval/ratio) variable to a number | One-Sample T-Test

Answer 9

Problem Type: Compare the mean of an (interval/ratio) variable of different groups (2 groups) | Independent Samples T-Test

Answer 10

Problem Type: Compare the mean of an (interval/ratio) variable of different groups (more than 2 groups) | One-Way ANOVA

Answer 11

Problem Type: Compare the means of two (interval/ratio) variables | Paired Samples T-Test

Answer 12

Compare the mean of an (interval/ratio) variable to a number: One-Sample T-Test | Compare the mean of an (interval/ratio) variable of different groups (2 groups): Independent Samples T-Test | Compare the mean of an (interval/ratio) variable of different groups (more than 2 groups): One-Way ANOVA | Compare the means of two (interval/ratio) variables: Paired Samples T-Test

Answer 13

Null: The mean of the variable does not differ across the categories of a categorical variable with 3+ categories, i.e. mean1 = mean2 = mean3 | Alternative Hypothesis: At least one group mean differs from the others

Answer 14

1. State Hypothesis: Null: There is no difference between the two means 2. Test: Independent samples t-test 3. Decision Rule: 0.05 4. P-Value: 0.199 -> Is there a difference between the *variance* of the two groups? Based on that, you choose which p-value to look at when comparing the means 5. With 95% confidence we fail to reject the null hypothesis XYZ
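
The decision rule and conclusion in steps 3 to 5 can be sketched as a small helper. The function name `conclude` is made up for illustration; the p-value 0.199 and the 0.05 level come from the card.

```python
def conclude(p_value, alpha=0.05):
    """Apply the decision rule: reject the null hypothesis only when p < alpha."""
    verdict = "reject" if p_value < alpha else "fail to reject"
    confidence = f"{1 - alpha:.0%}"  # alpha = 0.05 gives "95%"
    return f"With {confidence} confidence we {verdict} the null hypothesis"


print(conclude(0.199))
```

Because 0.199 is larger than 0.05, the helper returns the "fail to reject" wording used on the card.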

Answer 15

When random sample is >=30 + proportion, metric, one or two means

Answer 16

Paired Samples

Answer 17

Degrees of Freedom

Answer 18

Chi-Square

Answer 19

Related Sample

Answer 20

Z = (Sample Mean - Population Mean under H0) / Estimated Standard Error
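
A short sketch of the Z formula, assuming the estimated standard error is the sample SD divided by sqrt(n). The numbers are invented for illustration. The two-sided p-value uses the standard normal distribution via `math.erfc`.

```python
import math


def z_statistic(sample_mean, mean_h0, sample_sd, n):
    """Z = (sample mean - population mean under H0) / estimated standard error."""
    standard_error = sample_sd / math.sqrt(n)
    return (sample_mean - mean_h0) / standard_error


def two_sided_p_value(z):
    """Two-sided p-value under the standard normal distribution."""
    return math.erfc(abs(z) / math.sqrt(2))


# Hypothetical survey: sample mean 52, H0 mean 50, SD 10, n = 100
z = z_statistic(52.0, 50.0, 10.0, 100)
p = two_sided_p_value(z)
print(z, p)
```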

Answer 21

SD: sqrt[ sum((Xi - Xbar)^2) / (n - 1) ] OR the square root of the variance | SE: SD / sqrt(n)
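
The two formulas translate directly to code. This is a minimal sketch in plain Python; the data values are invented for illustration.

```python
import math


def sample_sd(values):
    """Sample standard deviation: sqrt of the sum of squared deviations over (n - 1)."""
    n = len(values)
    mean = sum(values) / n
    return math.sqrt(sum((x - mean) ** 2 for x in values) / (n - 1))


def standard_error(values):
    """Standard error of the mean: SD / sqrt(n)."""
    return sample_sd(values) / math.sqrt(len(values))


data = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical observations
print(sample_sd(data), standard_error(data))
```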

Answer 22

T-test for independent samples

Answer 23

Test for proportion

Answer 24

Sampling Error

Answer 25

Analysis of Variance

Answer 26

1. C 2. A 3. B

Answer 27

The degree of association between two variables

Answer 28

Criterion: Dependent Variable - explained by the X variable | Predictor: Independent Variable - affects the value of the Y variable

Answer 29

Bivariate regression & Simple regression

Answer 30

Scatterplot

Answer 31

Y= a+bX+e or Y = B0 + B1X

Answer 32

Describes the nature of the relationship between X & Y; a measure of the strength of the linear relationship between X & Y. It measures the percentage of the total variation in Y explained by the variation in X. Ranges from 0 to 1, where 1 is the strongest

Answer 33

(Mean Variation - Unexplained Variation) / Mean Variation
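
A sketch of this ratio in code, assuming "mean variation" means the total squared deviation of Y from its mean and "unexplained variation" means the squared residuals. The data and predictions are invented for illustration.

```python
def r_squared(y, y_pred):
    """(variation around the mean - unexplained variation) / variation around the mean."""
    mean_y = sum(y) / len(y)
    mean_variation = sum((yi - mean_y) ** 2 for yi in y)
    unexplained = sum((yi - pi) ** 2 for yi, pi in zip(y, y_pred))
    return (mean_variation - unexplained) / mean_variation


y = [2.0, 4.0, 5.0, 4.0, 5.0]       # hypothetical observed Y values
y_pred = [2.8, 3.4, 4.0, 4.6, 5.2]  # hypothetical predictions from a fitted line
r2 = r_squared(y, y_pred)
print(r2)
```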

Answer 34

Total Sum of Squares: Total variation | Sum of Squares due to Regression: Explained Variation

Answer 35

The Regression Coefficient | H0: B = 0 Ha: B ≠ 0

Answer 36

-1 <= ρ(X,Y) <= 1 | Weak: less than 0.3 | Moderate: greater than or equal to 0.3 and less than or equal to 0.49 | Strong: greater than 0.49
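
A minimal sketch that computes the Pearson correlation and labels it with the cutoffs on the card. Applying the cutoffs to the absolute value of r, so that negative correlations are labeled too, is an assumption. The data are invented.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    return sxy / math.sqrt(sxx * syy)


def strength(r):
    """Label a correlation using the card's cutoffs (applied to |r|, an assumption)."""
    size = abs(r)
    if size < 0.3:
        return "weak"
    if size <= 0.49:
        return "moderate"
    return "strong"


x = [1, 2, 3, 4, 5]  # hypothetical data
y = [2, 4, 5, 4, 5]
r = pearson_r(x, y)
print(r, strength(r))
```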

Answer 37

Outliers | Effect size may be too small to be a useful r | Non-linear relationships | High correlations are often tautological

Answer 38

Strong positive linear relationship between the way X & Y move | P-Value: The correlation is different from zero

Answer 39

y = a + Bx + e | a = intercept | B = slope | e = random error

Answer 40

Ordinary Least Squares Regression | a hat: the intercept, the value of Y when X is zero | b hat: the slope, the estimated change in the average value of Y as a result of a one-unit change in X | e is the cumulative difference between the regression line and the points
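
A sketch of how a hat and b hat can be computed for a single X variable, using the standard closed-form least squares formulas. The data are invented for illustration.

```python
def ols_fit(x, y):
    """Return (a_hat, b_hat) for the least squares line y = a_hat + b_hat * x."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    b_hat = sxy / sxx                 # slope: change in average Y per one-unit change in X
    a_hat = mean_y - b_hat * mean_x   # intercept: fitted Y when X is zero
    return a_hat, b_hat


x = [1, 2, 3, 4, 5]  # hypothetical data
y = [2, 4, 5, 4, 5]
a_hat, b_hat = ols_fit(x, y)
residuals = [yi - (a_hat + b_hat * xi) for xi, yi in zip(x, y)]
print(a_hat, b_hat, residuals)
```

The residuals are the vertical gaps between each point and the fitted line; least squares picks the line that makes their squared sum as small as possible.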

Answer 41

R-Squared indicates how well the regression line fits the data, and the more variables that are in the model, the higher the R-Squared

Answer 42

Lose confidence in the predictions when the results fall outside the current range of X

Answer 43

Type A: Long term data, maximize profit for an existing product | Type B: Short term data, increase visibility of a just-launched product | Type C: No data, predict how a new product will perform

Answer 44

Within Group: Does the mean differ from a benchmark? One-Sample T-Test; Does the mean of X differ from the mean of Y? Paired Samples T-Test | Between Groups: Do frequencies differ between groups? Chi-Square Test; Does the mean of X differ between 2 groups? Independent Samples T-Test; Does the mean of X differ between 3+ groups? ANOVA

Answer 45

When your independent variables are highly collinear with each other | Look at the correlation matrix of the independent variables | Bad because we cannot distinguish between the individual effects of the independent variables on the dependent variable

Answer 46

Get more data | Don't include all of the independent variables | Drop the correlated variables | Or combine them to create a new variable (Factor Analysis)

Answer 47

0 or 1 to let us know if there is or isn't the presence of a categorical value

Answer 48

A categorical variable should be recoded into a dummy variable in regression analysis

Answer 49

K-1 for k categories

Answer 50

Reference group | Gender: X1 = 1 if woman, 0 otherwise & X2 = 1 if man, 0 otherwise. The reference group would be non-binary
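
A small sketch of dummy coding with a reference group: k categories become k-1 columns of 0/1, and the reference group is the row where every dummy is 0. The helper name `dummy_code` and the `is_` column prefix are made up for illustration.

```python
def dummy_code(values, reference):
    """Recode a categorical variable into k-1 dummy columns, omitting the reference group."""
    categories = sorted(set(values) - {reference})
    return [{f"is_{c}": int(v == c) for c in categories} for v in values]


genders = ["woman", "man", "non-binary"]
rows = dummy_code(genders, reference="non-binary")
for gender, row in zip(genders, rows):
    print(gender, row)
```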

Answer 51

The average annual spending on clothes for women at the age of 0 is $200 | If the age increases by one year, average spending on clothes increases by $20 | Men on average spend $50 less than women for every year
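
The interpretation above corresponds to a fitted equation of the form spending = 200 + 20 * age - 50 * male, where male is a dummy (1 for men, 0 for women). A sketch of using it for prediction; the function name and the example age of 30 are made up for illustration.

```python
def predicted_spending(age, is_male):
    """Predicted annual clothing spend: intercept 200, +20 per year of age, -50 for men."""
    return 200 + 20 * age - 50 * is_male


woman_30 = predicted_spending(30, is_male=0)
man_30 = predicted_spending(30, is_male=1)
print(woman_30, man_30, woman_30 - man_30)
```

At any given age the two predictions differ by exactly the dummy coefficient, which is why the slope keeps the same interpretation for both groups.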

Answer 52

The slope has the same interpretation | Alpha: when [the reference group] is activated, then [alpha value] is [Y variable] | Beta coefficient: compared to the [reference group], the average [dummy variable] in/decreases by [beta coefficient value]

Answer 53

It adjusts for more variables

Answer 54

when it is more than 2 times the standard error it is a good fit

Answer 55

Variance Inflation Factor: gives a measure of multicollinearity, which is caused by too many variables. Keep it below 10
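
A sketch of the usual VIF formula, VIF = 1 / (1 - R-squared), where R-squared comes from regressing one independent variable on all the others. The R-squared inputs below are invented; the cutoff of 10 is the one on the card.

```python
def vif(r_squared_j):
    """Variance inflation factor for one predictor, given its R-squared against the other predictors."""
    return 1.0 / (1.0 - r_squared_j)


def is_acceptable(r_squared_j, limit=10.0):
    """True when the VIF stays below the card's cutoff of 10."""
    return vif(r_squared_j) < limit


print(vif(0.5), is_acceptable(0.5))
print(vif(0.95), is_acceptable(0.95))
```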

Answer 56

1. Calculate retail price: Retail Price = Wholesale * (1 + 10%) 2. Put the price into the regression model 3. Calculate profit: Profit = (Retail Price - MC) * Sales | The highest profit is your choice
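
The three steps can be sketched as follows. The regression model here (sales = 1000 - 50 * price), the marginal cost of 5, and the candidate wholesale prices are all hypothetical stand-ins; only the 10% markup and the profit formula come from the card.

```python
def retail_price(wholesale, markup=0.10):
    """Step 1: retail price = wholesale * (1 + 10%)."""
    return wholesale * (1 + markup)


def predicted_sales(price, intercept=1000.0, slope=-50.0):
    """Step 2: plug the price into a (hypothetical) fitted regression model."""
    return intercept + slope * price


def profit(wholesale, marginal_cost):
    """Step 3: profit = (retail price - MC) * predicted sales."""
    price = retail_price(wholesale)
    return (price - marginal_cost) * predicted_sales(price)


candidates = [8.0, 10.0, 12.0, 14.0]  # hypothetical wholesale prices to compare
best = max(candidates, key=lambda w: profit(w, marginal_cost=5.0))
print(best, profit(best, marginal_cost=5.0))
```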

Answer 57

Prediction would not be exactly 0 or 1 but some continuous number | Predictions could be outside the range of [0,1]

Answer 58

Dependent: Outcome is binary | Independent: What do you think can predict the outcome

Answer 59

ln(p / (1 - p)) = a + B1X1 + ... + BkXk | p = exp(a + B1X1 + ... + BkXk) / (1 + exp(a + B1X1 + ... + BkXk)) 0
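
A minimal sketch of the two logistic regression formulas: the log-odds as a linear function of the X variables, and the inverse that turns it back into a probability. The coefficient values are invented for illustration.

```python
import math


def logit(p):
    """Log-odds: ln(p / (1 - p))."""
    return math.log(p / (1 - p))


def predicted_probability(a, coefficients, x_values):
    """p = exp(a + B1X1 + ... + BkXk) / (1 + exp(a + B1X1 + ... + BkXk))."""
    linear = a + sum(b * x for b, x in zip(coefficients, x_values))
    return math.exp(linear) / (1 + math.exp(linear))


# Hypothetical model with two predictors
p = predicted_probability(-1.0, [0.5, 0.25], [2.0, 4.0])
print(p, logit(p))
```

Unlike a linear regression on a 0/1 outcome, the predicted probability here always falls strictly between 0 and 1.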