PSYU2248 Design & Statistics Flashcards

Question

What is a regression line / line of best fit?

Answer 1

A regression line is an estimate of the line that describes the true, but unknown, linear relationship between the two variables.

Answer 2

A linear regression line has an equation of the form Y = a + bX, where X is the explanatory variable and Y is the dependent variable. The slope of the line is b, and a is the intercept (the value of y when x = 0).

Answer 3

There is no relationship between the two variables.

Answer 4

There is a strong linear relationship between the variables.

Answer 5

There is a negative strong linear relationship between the variables.

Answer 6

The line of best fit, or the regression line is straight.

Answer 7

Curvilinear relationship.

Answer 8

Research questions that ask for a prediction. prediction = regression Or if you were looking at a particular variable in a data set and the relationship in the results, it would be the dependent variable. Trying to understand variability in the DV. So if we have a single outcome DV question and many IV and we are looking at the relationship. If we have two variables, and we are looking at a relationship this could be a correlation type question. relationship=correlational

Answer 9

Y(hat)=a+betaX

Answer 10

Predicted IV score when x = 0, y intercept when x = 0

Answer 11

Beta is the slope of the regression line How much we would predict the DV to change per unit increase in the IV. Positive or neg relationship

Answer 12

F is about the whole regression model, predicting the DV, can be simple or multiple IV T is a single IV or a single predictor

Answer 13

The proportion of the variation in the DV that is predictable from the IV The EXPLAINED variance, how much variance in the DV we are explaining in our regression model. Measure of effect size, how much variance in the DV is explained

Answer 14

33% of the variation in PWB is explained from internet addiction. That much variance of your DV (33%) is explained by your IV /s. Large effect.

Answer 15

The slope in the regression line, negative or downwards slope, negative effect. As internet addition increases PWB decreases For every one point increase in the IV (internet addiction) the DV would decrease by .613 units.

Answer 16

P value will tell us if we have a significant result. So we have a statistically significant effect, we get this from the p value, that corresponds to the t statistic (degrees of freedom) so the effect of internet addition is statistically significant, significantly in predicting PWB

Answer 17

Whether the regression model as a whole is significant. Telling us the same thing as the t value. value, that corresponds to the t statistic (degrees of freedom) so the effect of internet addition is statistically significant, significantly in predicting PWB

Answer 18

1 Independence of observations (residuals) 2 Normal distribution of residuals 3 Homoscedacity (AKA constant variance AKA homogeneity of variance) 4 Linearity 5 No collinearity (only applies to multiple regression)

Answer 19

The assumptions of normality in regression is that the residuals are normally distributed. Caveat: This is what the actual regression model built around, the variables feed into the model, but there are multiple variables in a model, whereas the residuals are the more important bit as they are the bones of the regression model, they are the data points around the regression line.

Answer 20

DV - support for redistribution IV - political preference Used to run the regression The DV values you expect to get for that value of X - pred variable Example - Based on our regression line if someone had a political preference score of 5 we predict their predict for redistribution would be 3.75 - predicted valuables - resid variable - number 7 political preference of 4 we would predict their support for redistribution is a score of 4.05 and their actual score is 4.25. Take home message: Not predicting the DV using scores on the IV, you will not perfectly predict anything, how well we predict is partly what the regression model is telling us. How well we are predicting scores in the DV based on scores on the IV, is actually what the regression model tells us that is also what the r2 tells us, its how much variance in the DV we're actually predicting / explaining purely using scores on the IV. We never perfectly everything. How big that difference is for any individual person is what the residual is telling us. It tells us how big the discrepancy is between their predicted score of Y and their actual score of Y.

Answer 21

Dichotomous variables are nominal variables which have only two categories or levels. For example, if we were looking at gender, we would most probably categorize somebody as either "male" or "female". This is an example of a dichotomous variable (and also a nominal variable).

Answer 22

a numerical variable and a dichotomous variable The Point-Biserial Correlation is a special case of the Pearson Correlation and is used when you want to measure the relationship between a continuous variable and a dichotomous variable, or one that has two values (i.e. male/female, yes/no, true/false).

Answer 23

A paired t-test is designed to compare the means of the same group or item under two separate scenarios. ie rating of the same restaurant before and after COVID, same restaurant. An independent (unpaired) t-test compares the means of two independent or unrelated groups. In an unpaired t-test, the variance between groups is assumed to be equal. ie two different restaurants ratings of food quality In a paired t-test, the variance is not assumed to be equal.

Answer 24

When we are interested in the difference between two variables for the same subject.

Answer 25

Degrees of freedom refer to the maximum number of logically independent values, which may vary in a data sample. It's calculated as the sample size minus the number of restrictions. Found in a t-test under the t-value.

Answer 26

ANOVA, which stands for Analysis of Variance, is a statistical test used to analyse the difference between the means of more than two groups. A one-way ANOVA uses one independent variable, while a two-way ANOVA uses two independent variables.

Answer 27

The t-test is a method that determines whether two populations are statistically different from each other, whereas ANOVA determines whether three or more populations are statistically different from each other.

Answer 28

One of the primary benefits of the ANOVA test is its ability to compare means across three or more groups simultaneously. Instead of conducting multiple t-tests for each pair of groups, ANOVA allows researchers to analyse the variations between all groups in one comprehensive test.

Answer 29

The correlation coefficient quantifies the relationship between two variables It can be positive, negative or zero (no relationship at all) Just because two things are correlated does not mean that one caused the other The size of the correlation coefficient tells us the strength of the relationship between the two variables

Answer 30

Correlation, where is there an association between the two variables

Answer 31

If the research question asks for a prediction this leads to a regression analysis, if we're trying to understand variability in the dependent variable, and we're trying to use information on other variable or some other variables in order to predict or explain that, then we might have a regression type analysis. If we have a single outcome variable and a number of IV that could be a regression type question. If the research question asks about a relationship, this leads to a correlational analysis, if we are looking at two variables and we are looking for an association or a relationship between them that could be a correlation type question.

Answer 32

Ŷ=a+βx * Ŷ is the DV * a is the predicted DV score when x=0 * β is the o slope, regression line, gradient o how much we would expect the DV to change per unit increase in the IV o positive or negative - Does the dependent variable increase as the independent variable increases? That would be a positive relationship. Or does the dependent variable decrease as the independent variable increases? That would be a negative relationship. * x is the IV or explanatory variable

PSYU2248 Design & Statistics Flashcards

(57 cards)