Research skills 4 Flashcards
what is correlational (observational) design?
- Quantitative description of trends, attitudes, or opinions of a population
- Testing association of X and Y
what is an experimental design?
- Systematic manipulation of one or more variables (X) to evaluate an outcome (Y)
- Holds other variables constant to isolate effects
- Allows testing of causality
correlation coefficients all boil down to a ratio of…
How much two variables vary together: How much two variables vary on their own
which correlation coefficient is used for continuous (numerical) data?
Pearson’s r
By squaring the value of r we get the….
proportion of variance in one variable shared by the other (the overlap)
For example a coefficient of r = 0.6 indicates that 36% of the variance of X and Y is shared -> 0.6 * 0.6 = 0.36
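A minimal sketch of this, assuming SciPy is available and using made-up hours-studied / exam-score numbers:

```python
from scipy.stats import pearsonr

# Hypothetical data: hours studied vs. exam score
hours = [1, 2, 3, 4, 5, 6]
score = [2, 4, 5, 4, 6, 7]

r, p = pearsonr(hours, score)
r_squared = r ** 2  # proportion of variance the two variables share
```

Here r comes out around .92, so roughly 84% of the variance is shared.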
which correlation coefficients are used for ‘ranked’ data
- Spearman’s rho: when there are few tied ranks
- Kendall’s tau: when there are many tied ranks; also better for small samples
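Both are available in SciPy; a sketch with hypothetical rankings from two judges:

```python
from scipy.stats import spearmanr, kendalltau

# Hypothetical: two judges rank the same six contestants
judge_a = [1, 2, 3, 4, 5, 6]
judge_b = [2, 1, 4, 3, 6, 5]

rho, p_rho = spearmanr(judge_a, judge_b)
tau, p_tau = kendalltau(judge_a, judge_b)
```

Note that tau is typically smaller in magnitude than rho for the same data.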
what is the phi correlation and when should it be used?
The phi coefficient is used when you have a 2x2 contingency table (two binary variables); it quantifies the strength of association (degree of dependency) between the two binary variables.
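From a 2x2 table with cells a, b, c, d, phi is (ad − bc) / √((a+b)(c+d)(a+c)(b+d)). A sketch with hypothetical counts:

```python
from math import sqrt

# Hypothetical 2x2 table: rows = treated / untreated, cols = recovered / not
a, b = 20, 10   # treated:   recovered, not recovered
c, d = 8, 22    # untreated: recovered, not recovered

phi = (a * d - b * c) / sqrt((a + b) * (c + d) * (a + c) * (b + d))
```

Here phi is about .40, a moderate association between treatment and recovery.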
what is the point-biserial correlation and when should it be used
A measure of the strength and direction of the relationship between two variables when one is dichotomous (two categories, often coded 0 and 1) and the other is continuous. It is essentially a special case of Pearson’s r adapted for one dichotomous variable.
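SciPy has this directly; a sketch with a hypothetical two-group variable and a continuous score:

```python
from scipy.stats import pointbiserialr

group = [0, 0, 0, 1, 1, 1]               # dichotomous variable (e.g., control vs. treatment)
score = [3.1, 2.8, 3.4, 5.0, 5.5, 4.9]   # continuous variable

r_pb, p = pointbiserialr(group, score)
```

Because it is a special case of Pearson’s r, `pearsonr(group, score)` would return the same value.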
what is a partial correlation
Measures the relationship between two variables, controlling for the effect that a third variable has on them both
what is semi-partial correlation
Measures the relationship between two variables controlling for the effect that a third variable has on only one of the variables in the correlation.
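One standard way to compute both is via residuals: regress out the third variable, then correlate. A sketch with simulated data (assumes NumPy and SciPy), where x and y are related only through z:

```python
import numpy as np
from scipy.stats import pearsonr

rng = np.random.default_rng(0)
z = rng.normal(size=200)
x = z + rng.normal(size=200)   # x depends on z
y = z + rng.normal(size=200)   # y depends on z; x and y share only z

def residuals(a, b):
    """Residuals of a after regressing it on b (simple linear fit)."""
    slope, intercept = np.polyfit(b, a, 1)
    return a - (intercept + slope * b)

r_xy, _ = pearsonr(x, y)                                    # zero-order: inflated by z
r_partial, _ = pearsonr(residuals(x, z), residuals(y, z))   # z removed from both
r_semipartial, _ = pearsonr(x, residuals(y, z))             # z removed from y only
```

The zero-order correlation is clearly positive, while the partial correlation falls to near zero once z is controlled.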
what is a random variable?
What we measure in psychological research: probabilistic (not deterministic) quantities, typically described by their mean and standard deviation
go through the simple regression pipeline
- 2 variables (DV, IV)
- Overall fit (R^2)
- Test of overall fit (F)
- Only if the F statistic is significant (p < .05): Coefficients (b₀, b₁)
what does the F statistics tell us?
The F-statistic is a measure of overall significance or goodness-of-fit of the regression model.
It assesses whether the regression model explains a significant amount of variability in the dependent variable compared to a model with no independent variables (i.e., a null model).
what does the R² statistic tell us?
The R-squared statistic measures the proportion of variance in the dependent variable that is explained by the independent variable(s) in the regression model.
what do coefficients b₀ and b₁ tell us?
- The coefficient b₀ (also known as the intercept) represents the predicted value of the dependent variable when the independent variable is zero.
- The coefficient b₁ (also known as the slope) represents the change in the dependent variable for a one-unit change in the independent variable.
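The whole simple-regression pipeline (R², the F-test’s p-value, b₀ and b₁) can be sketched with SciPy’s `linregress` on hypothetical sleep / reaction-time data:

```python
from scipy.stats import linregress

# Hypothetical: hours of sleep (IV) vs. reaction time in ms/10 (DV)
x = [4, 5, 6, 7, 8, 9]
y = [60, 55, 52, 48, 44, 41]

fit = linregress(x, y)
b0, b1 = fit.intercept, fit.slope   # b0: predicted y at x = 0; b1: change in y per unit x
r_squared = fit.rvalue ** 2         # proportion of variance explained
p_overall = fit.pvalue              # for simple regression, equivalent to the F-test

predicted_at_7 = b0 + b1 * 7        # using the fitted line for prediction
```

The negative b₁ reads as: each extra hour of sleep predicts a drop of about 3.8 in the outcome.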
what is multicollinearity?
Multicollinearity occurs when two or more IVs are highly correlated with each other; the more two IVs correlate, the less sense it makes to keep both in the model
go through the multiple regression pipeline
- At least 3 variables (1 DV, N ≥ 2 IVs)
1. Entry method
2. Overall fit (R²)
3. Test of overall fit (F)
- only if p(F) < α:
4. a. Coefficients (b₀, bX1, …, bXN)
   b. Zero-order & partial correlations
R² is essentially the combination of..
- each IV’s unique contribution to the DV (unique variance)
- the variance the IVs share with the DV (shared variance)
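A sketch of a multiple-regression fit and its R² using plain NumPy least squares, on simulated data where the two IVs deliberately overlap (all coefficients hypothetical):

```python
import numpy as np

rng = np.random.default_rng(1)
n = 100
x1 = rng.normal(size=n)
x2 = 0.5 * x1 + rng.normal(size=n)            # x2 partly overlaps with x1 (shared variance)
y = 2.0 + 1.5 * x1 + 0.8 * x2 + rng.normal(size=n)

X = np.column_stack([np.ones(n), x1, x2])     # design matrix; first column = intercept
b, *_ = np.linalg.lstsq(X, y, rcond=None)     # b = [b0, b1, b2]

y_hat = X @ b
ss_res = np.sum((y - y_hat) ** 2)
ss_tot = np.sum((y - y.mean()) ** 2)
r_squared = 1 - ss_res / ss_tot               # unique + shared variance explained
```

Because x1 and x2 overlap, R² is less than the sum of what each IV would explain on its own.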
what is ‘forced entry’?
when all the predictors are entered at once
what is hierarchical regression?
Hierarchical regression involves entering blocks of variables into the model in a predetermined order based on theoretical or conceptual considerations.
what are stepwise, forward and backward regressions?
- Stepwise regression is a combination of forward selection and backward elimination: it iteratively adds and removes variables from the model based on predetermined criteria (e.g., significance level, change in R²).
- Forward selection starts with an empty model and iteratively adds one independent variable at a time; at each step, the variable that contributes most to the model’s explanatory power (e.g., based on significance level, change in R²) is added.
- Backward elimination starts with a model that includes all independent variables and iteratively removes one variable at a time; at each step, the variable contributing least to the model’s explanatory power is removed.
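Forward selection can be sketched in a few lines of NumPy; this toy version uses gain in R² as its entry criterion (a real package would use p-values or AIC), with a hypothetical `min_gain` threshold:

```python
import numpy as np

def r2(X, y):
    """R² of an OLS fit of y on X (X must include an intercept column)."""
    b, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ b
    dev = y - y.mean()
    return 1 - (resid @ resid) / (dev @ dev)

def forward_select(predictors, y, min_gain=0.02):
    """Greedily add the predictor that most improves R²;
    stop when no candidate improves it by at least min_gain."""
    n = len(y)
    chosen, current = [], 0.0
    remaining = list(predictors)
    while remaining:
        gains = []
        for name in remaining:
            cols = [predictors[k] for k in chosen + [name]]
            X = np.column_stack([np.ones(n)] + cols)
            gains.append((r2(X, y) - current, name))
        best_gain, best = max(gains)
        if best_gain < min_gain:
            break
        chosen.append(best)
        current += best_gain
        remaining.remove(best)
    return chosen, current

# Simulated data: x1 matters most, x2 a little, "noise" not at all
rng = np.random.default_rng(2)
n = 200
preds = {"x1": rng.normal(size=n), "x2": rng.normal(size=n), "noise": rng.normal(size=n)}
y = 1.0 * preds["x1"] + 0.5 * preds["x2"] + rng.normal(size=n)

selected, final_r2 = forward_select(preds, y)
```

The strongest predictor (x1) enters first, then x2; backward elimination would run the same loop in reverse, starting from the full model.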
what are the 7 assumptions to check for a multiple regression?
- Independence
- Variable Type
- Sample Size
- Linearity
- Outliers and Influential Cases
- Normality
- Multicollinearity (Tolerance, VIF)
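VIF for a predictor is 1 / (1 − R²) from regressing that predictor on all the others, and Tolerance is its reciprocal; a common rule of thumb flags VIF above 10. A sketch with NumPy on simulated data where one predictor nearly duplicates another:

```python
import numpy as np

def vif(X):
    """VIF for each column of X: regress the column on all other
    columns (plus an intercept), then return 1 / (1 - R²)."""
    n, k = X.shape
    out = []
    for j in range(k):
        others = np.column_stack([np.ones(n), np.delete(X, j, axis=1)])
        b, *_ = np.linalg.lstsq(others, X[:, j], rcond=None)
        resid = X[:, j] - others @ b
        dev = X[:, j] - X[:, j].mean()
        r2 = 1 - (resid @ resid) / (dev @ dev)
        out.append(1 / (1 - r2))
    return out

rng = np.random.default_rng(3)
n = 200
a = rng.normal(size=n)
b_ = rng.normal(size=n)
c = a + 0.1 * rng.normal(size=n)   # c is nearly a copy of a -> multicollinearity

vifs = vif(np.column_stack([a, b_, c]))
tolerances = [1 / v for v in vifs]
```

Here the VIFs for the two near-duplicate columns are very large, while the independent column stays near 1.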
explain the independence assumption
All values of the outcome should come from different people:
- Each observation (row in the dataset) comes from a unique individual
- Each individual is in one group and one group only
- Each group is made up of different people
what is the ‘variable type’ assumption?
Dependent Variable or DV: outcome must be continuous
Independent Variable(s) or IV(s): predictors can be continuous or categorical