quantitaive methods Flashcards

Question

positives of IQR?

Answer 1

+ve - not influenced by extreme values, stable measure= doesnt change a lot if we keep adding observations

Answer 2

to get rid of negatives, so the -/+ve values dont cancel each other out when added together. Also increases larger deviations more than smaller ones so that theyare weighted more heavily.

Answer 3

- non-negative - for observations who values near the mean, the variance will be small - values dispersed from the mean, variance will be large

Answer 4

+ve - uses all observations in the data set to measure the variation in the samle (vs range) -ve - variance measures squared value = intepretation sint straightforward

Answer 5

it is the most common/useful measure of dispersion = average distance of each obeservation from the mean. Advantage - uses all values of the data set, expresed in the same unit of measure as the observations.

Answer 6

linear (positive relationship) and non-linear

Answer 7

linear positive - one variable increases so does the other one linear negative - one variable increases the other decreases non-linear association - e.g. hours studied and test score no association - no. of people who go gymvs no. of tickets sold at museum

Answer 8

direction (positive/negative), shape (linear/curved), strength (strong/weak), outliers

Answer 9

it is a way to measure the strength and direction of the linear association between two variables - takes values between +1 and -1

Answer 10

+1 = perfect postive relationship -1 = perfect negative relationship 0 =no linear relationship

Answer 11

scatter diagram= shows relationship between two variables, useful first step in correlation/regression analysis correlation analysis= relationship between two vatriables to measure the strength of their association regression analysis= relationship between variables with aim to ascertain the dependent effect of one variable upon another

Answer 12

- the dependant vairable is a continuos numerical value - use LR to predict the value of a specific variable by using another variable - variable being predicted = dependent variable (y) and variable being used to predict the DV is called the independent variable (x)

Answer 13

y= a + bx + e where y = dependent variable, x= IV, e= residuals, a/b= estimate parameters (a=intercept, b=slope)

Answer 14

``` b= slope parameter, defines the impac that a unit of change of the IV (x) had on DV (y) a= is the intercept and the average value of y when x is equal to zero (when the IV has no effect on the dependent) ```

Answer 15

y(hat) = a+bx

Answer 16

difference between an observed and predicted y is called the residual i.e. e=y-y(hat) it reflects the factors that are not considered in the model that have an influence on the DV. can be psoitve or negative.

Answer 17

the LSP minimises the deviation of observation from the regression lines i.e. residuals Method - rarely can see scatter plot all in straight line, so we use least squares to find that line that best fits the data. LSM - finds the line with those a and b values that gives you the smallest possible overall vertical distance between the line and the points in the scatter diagram in relation to any other line that can be drawn

Answer 18

a change in the IV = change in the DV - whether this is significant we use 0.05 as the threshold. Low P-value - likely to be meaningful to the mdoel because changes in the predictor value are related to changes in the response variable large P-value - insignificant, suggests changes in the predictor are not associated with chnages in the response.

Answer 19

there is a correspondance between the t-value and p-value. standardised coefficients can be used to compare the relative impact of multiple IV's = a larger beta value indicates a higher impact, only useful when we have multiple IV's

Answer 20

R- squared, ranges from 1-0 and gives us info about the relationship between variables. R2 = proportion of the total vairbales in the DV that is explained or accoutned for by the variation in the IV - it is the squared coefficient of correlation. e.g. .951 is 95%

Answer 21

y=a +b1x1+b2+x2...+bn+xn+ e | where y= DV, e= residule, a,b1...= parameters, x1,x2...=IV's

Answer 22

refers to how many standard deviations a DV will change if an IV increases by one standard deviations. Use this when trying to investigate which IV exerts the highest effect on the DV = cannot directly estimate it because variables are measured in different units.

Answer 23

how well it fits a set of observations, typically summarises the discrepancy between observed values and the valus expected under the model in question. In the linear regression analysis we use R2 measure (coefficient of determination). Low R2 predictors may help improve it.

Answer 24

R2 in multiple regression analysis tends to increase with the number of variables in the modela nd it adds a 'fake' percentage of the difference in the values of the DV explained. Therefore, it is preferable to estimate the % of variation explained by the model = adjusted R2

Answer 25

Adj R2 = 1 - residual mean square/total residual square

Answer 26

R2 is for regression using single predictors, adj R2 is for regression using multiple predictors.

Answer 27

can allow us to make a more accurtae prediction about the values of the DV as it can allows us to explain a higher % of the difference in values of the DV

Answer 28

y= a +b1x1+b2x2+b3x3+...+e where x = IV, a= Y intercept, b1= the net change in y for each unit change in x1, holding x2 constant (partial regression coefficient) The least squares criterion is used to develop this equation, determing b1, b2 etc is very tedious you need software

Answer 29

needs relationship between the I and DV to be linear, improtant to check for outliers since LR is senstivie to outlier effects, best tested with scatter plots between x and y

Answer 30

1) multivariate Normality, 2)lack of multicollinearity, 3)no autocorrelation 4) homoscedasticity (constant variance)

Answer 31

- linear regression requies all variables to be multivariate normal or errors are normally distributed (bell curve) - Can be best checked with a histogram of residuals or a P-P plot which is a probability plot for assessing how closely two data sets agree, plots two cumulative distribution functions against each other. - when data is not normally distributed a non-linear transformation e.g. log transformation might fix this issue

Answer 32

- occurs when the IVs are not independent from each other - multicollinearity might be tested with the following criteria: correlation matrix (computing the matric of the pearsons correlation coefficients among all the IV's) or with the Variance Inflation factor (VIF) = defined as VIF=1/(1-R2), with VIF>10 there is an indication for multicollinearity, if >100 it is certain.

Answer 33

if it is found in the data, centering the data, that is deduting the mean score, might help to solve the problem. - need to remove similarities by conducting mean centre of the two numbers, after centre correction if we dont find such high correlations we are safe to continue with LR to ensure linear is the right model to use.

Answer 34

- independence of residuals and errors - occurs when the residuals are not independent from each other - typically occurs in stock prices - can be tested with the durbin-watson test - d is between 0 and 4, rule of thumb 1.5

Answer 35

- it means the error terms (residuals) along the regression are equal or have constant variance - the scatter plot between y(hat) and e is a good way to check whether homoscedasticity is given

Answer 36

- a particular type of regular categorical variable | - have two values (0,1) and are often used to indicate that an event has occured or that some characteristics is present

Answer 37

problem: is that y only takes values 0 and 1, so LR always return meaningless results of y(hat). solution: make some changes to y that allows meaningful interpretation on the parameters and regression outcomes. When y is binary, the LR model becomes the Linear probability model (LPM): - a+bx is the probability that y=1, given x (Pr(y=1/x) - the predicted value y(hat) is the predicted probability that y=1 (Pr(y=1/x)) for a given x, by changing x to x + Δx, the probability that y=1 changes to b

Answer 38

1) Unbounded Predicted Probabilities fundamental law of probability - states that the probability of an event occuring must be contained within the interval (0,1) BUT the nature of a LPM doesnt ensure this fundamental law of probability is satisified - some prohibited probabilties may have non-sensical values that are less than 0 or greater than 1. 2) Non-normality of the errors - the errors/residuals of an LPM do not have a normal distribution (since y only takes the values of 0 and 1) - the error has one of two possible values for a given x value (e= y-a-bx): if y=1, then e= 1-a-bx if y=0, then e= -a-bx 3) heteroscedasticity - the variance of the errors depends on the independent variables and is not constant 4) non-linear relationship - model is linear, a unit increase in x resuls in a constant change of b in the probabiltiy of an event, holding all other variables constant - the increase is the same regardless of the current value of x

Answer 39

it is the models probabiltiy as a linear function of X. +ve: simple to estimate and to interpret, inference is the same as for multiple regression -ve: unbounded predicted probabilties, non-normality of the errors, heteroscedasticity, non-linear relationship THESE DISADVANTAGES CAN BE studied by using a non-linear probability model= LOGIT MODEL

Answer 40

problem with the linear probability model is that it models the probability of y=1 as being linear and unbounded. Pr(y=1/x) = a=bx (not sufficient) Instead we want a non-linear transformation of a + bx

Answer 41

the target is to find a function form of a + bx that will only take values between 0,1: a+bx E(-infinity+infinity) -> exp(a+bx)E(o,+infinity) -> exp(a+bx)/1+exp(a+bx) -(0,1)

Answer 42

Pr(y+1/x) = exp(a+bx) / 1+exp (a+bx)

Answer 43

``` let p = Pr (y=1/x) target = transform p E(0,1) into f (p)E (-infinity,+infinity), wehre f (p) means a function of p odds: p E(0,1) -> odds ratio p/1-p E(0,+inifinity) -> logit = ln(p/1-p) E(-infinity,+inifnity) ```

Answer 44

ln (p/1-p)=a +bx or exp (a+bx)/ 1+exp (a+bx)

Answer 45

it measures the relationshop between the dummy categorical dependent variable and one/more independent variables by estimating probabilities using a logit link function.

Answer 46

a)linear b)logit

Answer 47

unconstrained (Mu) includes all predictors of interest

Answer 48

constrained model (Mc) is an intercept only model, excluding all predictors (i.e. constrain b=0)

Answer 49

using the likelihood ratio test which is equivalent to R2 for linear, which gives information on variables

Answer 50

whether there is evidence of the needs to move from a simple model to a more complicated one. You compare the likelihood function of a constrained and unconstrained model and examine whether adding predictors in the unconstrained model significantly improves the understanding of the changes in the dependent (categorical) variable. Essentially you take MC and minus MU = if the difference is greater than the critical value given it is valuable to include as it helps us understand it better.

Answer 51

linear - must satisfy all LR assumptions | logit - disadvantage of LPM

Answer 52

linear - regression model | logit - regression model (use odds ratio - non linear regressions model

Answer 53

linear - magnitude and direction significance logit - magnitude and direction (non-linear effect, clarify the starting value) significance (changes in x will lead to changes in logit p)

Answer 54

linear- Rsquare, adjusted Rsquare, evaluate the model | logit - likelihood ratio test, evaluate the model

Answer 55

NONE- MAKE RECOMMENDATIONS BASED ON THE RESULTS AND/OR SUGGEST HOW TO FURTHER IMPROVE THE MODEL.

quantitaive methods Flashcards

(79 cards)