Lecture 3: IRT Parameter estimation and model fit Flashcards
Why is model fit so important? (2)
Because we can check whether a model fits the data and, if not, try another model. This ability to test model fit is a key reason we focus on latent variable models rather than CTT.
Also, because otherwise you would be drawing conclusions from a model that is incorrect; once you decide to use a model, you should establish that it fits.
What is the first thing you should look at when establishing model fit?
How many dimensions there are; if the data have more than one dimension, it is useless to use unidimensional IRT models (which assume one latent variable).
Name three other things to check when establishing model fit
- Equal item discrimination
- Absence of guessing (1-/2- vs. 3-parameter model)
- Model predictions: apply the model to the data and see what it predicts for those data. If model and data agree, keep the model; if not, look for a different model (see the sketch below)
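A minimal sketch of how such a comparison could be done in R with the ltm package (assuming data is a data frame of dichotomous item scores; the object names are made up):
library(ltm)
fit1pl <- rasch(data)        # one common discrimination parameter
fit2pl <- ltm(data ~ z1)     # item-specific discrimination parameters
fit3pl <- tpm(data)          # adds a guessing parameter per item
anova(fit1pl, fit2pl)        # likelihood-ratio test: is equal discrimination tenable?
anova(fit2pl, fit3pl)        # likelihood-ratio test: is absence of guessing tenable?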
In LVM you cannot calculate things like slopes and regression weights directly from the data, as in CTT. What do you calculate instead?
Maximum likelihood: you evaluate for which values of your model parameters the observed data are most likely to occur. That is, which values of the parameters a1, a2, b1, b2, θ1, θ2, etc. maximise the likelihood of the data? In other words, under which parameter values would the observed data be most probable? It is one of the most important estimation methods.
What equations are used to calculate the likelihood for dichotomous IRT models?
P_i = P(X_pi = 1 | θ_p) = exp(a_i(θ_p − b_i)) / (1 + exp(a_i(θ_p − b_i)))
Q_i = P(X_pi = 0 | θ_p) = 1 / (1 + exp(a_i(θ_p − b_i)))
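A small sketch of these two probabilities in R (the item parameters and θ value below are made up for illustration):
# 2PL probability of a correct (P) and an incorrect (Q) response
p_correct   <- function(theta, a, b) exp(a * (theta - b)) / (1 + exp(a * (theta - b)))
p_incorrect <- function(theta, a, b) 1 / (1 + exp(a * (theta - b)))
p_correct(theta = 0.5, a = 1.2, b = -0.3)    # about 0.72
p_incorrect(theta = 0.5, a = 1.2, b = -0.3)  # 1 minus the value above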
What two assumptions does maximum likelihood estimation rely on?
Uni-dimensionality: All items assess one construct
Local independence: conditional on θ, all of the items are independent; in other words, θ accounts for all of the correlations among the items.
How do you calculate maximum likelihood?
E.g. if a subject has the response pattern 1, 0, 0, 1, we can calculate the probability of that pattern as:
P(X_p1 = 1, X_p2 = 0, X_p3 = 0, X_p4 = 1 | θ_p)
= P_1 × Q_2 × Q_3 × P_4
= [exp(a_1(θ_p − b_1)) / (1 + exp(a_1(θ_p − b_1)))] × [1 / (1 + exp(a_2(θ_p − b_2)))] × [1 / (1 + exp(a_3(θ_p − b_3)))] × [exp(a_4(θ_p − b_4)) / (1 + exp(a_4(θ_p − b_4)))]
Which can be written as:
∏_{i=1}^{4} [ exp(X_pi · a_i(θ_p − b_i)) / (1 + exp(a_i(θ_p − b_i))) ]
By applying this to the whole data set (multiplying the probabilities over all persons) and varying the parameter values, you obtain the likelihood of the data.
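A sketch of this calculation in R for the response pattern 1, 0, 0, 1 (all parameter values below are made up):
# 2PL probability of the observed response x (1 or 0) on one item
p_item <- function(x, theta, a, b) exp(x * a * (theta - b)) / (1 + exp(a * (theta - b)))
x     <- c(1, 0, 0, 1)           # observed response pattern
a     <- c(1.0, 1.5, 0.8, 1.2)   # hypothetical discrimination parameters
b     <- c(-0.5, 0.0, 0.5, 1.0)  # hypothetical difficulty parameters
theta <- 0.3                     # hypothetical latent trait value
prod(p_item(x, theta, a, b))     # likelihood of the whole pattern (local independence)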
What does the addition of Xpi in the maximum likelihood notation do?
∏_{i=1}^{4} [ exp(X_pi · a_i(θ_p − b_i)) / (1 + exp(a_i(θ_p − b_i))) ]
The exponent X_pi means that the term changes depending on whether the response is a 1 or a 0. If X_pi = 1, it becomes the probability of a correct response:
exp(a_i(θ_p − b_i)) / (1 + exp(a_i(θ_p − b_i)))
If X_pi = 0, the numerator becomes exp(0) = 1, so the term becomes the probability of an incorrect response:
1 / (1 + exp(a_i(θ_p − b_i)))
How does the assumption of local independence affect this equation?
Due to the assumption of local independence we can simply multiply the individual item probabilities. Without this assumption the equation would be very long and hard to deal with, because we would have to account for every association between the items.
Describe the graph of the maximum likelihood function with one difficulty parameter. What can be derived from this?
Difficulty parameter on the x-axis, likelihood on the y-axis. The curve is typically skewed, rising with a gradual slope to a single peak. The highest point (the maximum) of this curve marks the parameter value with the maximum likelihood, i.e. the ML estimate.
What values do the likelihood values on the y-axis take?
They are extremely small, because they represent the probability of observing exactly the data that were observed. For this reason we focus on the log-likelihood, which gives more manageable numbers; this is a very important concept.
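A minimal sketch of such a curve in R, for one Rasch item (a = 1) with hypothetical known abilities and responses: the log-likelihood is evaluated over a grid of difficulty values, and the maximum marks the ML estimate.
theta <- c(-1, -0.5, 0, 0.5, 1)    # hypothetical ability values
x     <- c(0, 0, 1, 1, 1)          # hypothetical responses to the item
loglik <- function(b) sum(x * (theta - b) - log(1 + exp(theta - b)))
b_grid <- seq(-3, 3, by = 0.01)
ll     <- sapply(b_grid, loglik)
plot(b_grid, ll, type = "l", xlab = "difficulty b", ylab = "log-likelihood")
b_grid[which.max(ll)]              # difficulty value with the highest (log-)likelihood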
What is meant by the term asymptotic properties? What does this mean in a practical sense?
These are properties that ML estimation has as the sample size approaches infinity. For relatively small samples they may not hold, but for reasonably large samples (e.g. 200), these properties should hold approximately.
What three asymptotic properties does maximum likelihood have?
- The expected values of the estimates are the true values of the parameters (if you estimate the two-parameter model, the expected value of your ML estimate is equal to the true value; this means the estimator is unbiased, it is not systematically incorrect)
• Maximum likelihood estimates (MLEs) are denoted with a hat, e.g. ^a_1 for the estimate of a_1
• Thus: E(^a_1) = a_1
- The curvature at the maximum determines the standard errors of the estimates
• A wider curve means that a wider range of values has a high likelihood of being the best value for the parameter
- The MLEs have a normal distribution such that
^a_1 ~ N(a_1, se(^a_1)) (the normal distribution has a mean equal to the true parameter value a_1 and a standard deviation equal to the standard error of your estimate)
What do these three properties of MLE allow for?
Because of these properties you can do a simple significance test on a parameter; a test on 𝐻0:𝑎1 =𝜇 can be conducted using the so-called Wald test
z = (^a_1 − μ) / se(^a_1)
where z has a standard normal distribution if H0 is true
Give a practical example on how carrying out a significance test with MLE could be useful
To see if a discrimination parameter is significantly larger than zero: if it is not, the item does not measure the latent variable, and its item characteristic curve is flat because the item does not discriminate at all.
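A minimal sketch of such a Wald test in R, using made-up numbers for the estimate and its standard error (in practice these come from the ML output):
a_hat <- 1.40                  # hypothetical ML estimate of a discrimination parameter
se_a  <- 0.35                  # hypothetical standard error of that estimate
z     <- (a_hat - 0) / se_a    # Wald statistic for H0: a1 = 0
2 * (1 - pnorm(abs(z)))        # two-sided p-value under the standard normal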
What three estimation approaches are very popular in item response theory?
Joint maximum likelihood
Conditional maximum likelihood
Marginal maximum likelihood
What is involved in joint MLE?
You estimate all ai, bi, ci, θ simultaneously
Joint maximum likelihood thus seems like a good approach, as you obtain all the estimates you need at once. What are some disadvantages of this method? (4)
It is a massive estimation problem: you must estimate every item parameter (of which there can be hundreds or even thousands) and a θ parameter for every participant (of which there can also be hundreds or even thousands).
Some properties of joint MLE are also unhelpful. For instance, if a person answers all items correctly, ^θ → ∞, and if all are incorrect, ^θ → −∞. This may make sense intuitively, but statistically it is a disaster, because such estimates are very hard to work with and to draw inferences from.
The same holds for the items: if every person answers an item correctly, or every person answers it incorrectly, then ^b_i → −∞ or ^b_i → +∞, respectively.
It also has undesirable asymptotic properties: as the sample size approaches infinity, you get a new θ parameter for each subject, so the number of θ (latent trait) parameters also grows to infinity. More subjects should increase certainty, but every added subject brings a new parameter and hence new uncertainty. This means the usual asymptotic properties of maximum likelihood do not hold.
How do the other two ML estimation methods tackle the challenges posed by joint MLE?
They do not require the latent trait value to be estimated explicitly for each person; θ is no longer a parameter, and the only free parameters are the item parameters.
What is involved in conditional maximum likelihood?
Conditional on the sum score, θ disappears. When you plug in the sum score for each person, the latent trait is cancelled out from the likelihood function and you only have the item parameters.
What are the disadvantages of the conditional MLE?
It is only applicable to the Rasch model: only when all items share the same discrimination is the sum score a sufficient statistic for θ, so models with item-specific discrimination parameters cannot be estimated this way.
How do you calculate conditional MLE in R? What kind of item parameter does this estimation give?
Using the package eRm:
library(eRm)
res <- RM(data)   # fit the Rasch model with conditional ML
res               # print the results
coef(res)         # extract the item parameters
Estimates item easiness b_i:
P(X_pi = 1 | θ_p) = exp(θ_p + b_i) / (1 + exp(θ_p + b_i))
(rather than the difficulty parameterisation exp(θ_p − b_i) / (1 + exp(θ_p − b_i)))
(So the larger the value, the easier the item)
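So, assuming (as stated above) that coef(res) returns easiness parameters, a difficulty-style parameterisation can be obtained by simply flipping the sign (a sketch):
easiness <- coef(res)   # easiness parameters from the conditional ML fit
-easiness               # difficulty parameterisation: larger values = harder items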
What is the advantage of using marginal MLE?
It is applicable to practically all IRT models
What is a disadvantage of using marginal MLE?
It assumes a normal distribution for θ. You need this distribution to marginalise the latent variable out of the likelihood function: you integrate (sum) over all possible values of the latent variable.
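A rough sketch of this idea in R: the marginal likelihood of a response pattern is the conditional likelihood averaged over a normal distribution for θ, here approximated with a simple grid (all parameter values are made up):
x <- c(1, 0, 0, 1)            # observed response pattern
a <- c(1.0, 1.5, 0.8, 1.2)    # hypothetical discrimination parameters
b <- c(-0.5, 0.0, 0.5, 1.0)   # hypothetical difficulty parameters
cond_lik <- function(theta) prod(exp(x * a * (theta - b)) / (1 + exp(a * (theta - b))))
theta_grid <- seq(-6, 6, by = 0.01)
sum(sapply(theta_grid, cond_lik) * dnorm(theta_grid) * 0.01)  # approximate marginal likelihood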
How do you calculate marginal MLE in R?
Use the package ltm in R (which implements marginal ML estimation for many IRT models):
library(ltm)
res1pl <- rasch(data)   # fit the one-parameter (Rasch) model with marginal ML
res1pl                  # print the estimates
In docs you’ll see a screenshot of conditional MLE output. Describe what information this output gives
The conditional log-likelihood indicates the value of the likelihood of the data, given these parameter estimates, at the maximum. The parameter section gives an estimate for each item.
Note that these betas are easiness parameters: in this output the first beta listed is one of the easiest items (a high value) and the last beta listed is one of the most difficult items (a low value) (true only for these data).
In docs you’ll see a screenshot of marginal MLE output. Describe what information the coefficients give
You get a single common discrimination parameter at the bottom, because the one-parameter (Rasch) model fitted here constrains all slopes to be equal. The rest are ordinary difficulty parameters, loosely listed from easier to more difficult items (true only for these data), with lower values indicating easier items.
The log.Lik indicates the log-likelihood: the value of the likelihood of the data, given these parameter estimates, at the maximum.
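A sketch of how these numbers can be extracted in R (assuming res1pl is the rasch() fit from above; coef() should give the difficulty and discrimination estimates):
coef(res1pl)      # item difficulty (Dffclt) and common discrimination (Dscrmn) estimates
summary(res1pl)   # also shows standard errors and the log-likelihood at the maximum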