reliability Flashcards

1
Q

What is Classical Test Theory (CTT)?

A

A theoretical framework for reliability defined by assumptions describing how measurement errors influence observed test scores.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the fundamental equation of reliability theory? (Assumption 1 of CTT)

A

X = T + E, where X is the observed test score, T is the true score, and E is the random error score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does ε(E) = 0 signify in CTT?

A

The average error score across repeated testing is zero, meaning positive and negative errors cancel each other out.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the implications of ε(X) = T in CTT?

A

It allows us to derive that ε(E) must be 0, confirming that measurement errors are random.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does ‘E’ represent in the context of reliability?

A

Unsystematic, or random, measurement error that deviates an examinee’s observed score from the true score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the assumptions of CTT regarding error scores? (Assumption 3)

A

Errors are independent and do not correlate with true scores. P(ET) = 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does it mean if two tests are parallel according to CTT?

A

They are tau-equivalent:
They satisfy Assumptions 1 through 5, measure the construct equally well (T = T’), and have the same level of error variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How is the reliability coefficient defined?

A

It is the proportion of observed score variance attributable to true-score variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does a reliability coefficient of 1 indicate?

A

Observed-score variance reflects entirely true-score variance, indicating perfect reliability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does a reliability coefficient of 0 indicate?

A

Observed-score variance reflects entirely error-score variance, indicating zero reliability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the significance of the equation σ²_X = σ²_T + σ²_E?

A

It means observed-score variance is equal to the sum of true-score variance and error-score variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the implication of having a heterogeneous sample for reliability estimation?

A

Greater variability among people increases true-score variance, which enhances reliability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Fill in the blank: According to CTT, if two tests are essentially tau-equivalent, they have true scores that are the same except for an _______.

A

additive constant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

True or False: Congeneric measures have perfectly correlated true scores.

A

True.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does a higher reliability indicate about estimating true scores from observed scores?

A

The higher the reliability, the more confident we can estimate true scores from observed scores.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does it mean when reliability falls between 0 and 1?

A

Observed-score variance includes some true-score variance and some error-score variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the relevance of error variance in relation to the reliability coefficient?

A

Reliability reflects the degree to which error variance is minimal compared to the variance of observed scores.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is a primary challenge in estimating reliability based on CTT?

A

There is no way of knowing the true scores or the error associated with test responses.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the difference between parallel tests and essentially tau-equivalent tests?

A

Parallel tests have equal error variance, while essentially tau-equivalent tests do not.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What are the assumptions that must be satisfied for two tests to be considered parallel?

A

They must meet Assumptions 1 through 5 and measure the construct equally well.

21
Q

How does the reliability coefficient relate to observed and true scores?

A

It represents the correlation between observed and true scores.

22
Q

What does the assumption ε(X) = T imply in terms of test scores?

A

It implies that the observed scores reflect the true scores without systematic error.

23
Q

What is the relationship between true-score variance and observed score variance in parallel tests?

A

The correlation between scores on two parallel forms of a test is equal to the ratio of true-score variance to observed score variance.

24
Q

how to prove reliability in the context of parallel tests?

A

Reliability is proven by equal observed score variance for parallel tests, assuming errors are random and uncorrelated.

25
Q

Fill in the blank: The correlation between scores on two parallel forms of a test is equal to the ratio of _______ to observed score variance.

A

[true-score variance]

26
Q

What is cov(X, X) in relation to the covariance of scores?

A

cov(X, X) is equal to the variance of X.

27
Q

What does the notation ρ^2 represent?

A

ρ^2 represents the ratio of true-score variance to observed score variance.

28
Q

True or False: Errors in parallel tests are correlated.

29
Q

What assumptions are made about errors in the calculation of reliability?

A

Errors are assumed to be random and uncorrelated.

30
Q

What does the formula cov(T, T’) represent?

A

It represents the covariance between true scores T and T’.

31
Q

List the components of the covariance formula for true and error scores.

A
  • cov(T, T)
  • cov(T, E)
  • cov(E, T)
  • cov(E, E)
32
Q

What does the notation σ represent in this context?

A

σ represents the standard deviation of the scores.

33
Q

What is the significance of Assumption 6 in this context?

A

Assumption 6 is used to derive relationships between the variances and covariances of scores.

34
Q

True or False: The observed score variance is equal to the sum of true score variance and error variance.

35
Q

Fill in the blank: The formula for the correlation coefficient is _______.

A

[cov(T, T) / (σ_T^2)]

36
Q

assumption 1-5 of ctt

A

X = T + E (1)
E(X) = T (2)
ρ_ET=0 (3)
ρ_E1T2=0 (4)
ρ_E1E2=0 (5)

37
Q

assumption 6 of ctt

A

Observed scores X and X’ satisfy Assumptions (1) to (5)
Tau-equivalent condition: The two tests measure the construct equally well, thus T = T’  T1 = T2
The tests have the same level of error variance (〖ϑ_E〗^2= 〖ϑ_E’〗^2)

38
Q

Assumption (7): Two tests are considered essentially tau-equivalent if (when tau-equivalent cannot be met):

A
  • They have observed scores X and X’ that satisfy Assumptions (1) to (5)
  • They have true scores that are the same except for an additive constant (i.e., T1 = T2 + c)
39
Q

What is Test-retest reliability

A
  • Correlate the scores obtained from the same test administered on two different occasions, based on the assumptions:
    o (1) that the true scores are stable across the two occasions
    o (2) the error variance of the first testing equals to that of the second testing.
  • This provides an estimation of the stability of the test scores.
40
Q

Problems due to assumptions of test-retest reliability

A

Assumptions
o (1) that the true scores are stable across the two occasions
o (2) the error variance of the first testing equals to that of the second testing.

Problems:
o Length of test-retest interval is critical.
 Too short: Possibility of carryover effects e.g. memory/practice effects
 Too long: if true scores are not stable across time, and true score changed over time
o Estimate inappropriate if the construct measured is not stable over time (e.g., a state-like construct).
 Need to try to maintain the error variance of measurement
o Hence, only use test-retest reliability if the construct is known to be stable across time/less susceptible to carryover effects e.g. visual acuity

41
Q

Assumptions of test-retest reliability

A

o (1) that the true scores are stable across the two occasions
o (2) the error variance of the first testing equals to that of the second testing.

42
Q

Advantages of using Internal consistency reliability rather than test-retest reliability

A
  • A useful practical alternative because it requires respondents to complete only one test at only one point in time, thus avoiding the problems associated with repeated testings.
    o premise is that different parts of a test can be treated as different forms of the test* and correlating the scores for these different parts provide a reliability estimate. –> Mitigates the issue that the construct needs to be stable across time
43
Q

How is internal consistency reliability obtained?

A
  • This reliability provides an estimate of how different parts of the test are consistent with each other.
  • The premise is that different parts of a test can be treated as different forms of the test* and correlating the scores for these different parts provide a reliability estimate.
  • Three different approaches to the internal consistency method of estimating reliability:
    o (1)split-half
    o (2) Cronbach’s alpha
    o (3) standardized alpha
44
Q

assumptions of using internal consistency reliability (and its problem)

A

o Two halves have are as good as each (same validity)
 Problem: two forms may not be parallel if the validity of the two halves are not equal

45
Q

assumptions of split-half approach

A

o Items added/removed are parallel (Test items are good)

45
Q

Split-half approach (how it is done)

A

The test is split into two halves (e.g., odd/even or first/second
half), and scores for the two halves (Y and Y’) are correlated.
As the correlation is only a measure of the reliability of one half of the test, the reliability of the entire test (X = Y + Y‘) would be greater and is estimated using the Spearman-Brown formula:
Reliability is a function of test length (Longer test  Higher reliability)
If we just correlate the two halves  underestimation of reliability as the actual test is longer  need for estimation of the reliability of the entire test using the Spearman-Brown formula
ρ_XX’=(2ρ_YY’)/(1+ρ_YY’ ) where ρ_YY’ is the correlation of the spilt halves of the tests Y and Y’ are parallel and X and Y and Y’ are parallel; X is the real test
Hence, should adhere to assumption 6 of CTT

46
Q

disadvantage of split half approach

A
  • Different ways to split – odd-even; first-second half - reliability estimates are not the same
47
Q

spearman brown formula

A

refer to notes
ρ_XX’=(Nρ_YY’)/[1+(N-1)ρ_YY’ ]
where N is the factor by which the test is increased if split into half then N =2, ρ_YY’ is the correlation based on split halves

48
Q

what is cronbach’s alpha and how does it split tests

A
  • Alpha is a measure of internal consistency, which refers to the interrelatedness of a set of items.
  • Splits test to the item-level
    o Such that we get a best estimate of reliability