Chapter 5.2: True Score Model of Measurement and Alternatives Flashcards
Also known as the true score model of measurement
Regarded for its simplicity and its notion that everyone has a true score on a test. This theory assumes that a true score exists but that it is affected by other sources of variance and various errors.
CLASSICAL TEST THEORY
A value that, according to classical test theory, genuinely reflects an individual's ability (or trait) level as measured by a particular test
True score
Observed Score = True Score + Error
In what theory is this formula used?
Classical Test Theory
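The formula on this card can be illustrated with a small simulation. This is only a sketch: the score scale, the standard deviations, and the function name simulate_ctt are hypothetical choices made for the example, and reading reliability as the ratio of true-score variance to observed-score variance is the standard CTT interpretation rather than something stated on the card.

```python
import random
import statistics

def simulate_ctt(n_people=5000, true_sd=10.0, error_sd=5.0, seed=0):
    """Simulate observed = true + error and return true variance / observed variance."""
    rng = random.Random(seed)
    # Hypothetical true scores centered on 100.
    true_scores = [rng.gauss(100.0, true_sd) for _ in range(n_people)]
    # Each observed score is the true score plus random error with mean zero.
    observed = [t + rng.gauss(0.0, error_sd) for t in true_scores]
    # Under CTT, reliability is the share of observed variance that is true variance.
    return statistics.pvariance(true_scores) / statistics.pvariance(observed)

reliability = simulate_ctt()
print(round(reliability, 2))
```

With these assumed values the ratio should land near 100 / (100 + 25) = 0.80, and a larger error_sd pushes it down.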
Explain the weakness of Classical Test Theory
It assumes that all items contribute equally to the measurement of the construct. It doesn't work that way; just like in group work, not everyone contributes equally.
It favors longer tests, which is why the classic tests tend to be very long.
True or False. In psychometrics, the assumptions of CTT are stronger than those of IRT.
False
Explain the problem/disadvantage of CTT
- It assumes that all items contribute equally to the total score
- It is very test dependent
- It favors the development of longer rather than shorter tests
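One way to see why CTT favors longer tests is the Spearman-Brown formula, which is not named on these cards but is the standard CTT tool for predicting reliability after a test is lengthened or shortened. The sketch below assumes the added items are parallel to the existing ones; the function name and the example values are hypothetical.

```python
def spearman_brown(reliability, length_factor):
    """Predict reliability when test length is multiplied by length_factor."""
    return (length_factor * reliability) / (1 + (length_factor - 1) * reliability)

doubled = spearman_brown(0.70, 2)    # test made twice as long
halved = spearman_brown(0.70, 0.5)   # test cut in half
print(round(doubled, 3), round(halved, 3))
```

Doubling a test with reliability 0.70 raises the predicted reliability, while halving it lowers the prediction, which matches the pattern the card describes.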
According to this theory, a person’s true score would be obtained by having them respond to all items in the universe of items
Domain Sampling Theory
This theory rebels against the concept of a true score existing with respect to the measurement of psychological constructs
Domain Sampling Theory
Which reliability estimate is most compatible with Domain Sampling Theory?
internal consistency
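Internal consistency is commonly estimated with coefficient alpha (Cronbach's alpha). The cards do not name a specific coefficient, so treating alpha as the estimate is an assumption here, and the small response matrix is made-up data used only for illustration.

```python
import statistics

def cronbach_alpha(scores):
    """scores: one row per person, one column per item."""
    n_items = len(scores[0])
    # Variance of each item across people.
    item_variances = [statistics.variance([row[j] for row in scores])
                      for j in range(n_items)]
    # Variance of the total score across people.
    total_variance = statistics.variance([sum(row) for row in scores])
    return (n_items / (n_items - 1)) * (1 - sum(item_variances) / total_variance)

# Hypothetical right/wrong (1/0) answers of 5 people on 4 items.
scores = [
    [1, 1, 1, 0],
    [1, 0, 1, 1],
    [0, 0, 1, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
]
alpha = cronbach_alpha(scores)
print(round(alpha, 2))  # 0.79
```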
This theory assumes that the items selected for a test are just a sample of items from an infinite domain of potential items
Domain Sampling Theory
True or False. According to Domain Sampling Theory, as the sample of test items gets larger, it represents the domain less and less. The fewer the items, the higher the reliability.
False
According to Domain Sampling Theory, what should we do to represent the domain?
If we draw questions at random from the domain to build a test, then the test represents the domain
According to this theory, a person's test scores vary from testing to testing because of variables in the testing situation
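The idea of sampling items at random from a domain can be sketched with a simulation. Everything here is an assumption made for illustration: the domain is finite (500 items) rather than infinite, the response model is a simple ability-plus-noise rule, and the names pearson and correlation_with_domain are hypothetical.

```python
import random

def pearson(x, y):
    """Plain Pearson correlation, written out to avoid version-specific helpers."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5

def correlation_with_domain(n_test_items, n_people=300, domain_size=500, seed=1):
    """Correlate scores on a random sample of items with scores on the whole domain."""
    rng = random.Random(seed)
    abilities = [rng.gauss(0.0, 1.0) for _ in range(n_people)]
    # Assumed response rule: an item is answered correctly when ability plus noise is positive.
    responses = [[1 if a + rng.gauss(0.0, 2.0) > 0 else 0 for _ in range(domain_size)]
                 for a in abilities]
    domain_scores = [sum(row) for row in responses]
    # Build the test by drawing items at random from the domain.
    picked = rng.sample(range(domain_size), n_test_items)
    test_scores = [sum(row[i] for i in picked) for row in responses]
    return pearson(test_scores, domain_scores)

short_test = correlation_with_domain(5)
long_test = correlation_with_domain(50)
print(round(short_test, 2), round(long_test, 2))
```

Under these assumptions the 50-item sample tracks the domain score more closely than the 5-item sample, which is the sense in which a larger random sample represents the domain better.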
Generalizability Theory
According to ____ theory, a “universe score” replaces that of a “true score” based on the idea that a person’s test scores vary from testing to testing because of variables in the testing situation
Generalizability Theory
Explain the concept of universe in generalizability theory
Instead of conceiving all variability in a person's scores as error, test developers are encouraged to describe the details of the particular test situation (universe) leading to a specific test score
universe
described in terms of its facets:
number of items in the test
amount of training the test scorers have had
purpose of the test administration
According to generalizability theory, given the exact same conditions of all the facets in the universe, the exact same test score should be obtained
universe score is analogous to a true score in the true score model
the person will ordinarily have a different universe score for each universe
In other words, ____ is a specific score for a specific time, setting, or “universe”
universe score
True or False. According to generalizability theory, if the observed scores from a procedure agree closely with the universe score, we can say that the observation is accurate, or reliable, or generalizable
True
This theory emphasizes that a test's reliability does not reside within the test itself
Generalizability theory
Who is the proponent of Generalizability Theory?
Lee Cronbach
It examines how generalizable scores from a particular test are if the test is administered in different situations and how much of an impact different facets of the universe have on the test score
Generalizability Study
The influence of particular facets on the test score is represented by the ____
coefficient of generalizability
Explain the decision study
In a decision study, developers examine the usefulness of test scores in helping the test user make decisions. It is also designed to tell test users how test scores should be used and how dependable those scores are as a basis for decisions
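A generalizability study followed by a decision study can be sketched for the simplest case. The design below (every person answers every item, with items as the only facet) is an assumption chosen for the example, as are the function names and the made-up data; real studies usually involve more facets.

```python
def variance_components(scores):
    """Estimate person, item, and residual variance for a persons x items table."""
    n_p, n_i = len(scores), len(scores[0])
    grand = sum(sum(row) for row in scores) / (n_p * n_i)
    person_means = [sum(row) / n_i for row in scores]
    item_means = [sum(row[j] for row in scores) / n_p for j in range(n_i)]
    ss_p = n_i * sum((m - grand) ** 2 for m in person_means)
    ss_i = n_p * sum((m - grand) ** 2 for m in item_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ss_res = ss_total - ss_p - ss_i
    ms_p = ss_p / (n_p - 1)
    ms_i = ss_i / (n_i - 1)
    ms_res = ss_res / ((n_p - 1) * (n_i - 1))
    # Negative estimates are truncated at zero, a common convention.
    var_p = max((ms_p - ms_res) / n_i, 0.0)
    var_i = max((ms_i - ms_res) / n_p, 0.0)
    return var_p, var_i, ms_res

def g_coefficient(var_p, var_res, n_items):
    """Generalizability coefficient for relative decisions with n_items items."""
    return var_p / (var_p + var_res / n_items)

# Hypothetical right/wrong (1/0) answers of 5 people on 4 items.
scores = [
    [1, 1, 1, 0],
    [1, 0, 1, 1],
    [0, 0, 1, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
]
var_p, var_i, var_res = variance_components(scores)   # generalizability study
g_four = g_coefficient(var_p, var_res, n_items=4)     # decision study: keep 4 items
g_eight = g_coefficient(var_p, var_res, n_items=8)    # decision study: use 8 items
print(round(g_four, 2), round(g_eight, 2))  # 0.79 0.88
```

The first function plays the role of the generalizability study (how much each source contributes to score variance), and the second plays the role of the decision study (how dependable scores would be under a chosen number of items).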
It focuses on how the test items can measure the latent (innate characteristic) trait of the test taker.
Item Response Theory (IRT)
What are the different weights of items in IRT?
- Difficulty
- Discrimination
- Dichotomous test items
- Polytomous test items
- Rasch Model
Explain difficulty in IRT
attribute of not being easily accomplished, solved, or comprehended
Explain Discrimination in IRT?
degree to which an item differentiates among people with higher or lower levels of the trait, ability, or whatever is being measured
Explain dichotomous test items in IRT?
test items that can be answered with only one of two alternative responses
Explain Polytomous test items in IRT
test items with three or more alternative responses, where only one is scored correct or scored as being consistent with a targeted trait or other construct
Explain the Rasch Model
each item on the test is assumed to have an equivalent relationship with the construct being measured by the test
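Difficulty and discrimination can be made concrete with a logistic item response function. The cards do not give a formula, so the two-parameter logistic form below is an assumption, with the Rasch model shown as the special case in which every item shares the same discrimination (fixed at 1.0 here). The function name and the example values are hypothetical.

```python
import math

def probability_correct(theta, difficulty, discrimination=1.0):
    """Probability that a person at trait level theta answers the item correctly."""
    return 1.0 / (1.0 + math.exp(-discrimination * (theta - difficulty)))

# Rasch-style items: same discrimination, different difficulty.
easy_item = probability_correct(theta=0.0, difficulty=-1.0)
hard_item = probability_correct(theta=0.0, difficulty=1.0)

# A more discriminating item separates people above and below its difficulty more sharply.
low_disc_gap = probability_correct(1.0, 0.0, 0.5) - probability_correct(-1.0, 0.0, 0.5)
high_disc_gap = probability_correct(1.0, 0.0, 2.0) - probability_correct(-1.0, 0.0, 2.0)
print(round(easy_item, 2), round(hard_item, 2))  # 0.73 0.27
```

When theta equals the item's difficulty the probability is exactly 0.5, which is one common way of reading the difficulty parameter.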
Explain reliability and individual scores
An individual test taker's scores fall within a range or specific spectrum, regardless of their condition or where they took the test.
Example: I gave a client an intelligence test and obtained their score. The result shows the band or spectrum of scores within which their score can be expected to fall. If their score falls outside that spectrum on a third administration, it suggests that the test I developed is not reliable.
provides an estimate of the amount of error inherent in an observed score or measurement.
standard error of measurement
_______ is a range or band of test scores that is likely to contain the true score.
Confidence Interval
The observed score equals the true score plus error, and the error is what causes our scores to fluctuate from one end of a spectrum to the other. The band of scores within which the score can still be considered reliable, and which most likely contains the true score, is called the
confidence interval
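The standard error of measurement and the confidence interval can be worked through numerically. The cards give no formulas, so the sketch assumes the usual CTT expressions: SEM = SD x sqrt(1 - reliability), and a 95% band of the observed score plus or minus 1.96 SEM. The test statistics (SD of 15, reliability of 0.91, observed score of 110) are hypothetical.

```python
import math

def standard_error_of_measurement(sd, reliability):
    """Estimated amount of error inherent in an observed score."""
    return sd * math.sqrt(1.0 - reliability)

def confidence_interval(observed, sem, z=1.96):
    """Band of scores likely to contain the true score (z=1.96 for about 95%)."""
    return observed - z * sem, observed + z * sem

sem = standard_error_of_measurement(sd=15.0, reliability=0.91)
lower, upper = confidence_interval(observed=110.0, sem=sem)
print(round(sem, 2), round(lower, 2), round(upper, 2))
```

With these assumed values the SEM is about 4.5 points, so the 95% band runs from roughly 101 to 119.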
Explain the standard error of the difference
used to make comparisons between scores
a statistical measure that can aid a test user in determining how large a difference should be before it is considered statistically significant
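A short numeric sketch of the standard error of the difference follows. The formula used, the square root of the sum of the two squared standard errors of measurement, is the standard one but is not given on the card; it also assumes both scores are on the same scale. The SEM values, the scores, and the function names are hypothetical.

```python
import math

def standard_error_of_difference(sem_1, sem_2):
    """Combine the standard errors of measurement of the two scores being compared."""
    return math.sqrt(sem_1 ** 2 + sem_2 ** 2)

def is_significant_difference(score_1, score_2, sed, z=1.96):
    """True when the gap between scores exceeds z standard errors of the difference."""
    return abs(score_1 - score_2) > z * sed

sed = standard_error_of_difference(4.5, 4.5)
small_gap = is_significant_difference(110.0, 104.0, sed)
large_gap = is_significant_difference(110.0, 95.0, sed)
print(round(sed, 2), small_gap, large_gap)  # 6.36 False True
```

Under these assumptions two scores must differ by about 12.5 points before the difference is treated as statistically significant at roughly the 95% level.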
A test can be reliable but not _____, but a test cannot be valid if it is not _______
valid; reliable