2. Test worthiness and Statistics Flashcards

Question 1

Q

What is an important statistic in test creation and in measures of test worthiness?

Answer

A

Correlations

Question 2

Q

what is the range for correlations?

Question 3

Q

what are the ways to observe correlation?

Answer

A

Linear - line of best fit

Curvilinear - what is the shape of the line?

Question 4

Q

What question does shared variance ask?

Answer

A

What are the factors that contribute to shared variance?

Question 5

Q

what is another term for shared variance?

Answer

A

Squared correlation coefficient (r²)

Question 6

Q

what is the only way to determine shared variance when you have a correlation coefficient?

Answer

A

Square the correlation coefficient; the result tells you how much variance the two variables share
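
As a worked illustration of squaring a correlation coefficient, the sketch below computes Pearson's r for two made-up lists of scores and then squares it to get the shared variance. The data and the helper name `pearson_r` are assumptions for illustration only.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient for two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products of deviations from each mean
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    # Sums of squared deviations for each variable
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical scores on two tests for the same five people
test_a = [1, 2, 3, 4, 5]
test_b = [2, 4, 5, 4, 5]

r = pearson_r(test_a, test_b)
shared_variance = r ** 2  # proportion of variance the two tests share
print(f"r = {r:.2f}, shared variance = {shared_variance:.0%}")
# prints: r = 0.77, shared variance = 60%
```

Here a correlation of about .77 means the two sets of scores share roughly 60% of their variance.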

Question 7

Q

what does reliability refer to?

Answer

A

Refers to how free the test is from measurement error: will you get the same, or a very close, score if you sit the test again? It’s about internal consistency & dependability

Question 8

Q

what does reliability depend on?

Answer

A

construction of test & environment administered in

Question 9

Q

why will there always be error?

Answer

A

There’s no perfect test or environment so will always be some error but we want to minimise it.

Question 10

Q

how is reliability usually reported?

Answer

A

As a correlation coefficient

Question 11

Q

why do different types of tests have different levels of reliability?

Answer

A

Reliability varies with what is being measured, e.g., well-constructed achievement tests may have reliability coefficients of .90, but personality tests are often much lower (.70) because the concept is abstract & potentially fluctuates

Question 12

Q

there are many ways to test reliability. What are some measures of reliability?

Answer

A

test-retest, alternative forms, internal consistency

Question 13

Q

what is test-retest reliability?

Answer

A

Give the test twice to the same group, usually a couple of weeks apart, then correlate the results

Question 14

Q

what does a higher correlation suggest in test-retest reliability?

Answer

A

The higher the correlation, the more reliable the test

Question 15

Q

why do fluctuations in test-retest results occur?

Answer

A

Results can fluctuate depending on things such as the time between sittings, forgetting information, learning more about the test content by studying in the interim, and being more familiar with the test format the second time around

Question 16

Q

When is test-retest reliability least likely to fluctuate?

Answer

A

when testing a stable construct (e.g. IQ)

Question 17

Q

what is ALTERNATIVE, PARALLEL OR EQUIVALENT FORMS RELIABILITY?

Answer

A

Making two or more versions of the same test and correlating the scores on them

Question 18

Q

what issues does ALTERNATIVE, PARALLEL OR EQUIVALENT FORMS RELIABILITY prevent?

Answer

A

Stops issues like people remembering or studying particular answers between test and retest

Question 19

Q

what is the difficulty of ALTERNATIVE, PARALLEL OR EQUIVALENT FORMS RELIABILITY?

Answer

A

Hard to make the tests equal in content & level of difficulty, or to ensure administration is exactly the same

Question 20

Q

what must test developers demonstrate in order to have good ALTERNATIVE, PARALLEL OR EQUIVALENT FORMS RELIABILITY?

Answer

A

Test developer must demonstrate the versions are truly parallel

Question 21

Q

what is reliability as internal consistency measuring?

Answer

A

Measures how test items relate to each other & the test as a whole

Question 22

Q

where does reliability as internal consistency look?

Answer

A

Looks within the test to measure reliability

• E.g., in a test to measure anxiety, respondents should answer items that tap aspects of anxiety in a similar way

Question 23

Q

what are the most common forms of internal consistency measure?

Answer

A

Most common forms of internal consistency (i.e., reliability) are split-half and Cronbach’s alpha

Question 24

Q

What is split-half reliability?

Answer

A

Administer one form of the test in a single sitting. Split the test in two and correlate the scores on the two halves

Question 25

Q

what are the issues of the split-half reliability?

Answer

A

The test may get harder as you go along, so the first half is not equal to the second. You may compare odd- and even-numbered questions, but the halves still may not be equal, and each half is shorter than the full test, which can decrease reliability

Question 26

Q

what equation can be used to compensate for shorter tests in split-half reliability?

Answer

A

Can use the Spearman-Brown equation to compensate for the shorter test: corrected r = 2r / (1 + r), where r is the correlation between the two halves
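
A minimal sketch of an odd/even split-half estimate with the Spearman-Brown correction, 2r / (1 + r), might look like the following. The response matrix and the helper names `pearson_r` and `spearman_brown` are hypothetical and only illustrate the arithmetic.

```python
import math

def pearson_r(x, y):
    """Pearson correlation coefficient for two equal-length lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def spearman_brown(r_half):
    """Estimate full-length reliability from a half-test correlation."""
    return 2 * r_half / (1 + r_half)

# Hypothetical data: rows are people, columns are items (1 = correct)
responses = [
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 0, 1, 0],
    [1, 0, 1, 1, 0, 0],
    [0, 1, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 0],
]

# Score the odd-numbered and even-numbered items separately
odd_totals = [sum(row[0::2]) for row in responses]   # items 1, 3, 5
even_totals = [sum(row[1::2]) for row in responses]  # items 2, 4, 6

r_half = pearson_r(odd_totals, even_totals)
r_full = spearman_brown(r_half)
print(f"half-test r = {r_half:.2f}, corrected = {r_full:.2f}")
```

For any positive half-test correlation below 1, the corrected value is higher than the raw one, which reflects the fact that the full test is twice as long as each half.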

Question 27

Q

what do CRONBACH’S COEFFICIENT ALPHA AND KUDER-RICHARDSON attempt to rate?

Answer

A

Try to rate internal consistency by estimating reliability of all possible split-half combinations by correlating each item with the total and averaging
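
As a rough sketch, Cronbach's alpha is usually computed from item and total-score variances rather than by literally averaging every split, using alpha = (k / (k - 1)) * (1 - sum of item variances / variance of total scores), where k is the number of items. The ratings below and the function name `cronbach_alpha` are made up for illustration.

```python
from statistics import variance

def cronbach_alpha(responses):
    """Cronbach's alpha for a list of rows (people) of item scores."""
    k = len(responses[0])  # number of items
    # Variance of each item across people
    item_vars = [variance([row[i] for row in responses]) for i in range(k)]
    # Variance of each person's total score
    total_var = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)

# Hypothetical 1-5 ratings: rows are people, columns are four items
ratings = [
    [4, 5, 4, 5],
    [3, 3, 4, 3],
    [2, 2, 1, 2],
    [5, 4, 5, 5],
    [1, 2, 2, 1],
]

alpha = cronbach_alpha(ratings)
print(f"alpha = {alpha:.2f}")
```

Because people who rate one item highly here tend to rate the others highly too, the total-score variance is large relative to the item variances and alpha comes out high.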

Question 28

Q

what is Kuder-Richardson used with?

Answer

A

Kuder-Richardson is used with forced-choice format tests

Question 29

Q

what is validity?

Answer

A

The extent to which all the available evidence supports that the test is actually measuring what it is intended to measure.

Question 30

Q

what is validity essential for?

Answer

A

It is a central requirement for a test; without it, the test items/tasks would not have meaning

Question 31

Q

what are the categories of content validity?

Answer

A

face validity

Question 32

Q

what are the categories of construct validity?

Answer

A

Criterion-related validity

* Predictive validity

Question 33

Q

what is content validity testing?

Answer

A

The content of a test reflects what the test is aiming to measure. It is sometimes enough on its own to establish validity

Question 34

Q

What is content validity not enough to ascertain test validity for?

Answer

A

Not enough to ascertain test validity beyond achievement-type tests, e.g., for tests of more abstract constructs

Question 35

Q

what is face validity?

Answer

A

It refers to how the test looks, which may be superficial.
o E.g., the items in the test look as though they ought to measure what you are aiming to measure
Some tests look valid but are not, while others don't look valid but are

Question 36

Q

what are constructs?

Answer

A

Constructs are theoretically driven ways of talking about certain features in the world

Question 37

Q

what does construct validity ask?

Answer

A

Construct validity asks how well a test can give a construct meaning
o e.g., anxiety – only exists insofar as the construct represents a set of behaviours, thoughts and feelings

Question 38

Q

what is construct irrelevance?

Answer

A

Scores are influenced by something other than what the test is supposed to measure e.g., anxiety or illness impacting exam score.

Question 39

Q

what does construct validity provide?

Answer

A

Scientific evidence demonstrating that the construct (model, concept, idea, notion) is actually being measured by the test.

Question 40

Q

when is construct validity most important?

Answer

A

Most important when developing tests to measure abstract constructs like depression, anxiety, happiness, love, empathy.

Question 41

Q

what is construct validity measured with?

Answer

A

Measured with statistical tools and methods…

Question 42

Q

what does criterion-related validity question?

Answer

A

What is the relationship between the criterion (another standard) for the test and the test scores?

Question 43

Q

what does concurrent validity refer to?

Answer

A

It refers to the extent to which the results of a particular test, or measurement, correspond to those of a previously established measurement for the same construct.

Question 44

Q

what does concurrent validity relate to?

Answer

A

Relates to what is known at this point in time

Question 45

Q

what is convergent validity?

Answer

A

The extent to which a test correlates with other measures of related constructs. E.g., if you think your test measures post-traumatic growth, you would expect it to be related to other instruments designed to measure positive post-trauma perception (e.g., SRG & Thriving)

Question 46

Q

why don't you want a perfect correlation with other tests when measuring convergent validity?

Answer

A

You don’t want a perfect correlation with other tests as that would make yours redundant.

Question 47

Q

how does convergent validity vary?

Answer

A

Depending on how close the theoretical connection is between the tests, the coefficient will vary.

Question 48

Q

what does discriminant validity use to determine validity?

Answer

A

As with convergent validity, discriminant validity uses other established tests to test construct validity.

Question 49

Q

when using discriminant validity, what are you looking for?

Answer

A

This time, you are looking for little or no correlation between your measure and a measure of a different construct (e.g., IES-R & PTGI).

Question 50

Q

what can significantly influence discriminant validity in a clinical context?

Answer

A

Knowing what you are dealing with may have significant clinical influence (e.g., Shakespeare-Finch & de Dasell, 2009).

Question 51

Q

when does one turn to predictive validity?

Answer

A

If no standard for comparison is available, interest turns to predictive validity

Question 52

Q

what does predictive validity test?

Answer

A

The relationship between the test scores now and a standard in the future.
o For example, OP scores and first year academic success – depends on age…
o OP is a better predictor for school leavers than learning strategies, which are more predictive for mature-age students
o Combining evidence often offers more predictive validity, e.g., OP, introversion and learning strategies in school leavers.

Question 53

Q

what is cross-cultural fairness?

Answer

A

The idea that ethnicity, gender, class, background, etc. can impact on results

Question 54

Q

why were there lots of laws passed in the US about tests being culturally fair?

Answer

A

So as not to repeat errors of the past that disadvantaged minority groups, e.g., employment tests must be able to demonstrate that the test is relevant to the job sought

Question 55

Q

what is practicality?

Answer

A

Choosing the right test to administer; time, format, cost etc

Question 56

Q

what are the elements of practicality?

Answer

A

time, cost, format, readability, ease of administration, scoring and interpretation

Question 57

Q

what is the time aspect of practicality?

Answer

A

Time taken to do test needs to reflect person targeted e.g., attention span, age, time available to do the test

Question 58

Q

what is the cost aspect of practicality?

Answer

A

Many tests cost money & some cost a lot. Balance the cost of the test against its reliability and the level of need to take it.

Question 59

Q

what is the format aspect of practicality?

Answer

A

Types of questions, font size, layout. Multiple choice (MC) lowers anxiety but is not always culturally fair (e.g., white males & MC)

Question 60

Q

what is the readability aspect of practicality?

Answer

A

Test items should be reviewed for readability – our school

Question 61

Q

what is the Ease of administration, scoring & interpretation aspect of practicality?

Answer

A

o Understanding the test manuals
o How many people are taking the test & does this impact on ease of administration?
o Level of training needed to administer, score & interpret the results
o How long will scoring take and how long will report take?
o Time needed to explain results to test taker
o Other materials like publisher’s preformatted sheets

Question 62

Q

how does one select and administer a good test?

Answer

A

Sourcing tests through articles, books, publisher’s catalogues, test library, online and examining the research about the test such as reliability data and validity

Question 63

Q

what are questions to consider when selecting and administering a good test?

Answer

A

There are 1000s of tests – how to choose?
What are the goals of the client or researcher?
Which tests can achieve that goal?

Question 64

Q

what do the major test categories include?

Answer

A

IQ, aptitude, achievement, behaviour, development, personality, neuropsychological, science, sensory perception, speech, hearing…

Answer 64

A

Raw scores have not been manipulated in any way

Answer 65

A

They mean nothing without putting them in context so might compare the score an individual gets with the ‘normed score’ (Norm data usually generated from hundreds and hundreds of samples)

Answer 66

A

o How did the client fare next to others from the same type of group who have taken the test before?
o Compare people from 2 groups e.g., using percentiles
o Compare results for one person on 2 or more tests – sometimes discrepancies between these scores indicate an impairment of sorts

Answer 67

A

What was the score and how often did it occur?

Answer 68

A

Listing scores in numerical order you can easily see if this person scored higher or lower than most on the distribution. Might list scores or groups of scores. Histograms & frequency polygons etc assist in getting an overview of the data. We can learn a lot about the data from its shape
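
A small sketch of a frequency distribution, assuming a made-up list of raw scores, shows how listing scores in order with their counts gives a quick picture of the shape of the data. The text bars stand in for a histogram.

```python
from collections import Counter

# Hypothetical raw scores for ten test takers
scores = [7, 5, 8, 7, 6, 7, 9, 5, 6, 7]

# Count how often each score occurred
frequencies = Counter(scores)

# List scores in numerical order with a simple text bar per score
for score in sorted(frequencies):
    count = frequencies[score]
    print(f"{score:>2} | {'#' * count} ({count})")
```

In this example most scores pile up at 7, so a person scoring 9 is clearly above most of the distribution.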

Answer 69

A

tall distribution

Answer 70

A

flat distribution

Answer 71

A

tail to the right

Answer 72

A

tail to the left

Answer 73

A

percentiles

Answer 74

A

the score, less the mean, divided by the standard deviation and is therefore sensitive to all components of the variance equations including sample size.
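
The calculation described here (score minus mean, divided by standard deviation) can be sketched as below. The norm group is invented and far smaller than a real norm sample, and the function name `z_score` is an assumption for illustration.

```python
from statistics import mean, stdev

def z_score(score, group_scores):
    """Standardise one raw score against a group's mean and SD."""
    # stdev() divides by n - 1, so the result depends on sample size
    return (score - mean(group_scores)) / stdev(group_scores)

# Hypothetical norm group
norm_group = [40, 45, 50, 55, 60]

z = z_score(60, norm_group)
print(f"z = {z:.2f}")  # prints: z = 1.26
```

A score equal to the group mean gives z = 0, and positive or negative values show how many standard deviations the score sits above or below the mean.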

Answer 75

A

personality testing

Answer 76

A

T = z(SD)+M
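
To illustrate the conversion formula, the sketch below rescales a z-score onto a new scale with a chosen mean (M) and standard deviation (SD). T-scores conventionally use M = 50 and SD = 10; the second scale (M = 100, SD = 15) is an assumed example of another common convention, and the function name `rescale` is hypothetical.

```python
def rescale(z, new_mean, new_sd):
    """Convert a z-score to a scale with the given mean and SD."""
    return z * new_sd + new_mean

z = 1.5  # hypothetical z-score

t_score = rescale(z, new_mean=50, new_sd=10)    # T-score convention
iq_style = rescale(z, new_mean=100, new_sd=15)  # assumed deviation-IQ style scale

print(t_score, iq_style)  # prints: 65.0 122.5
```

The person's relative position is unchanged by the conversion; only the units it is reported in differ.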

Answer 77

A

o	Deductive
o	Positivist 
o	Realist
o	Objective
o	Reductionist
o	Generalisation
o	Numbers

Answer 78

A

o	Inductive
o	Interpretive
o	Constructivist
o	Subjective
o	Holistic
o	Uniqueness
o	Words

Answer 79

A

quantitative measures and inform quantitative studies