Literatuur week 8 Interpretation test results Flashcards

1
Q

A person’s raw score had little meaning without which 2 things?

A

1) A comparison to a normative sample
2) A method for interpreting the meaning of the comparison

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What do we call a measure that can be used to compare values from different data sets?

A

Relative standing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is it called when interpreting and communicating test performance depends on having an appropriate comparative sample and a common ‘language’ of descriptions?

A

Rule of thumb

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a positively skewed distribution?

A

More scores fall below the mean compared to above the mean (vanaf links aflopend)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a negatively skewed distribution?

A

More scores fall above the mean compared to below the mean (vanaf links oplopend)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What happens when scores from a normative sample are not normally distributed?

A

The mean and median are not identical and z scores will not accurately translate into sample percentile rank values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

A …. sample size will produce a …. normal distribution, but only if the underlying characteristic in the population distribution obtained is normal

A

Larger, more

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When can a truncated distribution occur?

A

1) When scores are restricted at one side of the distribution
2) When specific subgroups are purposefully excluded from inclusion in the normative sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

A truncated distribution of scores can lead to?

A

1) Identification of normal individuals as low functioning
2) Difficulty estimating the severity of impaired performance
3) An increase in number of persons identified as impaired

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

When is it useful to compare scores between tests?

A

1) The raw score distributions for tests that are being compared are approximately normal in the population
2) The scores that are being compared are derived from similar samples

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

When comparing test scores, it is important to consider the … of two measures and their ….

A

Reliability, intercorrelation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

The relationship between normative scores and percentiles are lineair/ non-lineair

A

Non-lineair

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is defined as the presence of truncated tails in the context of limitations in range of item difficulty?

A

Ceiling and floor effects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does a high floor in scores mean?

A

When a large proportion of the examinees obtain raw scores at or near the lowest possible score

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What indicates a high floor in test scores?

A

That the test lacks a sufficient number and range of easier items

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Floor and ceiling effects can lead to?

A

Misinterpretations results

17
Q

What does extrapolation entail?

A

The action of estimating or concluding something by assuming that existing trends will continue. When norms fall short in terms of rang this technique is often used.

18
Q

Comparison of performance across tests is affected by: (5)

A

1) Measurement error
2) Score magnitude
3) Extreme scores
4) Ceiling and floor effects
5) Extrapolation/ interpolation of derived scores

19
Q

It is important to carefully consider how to interpret isolated low scores. The likelihood of obtaining low scores increases when?

A

1) the number of tests increases
2) the cut off for defining low scores becomes more open-minded
3) with lower levels of baseline cognitive functioning

20
Q

The degree of agreement between different
people that are observing or assessing the same thing = (Inter-rater reliability/ Test-retest reliability/ Parallel-forms reliability/ Internal consistency reliability)

A

Inter-rater reliability

21
Q

Measure the consistency of the result when
you repeat the measure the same thing at a different point of time = (Inter-rater reliability/ Test-retest reliability/ Parallel-forms reliability/ Internal consistency reliability)

A

Test-retest reliability

22
Q

Measures the correlation between two
equivalent versions of a test. This can help to avoid practice effects, but the versions should be equivalent = (Inter-rater reliability/ Test-retest reliability/ Parallel-forms reliability/ Internal consistency reliability)

A

Parallel-forms reliability

23
Q

The correlation between items
within a test that are mean to measure the same construct = (Inter-rater reliability/ Test-retest reliability/ Parallel-forms reliability/ Internal consistency reliability)

A

Internal consistency reliability

24
Q

What is validity?

A

Validity is the degree to which a test is measuring what is was intended to measure

25
Q

What is sensitivity?

A

Sensitivity is the probability of a positive test, given that the person is affected

26
Q

What is specitifity?

A

The probability of a negative test, given that a person is healthy

27
Q

What does a p-value NOT measure?

A
  1. Does not measure the probability that the studied hypothesis is true, or the probability that the data were produced by random chance alone.
  2. They do not provide a good measure of evidence regarding a model of hypothesis
  3. They do not measure the size of an effect or the importance of a result
28
Q

What is circular analysis?

A

Circular analysis is any form of analysis that retrospectively selects features of the data
to characterise the dependent variables, resulting in a distortion of the resulting statistical test.

= based on data that were selected for showing the effect of interest or a related effect

29
Q

What is p-hacking?

A

the misreporting of true effect sizes in published studies. It occurs when researchers try out several statistical analyses and then selectively report those that produce significant results.

30
Q

What is a spurious correlation?

A

Occurs when two factors appear casually related to one another but are not. Spurious correlations most commonly arise if one or several outliers are present for one of the two variables

31
Q

Is the test fully representative of what it aims to measure, refers to which validity?

A

Content validity

32
Q

Evaluates how accurately a test measures the outcome it was designed to measure, for now or in the future, refers to?

A

Criterion related validity

33
Q

Which two types of criterion-related vaidity are there?

A

Concurrent validity and predictive validity

34
Q

Does the test measure the concept that it is intended to measure, refers to which validity?

A

Construct validity

35
Q

What does a p-value not measure?

A
  1. Does not measure the probability that the studied hypothesis is true, or the probability that the data were
    produced by random chance alone.
  2. They do not provide a good measure of evidence regarding a model of hypothesis
  3. They do not measure the size of an effect or the importance of a result