Assessment and Statistics Flashcards
All key terms from the Assessment in Counselling textbook.
Accessibility
The notion that all examinees should have an unobstructed opportunity to demonstrate their capabilities on the construct(s) being assessed.
Accommodations
An action taken in response to an individual with a disability in which there is a departure from the standard testing protocol in an attempt to adjust for the disability.
Acculturation
A process of change and adaptation that individuals undergo as a result of contact with another culture.
Achievement test
An assessment in which the person has “achieved” knowledge, information, or skills through instruction, training, or experience. Achievement tests measure acquired knowledge and do not make any predictions about the future.
Adaptation
A change in an instrument's original design or administration intended to increase its accessibility for particular individuals (e.g., those with visual impairments or limited English proficiency).
Affective instrument
An instrument that assesses interest, attitudes, values, motives, temperaments, and the noncognitive aspects of personality.
Age or grade equivalent scores
Scores that compare an individual with other individuals of the same age or grade, calculated by item response theory or by using a norm-referenced approach.
Alternate or parallel forms
Two forms of an instrument that can be correlated, resulting in an estimate of reliability.
Analogue observation
In this type of observation, the counselor creates a simulated environment that is reflective of the client’s natural environment.
Appraisal
Another term for assessment.
Aptitude test
A test that provides a prediction about the individual’s future performance or ability to learn based on his or her performance on the test. Aptitude tests often predict either future academic or vocational/ career performance.
Assessment
A procedure for gathering client information that is used to facilitate clinical decisions, provide clients with information, or for evaluative purposes.
Authentic Assessment
Performance assessments that involve the performance of “real” or authentic applications rather than proxies or estimators of actual learning.
Behavioral Assessment
An assessment method in which the focus is typically on observing and recording the precise behaviors of the client.
Test Bias
A term that refers to the degree to which construct-irrelevant factors systematically affect a specific group’s performance.
Cluster Sampling
A technique that involves using existing units or clusters rather than selecting individuals.
Coaching
Longer-term training or practice on questions that are the same as, or similar to, the items on a test.
Coefficient of Determination
This statistic estimates the percent of shared variance between two sets of variables that have been correlated. The coefficient of determination (r2) is calculated by squaring the correlation coefficient.
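As a quick Python illustration (the correlation value here is hypothetical):

```python
# Suppose an aptitude test correlates r = .70 with later job performance (hypothetical).
r = 0.70
r_squared = r ** 2   # 0.49: the two variables share 49% of their variance
```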
Computer Adaptive Testing (CAT)
A testing format in which the computer adapts each subsequent question based on the examinee’s previous responses.
Concurrent Validity
A type of validation evidence in which there is no delay between the time the instrument is administered and the time the criterion information is gathered.
Conditional Standard Errors of Measurement
Type of standard error of measurement that takes into account the different score levels.
Construct Underrepresentation
The degree to which the instrument is unable to capture significant aspects of the construct.
Construct Irrelevance
The degree to which scores or results are affected by factors that are extraneous to the instrument’s intended purpose.
Construct Validity
One of the three traditional forms of validity that is broader than either content or criterion-related validity. Many experts in assessment now argue that evidence of construct validity, which includes the other traditional forms of validity, applies in all types of psychological and educational assessment. This type of validation involves the gradual accumulation of evidence. Evidence of construct validity is concerned with the extent to which the instrument measures some psychological trait or construct and how the results can be interpreted.
Content-Related Validity
One of the three traditional forms of validity in which the focus was on whether the instrument’s content adequately represented the domain being assessed. Evidence of content-related validity is often reflected in the steps the authors used in developing the instrument.
Convergent Evidence
Validation evidence that indicates the measure is positively related to other measures of the same construct.
Correlation Coefficient
A statistic that provides an indication of the degree to which two sets of scores are related. A correlation coefficient (r) can range from -1.00 to +1.00 and, thus, provides an indicator of both the strength and direction of the relationship. A correlation of +1.00 represents a perfect positive relationship; a correlation of -1.00 represents a perfect negative or inverse relationship. A correlation coefficient of .00 indicates the absence of a relationship.
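A minimal Python sketch of computing r from raw scores; the two score lists are hypothetical:

```python
def pearson_r(x, y):
    """Pearson correlation coefficient (r) between two sets of scores."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

# Hypothetical scores from two instruments given to the same five clients
test_a = [50, 60, 70, 80, 90]
test_b = [55, 65, 68, 82, 88]
r = pearson_r(test_a, test_b)  # a strong positive relationship, close to +1.00
```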
Correlation Method
A statistical tool often used in providing validation evidence related to an instrument’s relationship with other variables.
Criterion-Referenced Instrument
Instruments designed to compare an individual’s performance to a stated criterion or standard. Often criterion-referenced instruments provide information on specific knowledge or skills and on whether the individual has “mastered” that knowledge or skill. The focus is on what the person knows rather than how he or she compares to other people.
Criterion-Related Validity
One of the three traditional types of validity in which the focus is the extent to which the instrument confirms (concurrent validity) or predicts to (predictive validity) a criterion measure.
Cronbach’s Alpha or Coefficient Alpha
One of the methods of estimating reliability through examination of the internal consistency of the instrument. This method is appropriate when the instrument is not dichotomously scored, such as an instrument that uses a Likert scale.
Decision Theory
A method that examines the relationship between an instrument and a criterion or predictor variable, which usually involves an expectancy table. Expectancy tables frequently are used to determine cutoff scores or to provide clients with information regarding the probability of a certain performance on the criterion that is based on scores on the assessment.
Differential Item Functioning (DIF)
A set of statistical methods for investigating item bias that examines differences in performance among individuals who are equal in ability but are from different groups (e.g., different ethnic groups).
Discriminant Evidence
Validation evidence that indicates the measure is not related to measures of different psychological constructs.
Domain Sampling Theory
Another term for generalizability theory.
Duty to Warn
The requirement or permission for mental health practitioners to disclose information about a client when that client is going to harm someone else.
Event Recording
One of the methods used in behavioral assessment, where the counselor records the number of times a target behavior or behaviors occur during a specified time period.
Expectancy Table
A method of providing validity evidence that involves charting performance on the criterion based on the instrument’s score. It is often used to predict who would be expected to fall in a certain criterion category (e.g., who is likely to succeed in graduate school) and to determine cutoff scores.
Factor Analysis
A term that covers various statistical techniques that are used to study the patterns of relationship among variables with the goal of explaining the common underlying dimensions (factors). In assessment, factor analysis is often used to examine if the intended internal structure of an instrument is reflected mathematically. For example, a researcher would analyze whether all items on each subscale “load” with the other items on the appropriate subscale (factor) and not with another factor.
False Negative
In decision theory, a term used to describe when the assessment procedure is incorrect in predicting a negative outcome on the criterion.
False Positive
In decision theory, a term used to describe when the assessment procedure is incorrect in predicting a positive outcome on the criterion.
Formative Evaluation
A continuous or intermediate evaluation typically performed to examine the counseling services process.
Frequency Distribution
A chart that summarizes the scores on an instrument and the frequency or number of people receiving that score. Scores are often grouped into intervals to provide an easy-to-understand chart that summarizes overall performance.
Frequency Polygon
A graphic representation of the frequency of scores. The number or frequency of individuals receiving a score or falling within an interval of scores is plotted with points that are connected by straight lines.
General Ability Test
Another term for intelligence test.
Generalizability Theory
An alternative model to the true score model of reliability. The focus of this theory is on estimating the extent to which specific sources of variation under defined conditions influence scores on an instrument.
Grade Equivalent Norms
Norms that are typically used in achievement tests and provide scores in terms of grade equivalents. In some instruments, grade equivalent scores are not validated on each specific grade but are extrapolated scores based on group performance at each grade level.
High-Stakes Testing
A type of testing in which the outcome has significant consequences (e.g., a high school graduation examination).
Histogram
A graphic representation of the frequency of scores in which columns are utilized.
Hypomanic Episode
Characterized by symptoms similar to those of a manic episode, except that it lasts at least four consecutive days rather than a week.
Individualized Education Plan (IEP)
An educational plan that is developed for each student who is receiving special education and related services. The plan is developed by a team of educators and the child’s parents or guardians.
Instrument
An assessment tool that typically is not related to grading. In this book, instruments include tests, scales, checklists, and inventories.
Intelligence Tests
Instruments that are designed to measure the mental capabilities of an individual. These assessments are also referred to as general ability tests.
Intercepts
In regression, the constant in the regression equation; graphically, the point where the regression line crosses the y-axis.
Interrater Reliability
A measure used to examine how consistently different raters evaluate the answers to the items on the instrument.
Interval Recording
An assessment method that focuses on whether specific behavior(s) occur within a certain interval of time. It is also referred to as time sampling, interval sampling, or interval time sampling.
Interval Scale
A type of measurement scale in which the units are in equal intervals. Many of the statistics used to evaluate an instrument’s psychometric qualities require an interval scale.
Item Analysis
An analysis that focuses on examining and evaluating each item within an assessment.
Item Difficulty
An item analysis method in which the difficulty of individual items is determined. The most common item difficulty index (p) is the percentage of people who get the item correct.
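The item difficulty index p can be computed directly from a matrix of scored responses; a minimal Python sketch with hypothetical data:

```python
# Hypothetical item responses: each row is one examinee, 1 = correct, 0 = incorrect
responses = [
    [1, 1, 0, 1],
    [1, 0, 0, 1],
    [1, 1, 1, 1],
    [0, 1, 0, 1],
]
n_people = len(responses)
n_items = len(responses[0])
# p for each item: the proportion of examinees answering it correctly
p_values = [sum(person[i] for person in responses) / n_people
            for i in range(n_items)]
# Higher p means an easier item; here the last item was answered correctly by everyone.
```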
Item Discrimination
A form of item analysis that examines the degree to which an individual item discriminates on some criterion. For example, in achievement testing, item discrimination would indicate whether the item discriminates between people who know the information and people who do not.
Item Response Theory (IRT)
A measurement approach in which the focus is on each item and on establishing items that measure the individual’s ability or level of a latent trait. This approach involves examining the item characteristic function and the calibration of each individual item.
Kuder-Richardson Formulas
Two formulas (KR 20 and KR 21) that were developed to estimate reliability. Both of these methods are measures of internal consistency. KR 20 has been shown to approximate the average of all possible split-half coefficients. KR 21 is easier to compute, but the items on the instrument must be homogeneous.
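KR 20 can be computed from a matrix of dichotomously scored (0/1) responses; a minimal Python sketch with hypothetical data, using the population variance of total scores:

```python
def kr20(responses):
    """KR 20 internal-consistency estimate for dichotomous (0/1) items."""
    k = len(responses[0])          # number of items
    n = len(responses)             # number of examinees
    p = [sum(row[i] for row in responses) / n for i in range(k)]
    pq = sum(pi * (1 - pi) for pi in p)          # sum of item variances
    totals = [sum(row) for row in responses]     # each examinee's total score
    mean_t = sum(totals) / n
    var_t = sum((t - mean_t) ** 2 for t in totals) / n   # population variance
    return (k / (k - 1)) * (1 - pq / var_t)

# Hypothetical response matrix: rows are examinees, columns are items
data = [[1, 1, 1, 1], [1, 1, 1, 0], [1, 1, 0, 0], [1, 0, 0, 0], [0, 0, 0, 0]]
reliability_est = kr20(data)   # 0.80 for this data set
```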
Latent Trait Theory
Another term for item response theory.
Major Depressive Episode
A period in which the client is in a depressed mood or has lost interest or pleasure in nearly all activities for at least two weeks.
Manic Episode
A distinct period of abnormally elevated, expansive, or irritable moods and abnormally increased goal-directed activity or energy that lasts at least one week and is present most of the day. It is characterized by a feeling of becoming so driven that it causes marked impairments in occupational functioning, social activities, or relationships.
Mean
The arithmetic average of the scores. It is calculated by adding scores together and dividing by the number in the group.
Median
The middle score, with 50% of the scores falling below it and 50% of the scores falling above it.
Mental Disorder
A syndrome characterized by clinically significant disturbance in an individual’s cognition, emotion regulation, or behavior that reflects a dysfunction in the psychological, biological, or developmental processes underlying mental functioning.
Mental Measurement Yearbooks (MMY)
A series of yearbooks that contain critiques of many of the commercially available psychological, educational, and career instruments.
Mental Status Examination
An examination used to describe a client’s level of functioning and self-presentation. It is generally conducted during the initial session or intake interview and is a statement of how a person appears, functions, and behaves during the initial session.
Mode
The most frequent score in a distribution.
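The three measures of central tendency defined in the cards above (mean, median, and mode) are available in Python's standard library; the score list is hypothetical:

```python
from statistics import mean, median, mode

scores = [70, 75, 80, 80, 85, 90, 100]  # hypothetical test scores
m = mean(scores)     # arithmetic average of the scores
md = median(scores)  # middle score: 80 (the 4th of 7 ordered scores)
mo = mode(scores)    # most frequent score: 80 (appears twice)
```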
Modification
Changes to an instrument that influence the construct being measured; therefore, the scores do not retain the same meaning as those of the original instrument.
Multitrait-Multimethod Matrix
A matrix that includes information on correlations between the measure and traits that it should be related to and traits that it should not theoretically be related to. The matrix also includes correlations between the measure of interest and other same-method measures and measures that use different assessment methods.
Narrative Recordings
Recordings that typically do not involve any quantitative recording procedures. They can be completed by the counselor, parents, other family members, teachers, or the client.
Naturalistic Observation
A type of observation in which the counselor observes the client in a typical environment and the counselor does not manipulate any aspect of the environment during the observation.
Negatively Skewed Distribution
A type of distribution in which the majority of scores are on the higher end of the distribution (tail to the left).
Neuropsychological Assessment
An assessment of cognitive impairments, specifically the behavioral effects of possible brain injury or damage.
Nominal Scale
A scale of measurement characterized by assigning numbers to name or representing mutually exclusive groups (e.g., 1 = male, 2 = female).
Normal Curve
A bell-shaped, symmetrical, and unimodal curve. The majority of cases are concentrated close to the mean, with 68% of the individual scores falling between one standard deviation below the mean and one standard deviation above the mean.
Normal Distribution
A distribution of scores with certain specific characteristics (e.g., approximately 68% of the sample falls between one standard deviation below the mean and one standard deviation above the mean).
Norm-Referenced Instruments
Instruments in which the interpretation of performance is based on the comparison of an individual’s performance with that of a specified group of people.
Observation
The most common method counselors use to assess personality, in which they observe clients from the first meeting and begin to make clinical judgments based on those initial observations.
Ordinal Scale
Type of measurement scale in which the degree of magnitude is indicated by the rank ordering of the data.
Panic Attack
An abrupt surge of fear or intense discomfort that reaches its peak quickly, and clients experience symptoms such as palpitations, sweating, trembling, choking, chest pain, nausea, and fear of losing control.
Percentile Rank or Percentile Scores
A ranking that provides an indication of the percent of scores that fall at or below a given score. For example: “Mary’s percentile of 68 means that if there were 100 people who had taken this instrument, 68 of them would have a score at or below Mary’s.”
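The "at or below" counting behind a percentile rank can be sketched in a few lines of Python; the norm group here is hypothetical:

```python
def percentile_rank(norm_scores, score):
    """Percent of norm-group scores falling at or below the given score."""
    at_or_below = sum(1 for s in norm_scores if s <= score)
    return 100 * at_or_below / len(norm_scores)

# Hypothetical norm group of 100 examinees scoring 1 through 100
norm_group = list(range(1, 101))
rank = percentile_rank(norm_group, 68)  # 68.0: at or above 68% of the group
```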
Performance Assessments
An alternate method of assessing individuals, other than through multiple-choice types of items, in which the focus is on evaluating the performance of tasks or activities.
Performance Tests
Tests that require the manipulation of objects with minimal verbal influences.
Positively Skewed Distribution
A type of distribution in which the majority of scores are at the lower end of the range of scores. (Tail to the right).
Predictive Validity
A type of validation evidence in which there is a delay between the time the instrument is administered and the time the criterion information is gathered.
Projective Techniques
A type of personality assessment that provides the client with a relatively ambiguous stimulus, thus encouraging a nonstructured response. The assumption underlying these techniques is that the individual will project his or her personality into the response. The interpretation of projective techniques is subjective and requires extensive training in the technique.
Psychological Report
A summary of a client’s assessment results that is often geared toward other professionals. Frequently written by a psychologist, a typical report includes background information, behavioral observations, test results and interpretations, recommendations, and a summary.
Psychological Test
An objective and standardized measure of a sample of behavior.
Psychological Interview
A detailed interview that gathers background information and information about the client’s current psychological and social situation.
Qualitative Data
Descriptive information sought from the evaluation study, with the intent to produce “rich” interpretative data.
Quantitative Data
Information that is more numerical in nature where the intent is to quantify the results.
Randomized Clinical Trials Design
The gold standard in intervention research. Adopted from the medical model, where patients are randomly assigned to receive the medication or the placebo, in counseling evaluation studies, clients are randomly assigned to either the intervention group or the placebo/control group.
Ratings Recording
A category of behavioral assessment in which rating scales are completed, often by the counselor or some other observer (e.g., parents or teachers).
Ratio Scale
A scale of measurement that has both interval data and a meaningful zero (e.g., weight, height). Because ratio scales have a meaningful zero, ratio interpretations can be made.
Raw Scores
Raw scores are the unadjusted scores on an instrument before they are transformed into standard scores. An example of a raw score is the number of answers an individual gets correct on an achievement test.
Reactivity
The possible changes that may occur in clients’ behavior, thoughts, or performance as a result of being observed, assessed, or evaluated.
Regression
A commonly used statistical technique in which the researcher examines whether independent variables predict a criterion or dependent variable. Regression is used to determine if there is a linear relationship among the variables or a line of best fit.
Regression Equation
An equation that describes the linear relationship between the predictor variable(s) and the criterion variable. These equations are often used to determine if it is possible to predict the criterion based on the instrument’s scores.
Reliability
Concerns the degree to which a measure or a score is free of random error. In classical test theory, it is the ratio of true variance to observed variance.
Reliability Generalization
A meta-analytic method that combines estimates of reliability across studies in order to calculate an estimate based on multiple indicators of reliability.
Response-to-intervention
The replacement for the discrepancy approach to diagnosing a learning disability, the focus of which is to use data (e.g., achievement tests, classroom activities) to identify students at risk for poor learning outcomes.
Score
A number or letter that is the product of a client taking an assessment. A score cannot be interpreted without additional information about the assessment.
Self-Monitoring
The practice of observing and recording one’s own behavior.
Semistructured Interview
An interview that is a combination of a structured and unstructured format in which there is a set of established questions and the clinician can also ask additional questions for elaboration or to gather additional information.
Sequential Processing
The use of mental abilities to arrange stimuli in sequential, or serial, order so that the information can be processed.
Simple Random Sample
A type of sample in which every individual in the population has an equal chance of being selected.
Simultaneous Processing
The use of mental abilities to integrate information in a unified manner, with the individual integrating fragments of information in order to comprehend the whole.
Skewed
The distribution is not symmetrical and the majority of people either scored in the low range or the high range, as compared with a normal distribution in which the majority scored in the middle.
Skewed Distributions
Distributions in which the majority of scores are either high or low. Skewed distributions are asymmetrical, and the mean, mode, and median are different. In positively skewed distributions, the majority of scores are on the lower end of the distribution; in negatively skewed distributions, the majority of scores are on the upper end of the distribution.
Slope Bias
A term referring to a situation in which a test yields significantly different validity coefficients for different groups, resulting in different regression lines.
Spearman-Brown Formula
A formula for correcting a split-half reliability coefficient that estimates what the coefficient would be if the original number of items were used.
Spearman’s Model
A two-factor theory of intelligence that postulates everyone has a general ability factor influencing their performance on intellectual tasks, and also specific factors correlated to g that influence performance in specific areas.
Split-half Reliability
One of the internal consistency measures of reliability in which the instrument is administered once and then split into two halves. The scores on the two halves are then correlated to provide an estimate of reliability. Often the split-half reliability coefficients are corrected using the Spearman-Brown formula. This formula adjusts the coefficient for using only half of the total number of items to provide an estimate of what the correlation coefficient would be if the original number of items was used.
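The Spearman-Brown correction for a split-half coefficient is r_full = 2r / (1 + r); a minimal Python sketch with a hypothetical half-test correlation:

```python
def spearman_brown(r_half):
    """Correct a split-half correlation to estimate full-length reliability."""
    return (2 * r_half) / (1 + r_half)

# Hypothetical correlation between the two halves of an instrument
r_half = 0.60
r_full = spearman_brown(r_half)  # about 0.75: the full test is more reliable
```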
Standard Deviation
The most common statistic used to describe the variability of a set of measurements. It is the square root of the variance.
Standard Error of Difference
A measure used by a counselor to examine the difference between two scores and determine if there is a significant difference.
Standard Error of Estimate
A numerical result that indicates the margin of expected error in the individual’s predicted criterion score as a result of imperfect validity.
Standard Error of Measurement
A statistic that indicates the amount of variation to expect in an individual’s scores if he or she took the instrument repeated times. Counselors can use the standard error of measurement to determine the range within which an individual’s true score is expected to fall 68%, 95%, or 99.5% of the time.
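The standard error of measurement is commonly computed as SD x sqrt(1 - reliability); a minimal Python sketch with hypothetical values:

```python
import math

# Hypothetical instrument: standard deviation 15, reliability estimate .91
sd = 15
reliability = 0.91
sem = sd * math.sqrt(1 - reliability)          # about 4.5

observed = 100
band_68 = (observed - sem, observed + sem)     # contains the true score ~68% of the time
band_95 = (observed - 1.96 * sem, observed + 1.96 * sem)
```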
Stratified Sample
A type of sample in which individuals are selected for the norming group based on certain demographic characteristics.
Structured Interview
An interview that is conducted using a predetermined set of questions that is asked in the same manner and sequence for every client.
Structured Personality Instruments
Formalized assessments in which clients respond to a fixed set of questions or items.
Summative Evaluation
A cumulative evaluation of services that are typically completed at the endpoint of the service. These types of evaluation are designed to provide an overall indication of the effectiveness of the services.
Target Behavior
The behavior that is being assessed in behavior assessment.
Test
An individual instrument in which the focus is on evaluation.
Testing
A process of giving clients tests and/or instruments.
Test-Retest Reliability
A method in which the reliability coefficient is obtained by correlating a group’s performance on the first administration of an instrument with the same group’s performance on a second administration of that same instrument.
Test Sophistication
A term applied to an individual’s level of knowledge in test-taking skills. It is not related to knowledge of the content but rather to the format of the tests and the skills required for maneuvering through that format.
Universal Design
An approach to instrument design with the goal of maximizing accessibility for all intended examinees.
Unstructured Interview
An interview in which the clinician gears the questions toward each individual client and there is no established set of questions.
Validity Coefficient
The correlation between scores on an instrument and the criterion measure.
Validity Generalization
Term applied to findings indicating that the validity of cognitive ability tests can be generalized and that cognitive ability is highly related to job performance.
Variance
The average of the squared deviation from the mean. It is a measure of variability and its square root is the standard deviation of the set of measurements.
Z Scores
A standard score that always has a mean of 0 and a standard deviation of 1.
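Converting raw scores to z scores, z = (raw - mean) / SD, can be sketched in a few lines of Python; the raw scores are hypothetical:

```python
import math

raw = [60, 70, 80, 90, 100]    # hypothetical raw scores
m = sum(raw) / len(raw)
sd = math.sqrt(sum((s - m) ** 2 for s in raw) / len(raw))  # population SD
z = [(s - m) / sd for s in raw]
# The resulting z distribution has a mean of 0 and a standard deviation of 1.
```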