MIDTERM Flashcards by Antoinette Cruz

What is Psychological Assessment and Testing all about?

To measure behavior (overt and covert)
To describe and predict behavior and personality (traits, states, personality types, attitudes, interests, values, etc.)
To determine signs and symptoms of dysfunctionality (for case formulation, diagnosis, and basis for intervention/plan for action)

Gathering and integration of psychology-related data for the purpose of making a psychological evaluation, accomplished through the use of tools (tests, interviews, case studies, behavioral observations) and specially designed apparatuses and measurement procedures.

Psychological Assessment

Process of measuring psychology-related variables by means of devices or procedures designed to obtain a sample of behavior.

Psychological Testing

A standardized measuring device or procedure used to describe the ability, knowledge, skills or attitude of the individual.

Psychological Test(s)

The process of quantifying the amount or number of a particular occurrence of an event, situation, phenomenon, object, or person.

Measurement

The process of synthesizing the results of measurement with reference to some norms and standards.

Assessment

Tools of Psychological Assessment

1) Psychological Tests
2) Interviews
3) Portfolio Assessment
4) Case-History Data
5) Behavioral Observation
6) Role Play Tests
7) Computers as Tools

The process of judging the worth of any occurrence of an event, situation, phenomenon, object, or person, which concludes with a particular decision.

Evaluation

A tool of assessment in which information is gathered through direct, reciprocal communication.

Ideally conducted face to face
By telephone: pitch and pauses are signs of emotion

Interviews

Three types of interviews

Structured
Semi-structured
Unstructured

Method of gathering information through direct communication involves:

1) Reciprocal exchange
2) Take note of verbal and non-verbal actions—facial expressions, eye contact and general reaction to the demand of the interview

A type of work sample used as an assessment tool: a sample of one's ability and accomplishment.

In education, writing samples can serve as tools for hiring instructors.

Portfolio Assessment

Records, transcripts, and other accounts in written, pictorial, or other form, in any media, that preserve archival information, official and informal accounts, and other data and items relevant to the assessee

Files or excerpts from files maintained at institutions and agencies
Letters and other written correspondence, photos and family albums, newspaper and magazine clippings, home videos, movies, and audiotapes
Sheds light on an individual's past and current adjustment, as well as on the events and circumstances that may have contributed to any changes in adjustment

Case-History Data

Monitoring the actions of others or oneself by visual or electronic means while recording quantitative or qualitative information regarding those actions. It can be used as a diagnostic aid (inpatient facilities, behavioral research labs, classrooms).

Behavioral Observation

Tool of assessment wherein assessees are directed to act as if they were in a particular situation; used when real settings are too impractical.

With substance abusers, role play can be used as both a tool of assessment and a measure of outcome

Role Play Tests

As test administrators, computers do much more than replace the “equipment” that was so widely used in the past (a number 2 pencil).
Computers can serve as test administrators (online or off) and as highly efficient test scorers. Within seconds they can derive not only test scores but patterns of test scores.

Computers as Tools

Types of Tests Based on the Number of Examinees

1) Individual Test
2) Group Test

The examiner/test administrator gives the test to only one person

Individual Test

The examiner/test administrator gives the test to more than one person at a time

Group Test

Tests Based on the Type of Behavior They Measure

1) Ability Test
a) Achievement Test
b) Aptitude Test
c) Intelligence Test
2) Personality Test
3) Interest Test

Cognitive performance-based measures
Measures what people can do
Pertains to capacity or potential; items are scored according to speed, accuracy or both
Variable measurement
There are right and wrong answers
IQ, Aptitude, Achievement

Ability Test

Measures previous learning

Achievement Test

Measures potential for learning or acquiring a specific skill

Aptitude Test

General potential to solve problems, adapt to changing circumstances, think abstractly, and profit from experience.

Intelligence Test

It has to do with an individual's covert and overt dispositions, such as a person's tendency to act in a certain way or respond in a certain way in a given situation.

Personality Test

Provides a self-report statement to which the person responds "True" or "False", "Yes" or "No".

Structured Personality Test

Provides an ambiguous test stimulus

Projective Personality Test

Originally developed for vocational guidance but later found its way to employee selection and career development

Interest Test

Three-Tier System of Psychological Tests

1) Level A
2) Level B
3) Level C

These tests can be administered, scored, and interpreted by responsible non-psychologists who have carefully read the manual and are familiar with the overall purpose of testing. - Educational achievement tests fall into this category - Examples: Achievement tests and other specialized (skill-based) aptitude tests

Level A

These tests require technical knowledge of test construction and use, as well as appropriate advanced coursework in psychology and related fields - Examples: group intelligence tests and personality tests

Level B

These tests require an advanced degree in psychology or a license as a psychologist, plus advanced training or supervised experience in the particular test - Examples: Projective tests, Individual Intelligence tests, Diagnostic tests

Level C

Testing was instituted as a means of selecting who, of the many applicants, would obtain government jobs

Chinese Civilization

Tests were used to measure intelligence and physical skills

Greek Civilization

These universities relied on formal exams in conferring degrees and honors

European Universities

Believed that despite our similarities, no two humans are exactly alike. Some of these individual differences are more “adaptive” than others, and these differences lead to more complex, intelligent organisms over time.

Charles Darwin

He established the testing movement; introduced the anthropometric records of students; pioneered the application of the rating-scale and questionnaire methods and the free association technique; he also pioneered the use of statistical methods for the analysis of psychological tests. - Moreover, he noted that persons with mental retardation tend to have diminished ability to discriminate among heat, cold and pain.

Francis Galton

Visual discrimination of length

Galton Bar

determining the highest audible pitch

Galton whistle

Mathematical models of the mind; father of pedagogy as an academic discipline; went against Wundt

Johann Friedrich Herbart

Sensory thresholds; just noticeable differences (JND)

Ernst Heinrich Weber

Mathematics of sensory thresholds of experience; founder of psychophysics; considered one of the founders of experimental psychology

Gustav Theodor Fechner

First to relate sensation and stimulus

Weber-Fechner Law

Considered one of the founders of Psychology; first to set up a psychology laboratory

Wilhelm Wundt

Succeeded Wundt; brought Structuralism to America; his brain is still on display in the psychology department at Cornell

Edward Titchener

Pioneer of human ability testing; conducted seminars that changed the field of psychological testing

Guy Montrose Whipple

Major contributor to factor analysis; his approach to measurement was termed the law of comparative judgment

Louis Leon Thurstone

Provided the first accurate description of mental retardation as an entity separate from insanity

Jean Esquirol

Pioneered modern educational methods for teaching people who are mentally retarded/intellectually disabled

Edouard Seguin

An American psychologist who coined the term “mental test”

James McKeen Cattell

The father of IQ testing

Alfred Binet

Introduced the concept of IQ as determined by the mental age and chronological age

Lewis M. Terman

Introduced the two-factor theory of intelligence - General ability or “g”: required for performance on mental tests of all kinds - Special abilities or “s”: required for performance on mental test of only one kind

Charles Spearman

Primary Mental Abilities

Thurstone

Wechsler Intelligence Tests (WISC, WAIS)

Wechsler

Introduced the components of “g” - Fluid “g”: ability to see relationships as in analogies and letter and number series, also known as the primary reasoning ability, which decreases with age - Crystallized “g”: acquired knowledge and skills, which increase with age

Raymond Cattell

Theorized the “many factor intelligence theory” (6 types of operations X 5 types of contents X 6 types of products = 180 elementary abilities)

Guilford

Introduced the 3 “g’s” - Academic g, Practical g, and Creative g

Sternberg

Conceptualized the multiple intelligences theory

Howard Gardner

Translated the Binet-Simon test from French into English

Henry Goddard

Pioneered the first group intelligence test known as the Army Alpha (for literate) and Army Beta (for functionally illiterate)

Robert Yerkes

Introduced multiple-choice and other “objective” item type of tests

Arthur S. Otis

Devised the Personal Data Sheet (known as the first personality test) which aimed to identify soldiers who are at risk for shell shock

Robert S. Woodworth

Slow rise of projective testing - _________ Inkblot Test

Hermann Rorschach

Thematic Apperception Test

Henry Murray & Christina Morgan

Structured tests were being developed based on their better psychometric properties

Early 1940s

16 Personality Factors

Raymond B. Cattell

Big 5 Personality Factors

McCrae & Costa

Panukat ng Ugali at Pagkatao or PUP

Virgilio Enriquez

Panukat ng Katalinuhang Pilipino or PKP

Aurora R. Palacio

Panukat ng Pagkataong Pilipino or PPP

Anadaisy Carlota

Masaklaw na Panukat ng Loob or Mapa ng Loob

Gregorio E.H. Del Pilar

Philippine Thematic Apperception Test (PTAT)

Alfredo Lagmay

Initial ideas or thoughts of the psychologists - No brainer

Some Assumptions about Psychological Testing and Assessment / Basic Assumptions

Some Assumptions about Psychological Testing and Assessment

Assumption 1: Psychological Traits and States Exist
Assumption 2: Psychological Traits and States Can Be Quantified and Measured
Assumption 3: Test-Related Behavior Predicts Non-Test-Related Behavior
Assumption 4: Tests and Other Measurement Techniques Have Strengths and Weaknesses
Assumption 5: Various Sources of Error Are Part of the Assessment Process
Assumption 6: Testing and Assessment Can Be Conducted in a Fair and Unbiased Manner
Assumption 7: Testing and Assessment Benefit Society

Defined as “any distinguishable, relatively enduring way in which one individual varies from another” - Specific and unique

Trait

Distinguish one person from another but are relatively less enduring - Arises depending on context - Part of the personality

States

An informed, scientific concept developed or constructed to describe or explain behavior.

Construct

Refers to an observable action or the product of an observable action, including test- or assessment-related responses.

Overt Behavior

Reminder that a trait is not expected to be manifested in behavior 100% of the time. - Thus, it is important to be aware of the context or situation in which a particular behavior is displayed.

Relatively enduring

The test score is presumed to represent the strength of the targeted ability or trait or state and is frequently based on _______________

Cumulative Scoring

May refer to either: 1) a sample of behaviors from all possible behaviors that could conceivably be indicative of a particular construct or 2) a sample of test items from all possible items that could conceivably be used to measure a particular construct.

Domain Sampling

Refers to a long-standing assumption that factors other than what a test attempts to measure will influence performance on the test. - Test scores are always subject to questions about the degree to which the measurement process includes _______.

Error

The component of a test score attributable to sources other than the trait or ability measured.

Error Variance

An assumption is made that each testtaker has a true score on a test that would be obtained but for the random action of measurement error.

Classical or True Score Theory

- Assesses what a person usually does - There are no right or wrong answers - Values, Personality, Interest

Test of Typical Performance

Specific Types of Psychological Tests

1) Intelligence Test
2) Aptitude Test
3) Achievement Test
4) Personality Test
5) Projective Test
6) Interest Test
7) Attitude Inventory
8) Values Inventory
9) Diagnostic Test (for remediation)
10) Power Test: items arranged from easy to difficult (measures ability)
11) Speed Test: items of uniform difficulty
12) Creativity Test
13) Neuropsychological Test

- Decision theory as applied to psychological testing and measurement - Making inferences and decisions

Base rate
Hit rate
Miss rate

Is the extent to which a particular trait, behavior, characteristic, or attribute exists in the population (expressed as a proportion). - 10/100 people have depression

Base rate

May be defined as the proportion of people a test accurately identifies as possessing or exhibiting a particular trait, behavior, characteristic, or attribute. - Could refer to the proportion of people accurately predicted to be able to perform work at the graduate school level or to the proportion of neurological patients accurately identified as having a brain tumor. - Who are the 10 out of 100?

Hit rate

May be defined as the proportion of people the test fails to identify as having, or not having, a particular characteristic or attribute. - Amounts to an inaccurate prediction.

Miss rate

The category of misses may be further subdivided:

1) False Positive
2) False Negative

- Is a miss wherein the test predicted that the testtaker did possess the particular characteristic or attribute being measured when in fact the testtaker did not. - Accepting what should not be accepted

Type 1 / False Positive

- Is a miss wherein the test predicted that the testtaker did not possess the particular characteristic or attribute being measured when the testtaker actually did. - Rejecting what should not be rejected

Type 2 / False Negative
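
The base rate, hit rate, and miss rate cards above can be tried out on a small example. This is a minimal sketch, not a standard routine: the function name `decision_rates` and the data are made up for illustration, and the hit rate is taken here as the proportion of all correct classifications (the complement of the miss rate), which is one reading of the definitions above.

```python
def decision_rates(actual, predicted):
    """Return base, hit, miss, false-positive and false-negative rates as proportions.

    actual[i] is 1 if person i really has the attribute, else 0.
    predicted[i] is 1 if the test says person i has it, else 0.
    """
    n = len(actual)
    pairs = list(zip(actual, predicted))
    # False positive: test says "has it", person does not.
    false_pos = sum(1 for a, p in pairs if p and not a)
    # False negative: test says "does not have it", person does.
    false_neg = sum(1 for a, p in pairs if a and not p)
    misses = false_pos + false_neg
    return {
        "base_rate": sum(actual) / n,
        "hit_rate": (n - misses) / n,  # assumption: all correct classifications
        "miss_rate": misses / n,
        "false_positive_rate": false_pos / n,
        "false_negative_rate": false_neg / n,
    }


# Hypothetical data for 10 testtakers.
actual = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]
rates = decision_rates(actual, predicted)
print(rates)
```

In this made-up sample, 3 of 10 people have the attribute, and the test makes one false positive and one false negative.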

Basic Principles in the Use of Psychological Test

1) Tests are samples of behavior
2) Tests do not reveal traits and capacities directly
3) Psychological maladjustments selectively and differentially affect scores
4) Psychometric and projective approaches are mutually complementary

Steps in Clinical Psychology Assessment

1) Deciding what is being assessed
2) Determining the goals of assessment
3) Selecting standards for making decisions
4) Collecting assessment data
5) Making assessments and judgments
6) Communicating results

Approaches Used in Psychological Assessment and Testing

Nomothetic
Idiographic

- General / Population - Norms - Attempts to generalize - Objective - Numerical data

Nomothetic

- Focus on one / individual - Subjective experiences - Comparative only to itself

Idiographic

Cross Cultural Testing Parameters:

1) Language
2) Test Content
3) Education
4) Speed

A test or assessment process designed to minimize the influence of culture with regard to various aspects of the evaluation procedures, such as administration instructions, item content, responses required of testtakers, and interpretations made from the resulting data.

Culture-Fair Intelligence Test

May be defined as the extent to which a test incorporates the vocabulary, concepts, traditions, knowledge, and feelings associated with a particular culture.

Culture loading

- To isolate nature - Interaction between nature and nurture are not relative but cumulative

Culture-Free Test

The act of assigning numbers or symbols to characteristics of things (people, events, whatever) according to rules.

Measurement

Is a set of numbers (or other symbols) whose properties model empirical properties of the objects to which the numbers are assigned.

Scale

Primary Scales of Measurement

Nominal
Ordinal
Interval
Ratio

- Simplest form of measurement - Weakest - These scales involve classification or categorization based on one or more distinguishing characteristics, where all things measured must be placed into mutually exclusive and exhaustive categories. For example, people may be characterized by gender in a study designed to compare performance of men and women on some test. - No magnitude, equal intervals, or absolute zero - Nonparametric but can be quantified

Nominal

- Permit classification - Rank ordering on some characteristic is also permissible - Has magnitude but no equal intervals or absolute zero - Nonparametric - Median

Ordinal

- Contain equal intervals between numbers - Each unit on the scale is exactly equal to any other unit on the scale - Contains no absolute zero point - Parametric

Interval

- Has a true zero point - Strongest - All mathematical operations can meaningfully be performed because there exist equal intervals between the numbers on the scale as well as a true or absolute zero point - Has the properties of the nominal, ordinal, and interval scales - Parametric

Ratio

- To describe the data - Merely describes the results

Descriptive Statistics

May be defined as a set of test scores arrayed for recording or study.

Distribution

Is a straightforward, unmodified accounting of performance that is usually numerical. - may reflect a simple tally, as in number of items responded to correctly on an achievement test.

Raw Score

- All scores are listed alongside the number of times each score occurred. - Distribution of raw scores

Frequency of Distributions / Frequency Distributions

- Is a statistic that indicates the average or midmost score between the extreme scores in a distribution. - Mean, median, and mode

Measures of Central Tendency

- Statistics that describe the amount of variation in a distribution - Range, interquartile range, and semi-interquartile range - Standard deviation

Measures of Variability

Indication of how scores in a distribution are scattered or dispersed.

Variability
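
The measures of central tendency and variability named above can be computed with Python's standard `statistics` module. This is a small sketch on made-up scores; the variable names are illustrative, and `pstdev` treats the scores as a whole population rather than a sample.

```python
import statistics

# Hypothetical raw scores for seven testtakers.
scores = [70, 75, 75, 80, 85, 90, 95]

# Measures of central tendency.
mean = statistics.mean(scores)
median = statistics.median(scores)
mode = statistics.mode(scores)

# Measures of variability.
score_range = max(scores) - min(scores)
q1, q2, q3 = statistics.quantiles(scores, n=4)  # quartile cut points
iqr = q3 - q1                # interquartile range
semi_iqr = iqr / 2           # semi-interquartile range
sd = statistics.pstdev(scores)  # population standard deviation

print(mean, median, mode)
print(score_range, iqr, semi_iqr, sd)
```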

- An indication of how the measurements in a distribution are distributed. - Distributions can be characterized by their _________, or the nature and extent to which symmetry is absent.

Skewness

When relatively few of the scores fall at the high end of the distribution. - Most scores are low

Positive skew / Positively skewed

When relatively few of the scores fall at the low end of the distribution. - Most scores are high

Negative skew / Negatively skewed

The term testing professionals use to refer to the steepness of a distribution in its center

Kurtosis

Relatively flat distribution

Platykurtic

Relatively peaked distribution

Leptokurtic

Somewhere in the middle / normally distributed

Mesokurtic

A bell-shaped, smooth, mathematically defined curve that is highest at its center.

Normal Curve

- Normal Distribution - Homogenous Variance - Interval or Ratio Data - Pearson’s Correlation, Independent Measures T-Test, One-way / Independent-Measures ANOVA, Paired T-Test, One-way / Repeated Measures ANOVA

Parametric Test

- Normal Distribution is not required - Homogenous Variance is not required - Nominal or Ordinal Data - Spearman’s Correlation, Mann-Whitney U Test, Kruskal-Wallis H Test, Wilcoxon Signed-Rank Test, Friedman’s Test

Non-Parametric Test

Measures of Correlation

1) Pearson's Product Moment Correlation
2) Spearman's Rho Correlation
3) Kendall's Coefficient of Concordance
4) Phi Coefficient
5) Lambda

Parametric test for interval data

Pearson's Product Moment Correlation

Non-parametric test for ordinal data

Spearman's Rho Correlation
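
To see how the two coefficients above differ, here is a minimal pure-Python sketch. The function names are illustrative, not from any library. Pearson's r is computed on the raw interval scores; Spearman's rho is computed here as Pearson's r applied to the ranks of the scores, with tied scores given the average of their ranks.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    ss_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (ss_x * ss_y)


def ranks(values):
    """Average ranks (1-based); tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r computed on ranks."""
    return pearson_r(ranks(x), ranks(y))


# Hypothetical data: y rises with x, but not in a straight line.
x = [1, 2, 3, 4, 5]
y = [1, 4, 9, 16, 25]
print(pearson_r(x, y), spearman_rho(x, y))
```

Because the made-up y values always increase with x, the rank order agrees perfectly, while the raw scores do not fall on a straight line.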

Non-parametric test for ordinal data

Kendall's Coefficient of Concordance

Non-parametric test for dichotomous nominal data

Phi Coefficient

Non-parametric test for 2 groups (dependent and independent variable) of nominal data

Lambda

Measures of Prediction

1) Biserial Correlation
2) Point-Biserial Correlation
3) Tetrachoric Correlation
4) Simple Linear Regression
5) Multiple Linear Regression
6) Ordinal Regression

Predictive test for artificially dichotomized and categorical data as criterion with continuous data as predictors

Biserial Correlation

Predictive test for genuinely dichotomized and categorical data as criterion with continuous data as predictors

Point-Biserial Correlation

Predictive test for dichotomous data with categorical data as criterion and categorical data as predictors

Tetrachoric Correlation

A predictive test which involves one criterion that is continuous in nature with only one predictor that is continuous

Simple Linear Regression

A predictive test which involves one criterion that is continuous in nature with more than one continuous predictor

Multiple Linear Regression

A predictive test which involves a criterion that is ordinal in nature with more than one continuous predictor

Ordinal Regression

Chi-Square Test

1) Goodness of Fit
2) Test of Independence

Used to measure differences and involves nominal data and only one variable with 2 or more categories

Goodness of Fit

Used to measure correlation and involves nominal data and two variables with two or more categories

Test of Independence
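
As an illustration of the test of independence, the chi-square statistic can be computed from a table of observed counts for two nominal variables. This is a sketch with a made-up function name and made-up counts; it returns only the statistic and the degrees of freedom, and looking up the p-value is left out.

```python
def chi_square_independence(table):
    """Chi-square statistic and degrees of freedom for a table of observed counts.

    table is a list of rows; each row lists the counts for one category
    of the first variable across the categories of the second variable.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if the two variables were independent.
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(col_totals) - 1)
    return chi2, df


# Hypothetical 2 x 2 table: rows are two groups, columns are pass / fail.
observed = [[10, 20], [20, 10]]
chi2, df = chi_square_independence(observed)
print(chi2, df)
```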

Comparison of two groups

1) Paired T-Test
2) Unpaired T-Test
3) Wilcoxon Signed-Rank Test
4) Mann-Whitney U Test

A parametric test for paired groups with normal distribution

Paired T-Test

A parametric test for unpaired groups with normal distribution

Unpaired T-Test

A non-parametric test for paired groups with non-normal distribution

Wilcoxon Signed-Rank Test

A non-parametric test for unpaired groups with non-normal distribution

Mann-Whitney U Test

Comparison of three or more groups

1) Repeated Measures ANOVA
2) One-way/Two-Way ANOVA
3) Friedman F Test
4) Kruskal-Wallis H Test

A parametric test for matched groups with normal distribution

Repeated Measures ANOVA

A parametric test for unmatched groups with normal distribution

One-way/Two-Way ANOVA

A non-parametric test for matched groups with non-normal distribution

Friedman F Test

A non-parametric test for unmatched groups with non-normal distribution

Kruskal-Wallis H Test

The stability or consistency of the measurement
Goals:
A) Estimate errors in psychological measurement
B) Devise techniques to improve testing so errors are reduced

Reliability

Types of Reliability

1) Test-Retest Reliability
2) Parallel-Forms / Alternate Forms Reliability
3) Split-Half Reliability
4) Inter-Rater / Inter-Observer Reliability
5) Standard Error of Measurement

- Compares the scores of individuals who have been measured twice by the instrument
- Not applicable for tests involving reasoning and ingenuity
- A longer interval will result in a lower correlation coefficient, while a shorter interval will result in a higher correlation
- The ideal interval is 2-4 weeks
- Source of error variance is time sampling
- Utilizes Pearson r or Spearman rho

Test-Retest Reliability

- The same persons are tested with one form on the first occasion and with another equivalent form on the second
- The administration of the second, equivalent form either takes place immediately or fairly soon
- The two forms should be truly parallel, independently constructed tests designed to meet the same specifications, contain the same number of items, have items which are expressed in the same form, have items that cover the same type of content, have items with the same range of difficulty, and have the same instructions, time limits, illustrative examples, format and all other aspects of the test
- Has the most universal applicability
- For immediate administration, the source of error variance is content sampling
- For delayed administration, the source of error variance is time sampling and content sampling
- Utilizes Pearson r or Spearman rho

Parallel-Forms / Alternate Forms Reliability

- Two scores are obtained for each person by dividing the test into equivalent halves (odd-even split or top-bottom split) - The reliability of the test is directly related to the length of the test - The source of error variance is content sampling - Utilizes the Spearman-Brown Formula

Split-Half Reliability
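
The Spearman-Brown formula mentioned above estimates the reliability of the full-length test from the correlation between its two halves: r_full = 2 * r_half / (1 + r_half). A minimal sketch follows; the function names are illustrative, and the general form with a length factor n is included because reliability depends on test length.

```python
def spearman_brown(r, n=2.0):
    """Estimated reliability of a test n times as long as the one with reliability r.

    With n=2 this is the usual split-half correction.
    """
    return n * r / (1 + (n - 1) * r)


def odd_even_halves(item_scores):
    """Split one person's item scores into odd- and even-numbered item totals."""
    odd = sum(item_scores[0::2])   # items 1, 3, 5, ...
    even = sum(item_scores[1::2])  # items 2, 4, 6, ...
    return odd, even


# Hypothetical: the two halves of a test correlate at 0.60.
full_test_reliability = spearman_brown(0.60)
print(full_test_reliability)
print(odd_even_halves([1, 0, 1, 1, 0, 1]))
```

The corrected value is higher than the half-test correlation because each half is only half as long as the real test.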

- Degree of agreement between raters on a measure - Source of error variance is inter-scorer differences - Often utilizes Cohen's Kappa statistic

Inter-Rater / Inter-Observer Reliability
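
Cohen's kappa corrects the raw agreement between two raters for the agreement expected by chance: kappa = (observed - expected) / (1 - expected). The sketch below uses a made-up function name and made-up ratings; it assumes exactly two raters who rated the same cases, and it does not handle the edge case where chance agreement equals 1.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters assigning categories to the same cases."""
    n = len(rater_a)
    # Proportion of cases on which the raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement from each rater's category proportions.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / n ** 2
    return (observed - expected) / (1 - expected)


# Hypothetical ratings of four cases as "y" (present) or "n" (absent).
rater_a = ["y", "y", "n", "n"]
rater_b = ["y", "n", "n", "n"]
kappa = cohens_kappa(rater_a, rater_b)
print(kappa)
```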

- An index of the amount of inconsistency, or the amount of expected error, in an individual's score - The higher the reliability, the lower the SEM

Standard Error of Measurement

Long standing assumption that factors other than what a test attempts to measure will influence performance on the test

Error

The component of test score attributable to sources other than the trait or ability being measured

Error Variance

Sources of error that reside within the individual taking the test (such as: I didn't study enough, I felt bad that I missed a blind date, I forgot to set the alarm, and other excuses)

Trait Error

Are those sources of errors that reside in the testing situation, such as lousy test instructions, too warm room, or missing pages

Method Error

A range or band of test scores that is likely to contain the true score

Confidence Interval

A statistical measure that can aid a test user in determining how large a difference should be before it is considered statistically significant

Standard Error of the Difference
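
The standard error of measurement, confidence interval, and standard error of the difference cards above are tied together by a few standard formulas: SEM = SD * sqrt(1 - reliability), a confidence band of score ± z * SEM, and SE of the difference = sqrt(SEM1² + SEM2²). The sketch below uses illustrative function names and made-up numbers, and assumes z = 1.96 for a 95% band.

```python
import math


def sem(sd, reliability):
    """Standard error of measurement from the test's SD and reliability."""
    return sd * math.sqrt(1 - reliability)


def confidence_interval(score, sem_value, z=1.96):
    """Band of scores likely to contain the true score (z=1.96 assumed for 95%)."""
    return (score - z * sem_value, score + z * sem_value)


def se_difference(sem1, sem2):
    """Standard error of the difference between two scores."""
    return math.sqrt(sem1 ** 2 + sem2 ** 2)


# Hypothetical test: SD = 15, reliability = 0.91, observed score = 100.
test_sem = sem(15, 0.91)
low, high = confidence_interval(100, test_sem)
print(test_sem, low, high)
print(se_difference(3, 4))
```

Raising the reliability shrinks the SEM, which narrows the confidence band around the observed score.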

A judgment or estimate of how well a test measures what it purports to measure in a particular context

Validity

Types of Validity

1) Face Validity
2) Content Validity
3) Criterion-Related Validity
4) Construct Validity

The least stringent type of validity: whether a test looks valid to test users, examiners and examinees

Face Validity

- Definitions and concepts - Whether the test covers the behavior domain to be measured which is built through the choice of appropriate content areas, question, tasks and items

Content Validity

Issues arising from lack of content validity:

1) Construct Underrepresentation
2) Construct-Irrelevant Variance

Failure to capture important components of a construct (e.g., an English test that contains only vocabulary items and no grammar items will have poor content validity)

Construct Underrepresentation

Happens when scores are influenced by factors irrelevant to the construct (e.g., test anxiety, reading speed, reading comprehension, illness)

Construct-Irrelevant Variance

Types of Criterion-Related Validity

1) Concurrent Validity
2) Predictive Validity

Standard against which a test or a test score is evaluated

Criterion

The extent to which test scores may be used to estimate an individual's present standing on a criterion

Concurrent Validity

The scores on a test can predict future behavior or scores on another test taken in the future

Predictive Validity

- Assembling evidence about what a test means - A series of statistical analyses showing that one variable is separate from another - Is like proving a theory through evidence and statistical analysis

Construct Validity

Discriminant Validation

1) Convergent Validity
2) Divergent Validity

A test correlates highly with other variables with which it should correlate (example: Extraversion, which is highly correlated with sociability)

Convergent Validity

A test does not correlate significantly with variables from which it should differ (example: Optimism which is negatively correlated with Pessimism)

Divergent Validity

A refined statistical technique for analyzing the interrelationships of behavior data

Factor Analysis

A method of data reduction

Principal Components Analysis

Items do not make a factor, the factor should predict scores on the item and is classified into two (Exploratory Factor Analysis for summarizing data and Confirmatory Factor Analysis for generalization of factors)

Common Factor Analysis

May be defined as a method of evaluation and a way of deriving meaning from test scores by evaluating an individual’s score with reference to a set standard - To be eligible for a high-school diploma, students must demonstrate at least a sixth-grade reading level. - Has also been referred to as Domain or Content-Referenced Testing

Criterion-Referenced Testing

One way to derive meaning from a test score is to evaluate the test score in relation to other scores on the same test - Percentile - NMAT

Norm-Referenced Testing

Is an expression of the degree and direction of correspondence between two things.

Correlation

MIDTERM Flashcards

(187 cards)