MIDTERM Flashcards
What is Psychological Assessment and Testing all about?
- To measure behavior (overt and covert)
- To describe and predict behavior and personality (traits, states,personality types, attitudes, interests, values, etc.)
- To determine signs and symptoms of dysfunctionality (for case formulation, diagnosis, and basis for intervention/plan for action)
Gathering and integration of psychology-related data for the purpose of making psychological evaluation that is accomplished through the use of tools (test,interviews, case studies, behavioral observations)and specially designed and measurement procedures.
Psychological Assessment
Process of measuring psychology-related variables by means of devices on procedures designed to obtain sample of behavior.
Psychological Testing
A standardized measuring device or procedure used to describe the ability, knowledge, skills or attitude of the individual.
Psychological Test(s)
The process of quantifying the amount or number of a particular occurrence of event, situation, phenomenon, object or person.
Measurement
The process of synthesizing the results of measurement with reference to some norms and standards.
Assessment
Tools of Psychological Assessment
1) Psychological Tests
2) Interviews
3) Portfolio Assessment
4) Case-History Data
5) Behavioral Observation
6) Role Play Tests
7) Computers as Tools
The process of judging the worth of any occurrence of event, situation, phenomenon, object or person which concludes with a particular decision.
Evaluation
A tool of assessment in which information is gathered through direct, reciprocal communication.
- Ideally conducted face to face
- Telephone: pitch, pause are signs of emotion
Interviews
Three types of interviews
Structured
Semi-structured
Unstructured
Method of gathering information through direct communication involves:
1) Reciprocal exchange
2) Take note of verbal and non-verbal actions—facial expressions, eye contact and general reaction to the demand of the interview
A type of work sample is used as an assessment tool—sample of one’s ability and accomplishment.
- Education (writing samples) tools for hiring instructors.
Portfolio Assessment
Records, transcripts, and other accounts in any media that preserve archival information, official and informal accounts, and other data and items relevant to the assessee
- Records, transcripts and other accounts in written, pictorial or other form that present archival information, official and informal accounts and other data and items relevant to assessee
- Files/excerpts from files maintained in situation and agencies
- Letters, written correspondence, photos, family albums, newspaper,magazine clippings, home news and pictures, movies and audio tapes
- Shed light on individuals
- Past and current adjustment as well as on the events/circumstances that may have contributed to any changes in assessment
Case-History Data
Monitoring the actions of others or oneself by visual or electronic means while recording quantitative/qualitative information regarding those sections—can be used as diagnostic aid (inpatient facilities, behavioral research lab, classroom)
Behavioral Observation
Tool for assessment wherein assesses are directed to act as if they were in a particular situation—used when real settings are too impractical.
- Substance abusers can used as both a tool for assessment and measure of outcome
Role Play Tests
- As test administrators, computers do much more than replace the “equipment” that was so widely used in the past (a number 2 pencil).
- Computers can serve as test administrators (online or off) and as highly efficient test scorers. Within seconds they can derive not only test scores but patterns of test scores.
Computers as Tools
Types of Tests Based on the Number of Examinees
1) Individual Test
2) Group Test
The examiner/test administrator gives the test to only one person
Individual Test
The examiner/test administrator gives to more than one person
Group Test
Tests Based on the Type of Behavior They Measure
1) Ability Test
a) Achievement Test
b) Aptitude Test
c) Intelligence Test
2) Personality Test
3) Interest Test
- Cognitive Performance—Based Measures
- Measures what people can do
- Pertains to capacity or potential; items are scored according to speed, accuracy or both
- Variable measurement
- Presence of right and wrong
- IQ, Aptitude, Achievement
Ability Test
Measures previous learning
Achievement Test
Measures potential for learning or acquiring a specific skill
Aptitude Test
General potential to solve problems, adapt to changing circumstances, think abstractly, and profit from experience.
Intelligence Test
It has to do with an individual’s covert and overt dispositions, such as a person’s tendency to act in a certain way or respond in a certain way in a given situation.
Personality Test
Provides a self-report statement to which the person responds “True” or “False”, “Yes” or “No”.
Structured Personality Test
Provides an ambiguous test stimulus
Projective Personality Test
Originally developed for vocational guidance but later found its way to employee selection and career development
Interest Test
Three-Tier System of Psychological Tests
1) Level A
2) Level B
3) Level C
These tests are those that can be administered, scored and interpreted by responsible non-psychologist who have carefully read the manual and are familiar with the overall purpose of testing.
- Educational achievement tests fall into this category
- Examples: Achievement tests and other specialized (skill-based) aptitude tests
Level A
These tests require technical knowledge of test construction and use of appropriate advanced coursework in psychology and related courses
- Examples: group intelligence tests and personality tests
Level B
These tests require an advanced degree in Psychology or License as Psychologist and advanced training/supervised experience in a particular test
- Examples: Projective tests, Individual Intelligence tests, Diagnostic tests
Level C
Testing was instituted as a means of selecting who, of the many applicants, would obtain government jobs
Chinese Civilization
Tests were used to measure intelligence and physical skills
Greek Civilization
These universities relied on formal exams in conferring degrees and honors
European Universities
Believed that despite our similarities, no two humans are exactly alike. Some of these individual differences are more “adaptive than others and these differences lead to more complex, intelligent organisms over time.
Charles Darwin
He established the testing movement; introduced the anthropometric records of students; pioneered the application of-rating-scale and questionnaire method, and the free association technique; he also pioneered the use of statistical methods for the analysis of psychological tests.
- Moreover, he also noted that persons with mental retardation tend to have diminished ability to discriminate among heat, cold and pain.
Francis Galton
Visual Discrimination Length
Galton Bar
determining the highest audible pitch
Galton whistle
Mathematical models of the mind; father of pedagogy as an academic discipline; went against Wundt
Johan Friedrich Herbart
Sensory thresholds; just noticeable differences (JND)
Ernst Heinrich Weber
Mathematics of sensory thresholds of experience; founder of psychophysics; considered one of the founders of experimental psychology
Gustav Theodor Fechner
First to relate sensation and stimulus
Weber-Fechner Law
Considered one of the founders of Psychology; first to set up a psychology laboratory
Wilhelm Wundt
Succeeded Wundt; brought Structuralism to America; his brain is still on display in the psychology department at Cornell
Edward Titchner
Pioneer of human ability testing; conducted seminars that changed the field of psychological testing
Guy Montrose Whipple
Large contributor of factor analysis; approach to measurement was termed as the law of comparative judgment
Louis Leon Thurstone
Provided the first accurate description of mental retardation as an entity separate from insanity
Jean Esquirol
Pioneered modern educational methods for teaching people who are mentally retarded/intellectually disabled
Edouard Seguin
An American psychologist who coined the term “mental test”
James McKeen Cattell
The father of IQ testing
Alfred Binet
Introduced the concept of IQ as determined by the mental age and chronological age
Lewis M. Terman
Introduced the two-factor theory of intelligence
- General ability or “g”: required for performance on mental tests of all kinds
- Special abilities or “s”: required for performance on mental test of only one kind
Charles Spearman
Primary Mental Abilities
Thurstone
Wechsler Intelligence Tests (WISC, WAIS)
Wechsler
Introduced the components of “g”
- Fluid “g”: ability to see relationships as in analogies and letter and number series, also known as the primary reasoning ability which decreases with age
- Crystalized “g”: acquired knowledge and skills which increases with age
Raymond Cattell
Theorized the “many factor intelligence theory” (6 types of operations X 5 types of contents X 6 types of products = 180 elementary abilities)
Guilford
Introduced the 3 “g’s”
- Academic g, Practical g, and Creative g
Sternberg
Conceptualized the multiple intelligences theory
Howard Gardner
Translated the Binet-Simon test into French
Henry Goddard
Pioneered the first group intelligence test known as the Army Alpha (for literate) and Army Beta (for functionally illiterate)
Robert Yerkes
Introduced multiple-choice and other “objective” item type of tests
Arthur S. Otis
Devised the Personal Data Sheet (known as the first personality test) which aimed to identify soldiers who are at risk for shell shock
Robert S. Woodworth
Slow rise of projective testing
- _________ Inkblot Test
Herman Rorschach
Thematic Apperception Test
Henry Murray & Christina Morgan
Structure tests were being developed based on their better psychometric properties
Early 1940’s
16 Personality Factors
Raymond B. Cattell
Big 5 Personality Factors
McCrae & Costa
Panukat ng Ugali at Pagkatao or PUP
Virgilio Enriquez
Panukat ng Katalinuhang Pilipino or PKP
Aurora R. Palacio
Panukat ng Pagkataong Pilipino or PPP
Anadaisy Carlota
Masaklaw na Panukad ng Loob or Mapa ng Loob
Gregorio E.H. Del Pilar
Philippine Thematic Apperception Test (PTAT)
Alfredo Lagmay
Initial ideas or thoughts of the psychologists
- No brainer
Some Assumptions about Psychological Testing and Assessment / Basic Assumptions
Some Assumptions about Psychological Testing and Assessment
Assumption 1: Psychological Traits and States Exist
Assumption 2: Psychological Traits and States Can Be Quantified and Measured
Assumption 3: Test-Related Behavior Predicts Non-Test-Related Behavior
Assumption 4: Tests and Other Measurement Techniques Have Strengths and Weaknesses
Assumption 5: Various Sources of Error Are Part of the Assessment Process
Assumption 6: Testing and Assessment Can Be Conducted in a Fair and Unbiased Manner
Assumption 7: Testing and Assessment Benefit Society
Defined as “any distinguishable, relatively enduring way in which one individual varies from another”
- Specific and unique
Trait
Distinguish one person from another but are relatively less enduring
- Arises depending on context
- Part of the personality
States
An informed, scientific concept developed or constructed to describe or explain behavior.
Construct
Refers to an observable action or the product of an observable action, including test- or assessment-related responses.
Overt Behavior
Reminder that a trait is not expected to be manifested in behavior 100% of the time.
- Thus, it is important to be aware of the context or situation in which a particular behavior is displayed.
Relatively enduring
The test score is presumed to represent the strength of the targeted ability or trait or state and is frequently based on _______________
Cumulative Scoring
May refer to either:
1) a sample of behaviors from all possible behaviors that could conceivably be indicative of a particular construct or
2) a sample of test items from all possible items that could conceivably be used to measure a particular construct.
Domain Sampling
Refers to a long-standing assumption that factors other than what a test attempts to measure will influence performance on the test.
- Test scores are always subject to questions about the degree to which the measurement process includes _______.
Error
The component of a test score attributable to sources other than the trait or ability measured.
Error Variance
An assumption is made that each testtaker has a true score on a test that would be obtained but for the random action of measurement error.
Classical or True Score Theory
- Assess what a person usually does
- There are no right or wrong
- Values, Personality, Interest
Test of Typical Performance
Specific Types of Psychological Tests
1) Intelligence Test
2) Aptitude Test
3) Achievement Test
4) Personality Test
5) Projective Test
6) Interest Test
7) Attitude Inventory
8) Values Inventory
9) Diagnostic Test (For remedial test)
10) Powered / Power Test — easy to difficult (measures the ability)
11) Speed Test — uniformed
12) Creativity Test
13) Neuropsychological Test
- Decision theory as applied to psychological testing and measurement
- Making inferences and decisions
Base rate
Hit rate
Miss rate
Is the extent to which a particular trait, behavior, characteristic, or attribute exists in the population (expressed as a proportion).
- 10/100 people have depression
Base rate
May be defined as the proportion of people a test accurately identifies as possessing or exhibiting a particular trait, behavior, characteristic, or attribute.
- Could refer to the proportion of people accurately predicted to be able to perform work at the graduate school level or to the proportion of neurological patients accurately identified as having a brain tumor.
- Who are the 10 out of 100?
Hit rate
May be defined as the proportion of people the test fails to identify as having, or not having, a particular characteristic or attribute.
- Amounts to an inaccurate prediction.
Miss rate
The category of misses may be further subdivided:
1) False Positive
2) False Negative
- Is a miss wherein the test predicted that the testtaker did possess the particular characteristic or attribute being measured when in fact the testtaker did not.
- Accepting what should not be accepted
Type 1 / False Positive
- Is a miss wherein the test predicted that the testtaker did not possess the particular characteristic or attribute being measured when the testtaker actually did.
- Rejecting what should not be rejected
Type 2 / False Negative
Basic Principles in the Use of Psychological Test
1) Tests are samples of behavior
2) Tests do not reveal traits and capacities directly
3) Psychological maladjustments selectively and differentially affects scores
4) Psychometric and projective approach are mutually complementary
Steps in Clinical Psychology Assessment
1) Deciding what is being assess
2) Determining the goals of assessment
3) Selective standards for making decisions
4) Collecting assessment data
5) Making assessment and judgment
6) Communicating results
Approaches Use in Psychological Assessment and Testing
Nomothetic
Idiographic
- General / Population
- Norms
- Attempts to generalize
- Objective
- Numerical data
Nomothetic
- Focus on one / individual
- Subjective experiences
- Comparative only to itself
Idiographic
Cross Cultural Testing Parameters:
1) Language
2) Test Content
3) Education
4) Speed
A test or assessment process designed to minimize the influence of culture with regard to various aspects of the evaluation procedures, such as administration instructions, item content, responses required of testtakers, and interpretations made from the resulting data.
Culture-Fair Intelligence Test
May be defined as the extent to which a test incorporates the vocabulary, concepts, traditions, knowledge, and feelings associated with a particular culture.
Culture loading
- To isolate nature
- Interaction between nature and nurture are not relative but cumulative
Culture-Free Test
As act of assigning numbers or symbols to characteristics of things (people, events, whatever) according to rules.
Measurement
Is a set of numbers (or other symbols) whose properties model empirical properties of the objects to which the numbers are assigned.
Scale
Primary Scales of Measurement
Nominal
Ordinal
Interval
Ratio
- Simplest form of measurement
- Weakest
- These scales involve classification or categorization based on one or more distinguishing characteristics, where all things measured must be placed into mutually exclusive and exhaustive categories. For example, people may be characterized by gender in a study designed to compare performance of men and women on some test.
- No magnitude, equal intervals, and absolute 0
- Nonparametric but can be quantified
Nominal
- Permit classification
- Rank ordering on some characteristic is also permissible
- Has a magnitude but no equal intervals and absolute 0
- Nonparametric
- Median
Ordinal
- Contain equal intervals between numbers
- Each unit on the scale is exactly equal to any other unit on the scale
- Contains no absolute zero point
- Parametric
Interval
- Has a true zero point
- Strongest
- All mathematical operations can meaningfully be performed because there exist equal intervals between the numbers on the scale as well as a true or absolute zero point
- Contains nominal, ordinal, interval, and ratio
- Parametric
Ratio
- To describe the data
- Merely describes the results
Descriptive Statistics
May be defined as a set of test scores arrayed for recording or study.
Distribution
Is a straightforward, unmodified accounting of performance that is usually numerical.
- may reflect a simple tally, as in number of items responded to correctly on an achievement test.
Raw Score
- All scores are listed alongside the number of times each score occurred.
- Distribution of raw scores
Frequency of Distributions / Frequency Distributions
- Is a statistic that indicates the average or midmost score between the extreme scores in a distribution.
- Mean, median, and mode
Measures of Central Tendency
- Statistics that describe the amount of variation in a distribution
- Range: Interquartile and semi-quartile
- Standard deviation
Measures of Variability
Indication of how scores in a distribution are scattered or dispersed.
Variability
- An indication of how the measurements in a distribution are distributed.
- Distributions can be characterized by their _________, or the nature and extent to which symmetry is absent.
Skewness
When relatively few of the scores fall at the high end of the distribution.
- Low scores >
Positive skew / Positively skewed
When relatively few of the scores fall at the low end of the distribution.
- High scores >
Negative skew / Negatively skewed
The term testing professionals use to refer to the steepness of a distribution in its center
Kurtosis
Relatively flat distribution
Platykurtic
Relatively peak distribution
Leptokurtic
Somewhere in the middle / normally distributed
Mesokurtic
A bell-shaped, smooth, mathematically defined curve that is highest at its center.
Normal Curve
- Normal Distribution
- Homogenous Variance
- Interval or Ratio Data
- Pearson’s Correlation, Independent Measures T-Test, One-way / Independent-Measures ANOVA, Paired T-Test, One-way / Repeated Measures ANOVA
Parametric Test
- Normal Distribution is not required
- Homogenous Variance is not required
- Nominal or Ordinal Data
- Spearman’s Correlation, Mann-Whitney U Test, Kruskal-Wallis H Test, Wilcoxon Signed-Rank Test, Friedman’s Test
Non-Parametric Test
Measures of Correlation
1) Pearson’s Product Moment Correlation
2) Spearman Rho’s Correlation
3) Kendall’s Coefficient of Concordance
4) Phi Coefficient
5) Lambda
Parametric test for interval data
Pearson’s Product Moment Correlation
Non-parametric test for ordinal data
Spearman Rho’s Correlation
Non-parametric test for ordinal data
Kendall’s Coefficient of Concordance
Non-parametric test for dichotomous nominal data
Phi Coefficient
Non-parametric test for 2 groups (dependent and independent variable) of nominal data
Lambda
Measures of Prediction
1) Biserial Correlation
2) Point-Biserial Correlation
3) Tetrachoric Correlation
4) Simple Linear Regression
5) Multiple Linear Regression
6) Ordinal Regression
Predictive test for artificially dichotomized and categorical data as criterion with continuous data as predictors
Biserial Correlation
Predictive test for genuinely dichotomized and categorical data as criterion with continuous data as predictors
Point-Biserial Correlation
Predictive test for dichotomous data with categorical data as criterion and categorical data as predictors
Tetrachoric Correlation
A predictive test which involves one criterion that is continuous in nature with only one predictor that is continuous
Simple Linear Regression
A predictive test which involves one criterion that is continuous in nature with more than one continuous predictor
Multiple Linear Regression
A predictive test which involves a criterion that is ordinal in nature with more than one predictors that are continuous
Ordinal Regression
Chi-Square Test
1) Goodness of Fit
2) Test of Independence
Used to measure differences and involves nominal data and only one variable with 2 or more
categories
Goodness of Fit
Used to measure correlation and involves nominal data and two variables with two or more categories
Test of Independence
Comparison of two groups
1) Paired T-Test
2) Unpaired T-Test
3) Wilcoxon Signed-Rank Test
4) Mann-Whitney U Test
A parametric test for paired groups with normal distribution
Paired T-Test
A parametric test for unpaired groups with normal distribution
Unpaired T-Test
A non-parametric test for paired groups with non-normal distribution
Wilcoxon Signed-Rank Test
A non-parametric test for unpaired groups with non-normal distribution
Mann-Whitney U Test
Comparison of three or more groups
1) Repeated Measures ANOVA
2) One-way/Two-Way ANOVA
3) Friedman F Test
4) Kruskal-Wallis H Test
A parametric test for matched groups with normal distribution
Repeated Measures ANOVA
A parametric test for unmatched groups with normal distribution
One-way/Two-Way ANOVA
A non-parametric test for matched groups with non-normal distribution
Friedman F Test
A non parametric test for unmatched groups with non-normal distribution
Kruskal-Wallis H Test
- The stability or consistency of the measurement
Goals:
A) Estimate errors in psychological measurement
B) Devise techniques to improve testing so errors are reduced
Reliability
Types of Reliability
1) Test-Retest Reliability
2) Parallel-Forms / Alternate Forms Reliability
3) Split-Half Reliability
4) Inter-Rater / Inter-Observer Reliability
5) Standard Error of Measurement
- Compare the scores of individuals who have been measured twice by the instrument
- This is not applicable for tests involving reasoning and ingenuity
- Longer interval will result to lower correlation coefficient while shorter interval will result to higher correlation
- The ideal time is 2-4 weeks
- Source of error variance is time sampling
- Utilizes Pearson R or Spearman Rho
Test-Retest Reliability
- Same persons are tested with one form on the first occasion and with another equivalent form on the second
- The administration of the second, equivalent form either takes place immediately or fairly soon.
- The two forms should be truly paralleled, independently constructed tests designed to meet the same specifications, contain the same number of items, have items which are expressed in the same form, have items that cover the same type of content, have items with the some range of difficulty, and have the same instructions, time limits illustrative examples, format and all other aspects of the test
- Has the most universal applicability
- For immediate, the source of error variance is content sampling
- For delayed, the source of error variance is time sampling and content sampling
- Utilizes Pearson R or Spearman Rho
Parallel-Forms / Alternate Forms Reliability
- Two scores are obtained for each person by dividing the test into equivalent halves (odd-even split or top-bottom split)
- The reliability of the test is directly related to the length of the test
- The source of error variance is content sampling
- Utilizes the Spearman-Brown Formula
Split-Half Reliability
- Degree of agreement between raters on a measure
- Source of error variance is inter-scorer differences
- Often utilizes Cobon’s Kappa statistic
Inter-Rater / Inter-Observer Reliability
- An index of the amount of inconsistency of the amount of expected error in an individual’s score
- The higher the reliability, the lower SEM
Standard Error of Measurement
Long standing assumption that factors other than what a test attempts to measure will influence performance on the test
Error
The component of test score attributable to sources other than the trait or ability being measured
Error Variance
Are those sources of errors that reside within an individual taking the test (such as: I didn’t study enough, I felt bad that I missed blind date, I forgot to set the alarm, excuses)
Trait Error
Are those sources of errors that reside in the testing situation, such as lousy test instructions, too warm room, or missing pages
Method Error
A range or band of test scores that is likely to contain the true score
Confidence Interval
A statistical measure that can aid a test user in determining how large a difference should be before it is considered statistically significant
Standard Error of the Difference
A judgment or estimate of how well a test measures what it purports to measure in a particular test
Validity
Types of Validity
1) Face Validity
2) Content Validity
3) Criterion-Related Validity
4) Construct Validity
The least stringent type of validity: whether a test looks valid to test users, examiners and examinees
Face Validity
- Definitions and concepts
- Whether the test covers the behavior domain to be measured which is built through the choice of appropriate content areas, question, tasks and items
Content Validity
Issues arising from lack of content validity:
1) Construct Underrepresentation-Failure
2) Construct-Irrelevant Variance
To capture important components of a construct (e.g. An English test which only contains vocabulary items but no grammar items will have a poor content validity.)
Construct Underrepresentation-Failure
Happens when scores are influenced by factors irrelevant to the construct le.g. test anxiety, reading speed, reading comprehension, illness)
Construct-Irrelevant Variance
Types of Criterion-Related Validity
1) Concurrent Validity
2) Predictive
Standard against which a test or a test score is evaluated
Criterion
The extent to which test scores may be used to estimate an individual’s present standing on a criterion
Concurrent Validity
The scores on a test can predict future behavior or scores on another test taken in the future
Predictive
- Assembling evidence about what a test means
- Series of statistical analysis that one variable is a separate variable
- Is like proving a theory through evidences and statistical analysis
Construct Validity
Discriminant Validation
1) Convergent Validity
2) Divergent Validity
A test correlates highly with other variables with which it should correlate (example: Extraversion which is highly correlated sociability)
Convergent Validity
A test does not correlate significantly with variables from which it should differ (example: Optimism which is negatively correlated with Pessimism)
Divergent Validity
A retained statistical technique for analyzing the interrelationships of behavior data
Factor Analysis
A method of data reduction
Principal Components Analysis
Items do not make a factor, the factor should predict scores on the item and is classified into two (Exploratory Factor Analysis for summarizing data and Confirmatory Factor Analysis for generalization of factors)
Common Factor Analysis
May be defined as a method of evaluation and a way of deriving meaning from test scores by evaluating an individual’s score with reference to a set standard
- To be eligible for a high-school diploma, students must demonstrate at least a sixth-grade reading level.
- Has also been referred to as Domain or Content-Referenced Testing
Criterion-Referenced Testing
One way to derive meaning from a test score is to evaluate the test score in relation to other scores on the same test
- Percentile
- NMAT
Norm-Referenced Testing
Is an expression of the degree and direction of correspondence between two things.
Correlation