PA DX INTENSIVE (On-going) (4/29/24) Flashcards
- What is the best statistical tool to use in establishing the split-half reliability of a test with a limited number of items?
a. Spearman-Brown formula
b. Cronbach’s Coefficient Alpha
c. Kuder-Richardson 20
d. Pearson r
a. Spearman-Brown formula
Reliability - Consistency, accuracy, dependability of test results
Test-retest reliability
Administering a test at 2 different times
time sampling (consistency of test over time)
Pearson r
Parallel Forms reliability
Compares 2 equivalent forms of a test that measure the same attributes
Item sampling (different items, but same difficulty, number of items, and content)
Alternate forms/equivalent forms
Internal consistency (measures only one construct)
Split-half reliability
Divide into halves then score separately (odd-even or random)
Spearman-Brown formula (the more items, the higher reliability); good if you have limited # of items
Kuder-Richardson 20
Used for dichotomous items; only one correct answer
Test has varying degrees of item difficulty
Kuder-Richardson 21
Used for dichotomous items; only one correct answer
Test has same level of item difficulty (usually for speed test)
Cronbach’s Coefficient Alpha
Used for polychotomous (several possible answers); no one correct answer
Used for Likert scales
Dichotomous items: Only 1 correct answer; Polychotomous: No one correct answer; does NOT refer to number of choices
Interrater Reliability
Consistency of judges/raters evaluating the same behavior
Observer differences
Kappa statistics
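A minimal sketch of how some of these coefficients could be computed (numpy assumed; the 0/1 item matrix and all values are invented for illustration, not taken from any actual test):
```python
import numpy as np

# Hypothetical 0/1 (dichotomous) responses: 6 examinees x 6 items
X = np.array([
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 1, 0],
    [1, 1, 0, 1, 1, 0],
    [1, 0, 1, 0, 0, 1],
    [1, 0, 0, 1, 0, 0],
    [0, 0, 1, 0, 0, 0],
])

# Split-half: correlate odd-item totals with even-item totals (Pearson r)
odd, even = X[:, 0::2].sum(axis=1), X[:, 1::2].sum(axis=1)
r_half = np.corrcoef(odd, even)[0, 1]

# Spearman-Brown correction: estimate reliability of the full-length test
r_full = (2 * r_half) / (1 + r_half)

# Cronbach's alpha (reduces to KR-20 when items are scored 0/1)
k = X.shape[1]
alpha = (k / (k - 1)) * (1 - X.var(axis=0, ddof=1).sum() / X.sum(axis=1).var(ddof=1))

print(f"split-half r = {r_half:.2f}, Spearman-Brown = {r_full:.2f}, alpha = {alpha:.2f}")
```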
A study reveals that the higher a person’s food intake is in the evening, the lower his sleep quality gets. This kind of result shows
a. Negative relationship
b. Positive relationship
c. Significant difference
d. No significant relationship
a. Negative relationship
The Children’s Personality Questionnaire-R is an example of what kind of test?
a. Unstructured test
b. Projective test
c. Structured test
d. Intelligence test
c. Structured test
Ability Test
- Achievement test - previous learning (past)
- Aptitude test - potential for learning or acquiring new skill (future)
- Intelligence test - general potential (present)
Personality test - overt and covert dispositions
- Structured - usually self-report; evaluate yourself
- Projective - either stimulus or response is ambiguous; unstructured
The test taker neglected to have breakfast before the test. As he proceeds with the examination, his stomach starts to growl. What type of challenge to internal validity does this situation pose?
a. Testing
b. Instrumentation
c. History
d. Selection
c. History
Threats to internal validity
> History - occurrence of outside events that affect performance (e.g., something that happened before or during the test)
> Maturation - internal/physical changes (longitudinal studies)
> Testing - effects of pretest to the post test (practice effect)
> Instrumentation - inconsistent use of measurement instrument (mistake in test material or administration)
> Statistical regression - tendency of extreme scores to regress toward mean score
> Selection - no random assignment
- Quasi experiment: no random assignment or no control group
> Subject mortality - loss of subjects
> Selection interaction - family of threats (multiple threats)
Which of the following elements must be present before an experiment can be called as a true experiment?
a. Random assignment
b. Control group
c. Any of the above
d. All of the above
d. All of the above
A true experiment requires both random assignment and a control group; if either one is missing, it is a quasi-experiment
The Draw-A-Person test was developed by:
a. Lewis Terman
b. John Buck
c. R.B. Cattell
d. Florence Goodenough
d. Florence Goodenough
> Terman - Revised the Binet-Simon scale into the Stanford-Binet
> Buck - Developed the House-Tree-Person test
> R.B. Cattell - Developed the CFIT and 16 PF, conceptualized fluid and crystallized intelligence, popularized factor analysis
> James Cattell - Coined the term “mental test” and launched the beginning of mental testing
> Goodenough - Developed the Draw-A-Person test
When the distribution of scores includes outliers, it is better not to use
a. Mean
b. Median
c. Mode
d. All of the above
a. Mean
- Never get the mean if there are outliers
- Get Median if there are outliers; least affected by extreme scores
This is the test used to determine the current developmental level of infants.
a. Apgar test
b. Kaufman Assessment Battery
c. Woodcock-Johnson III
d. Bayley Scale
d. Bayley Scale
– Apgar - given to newborns to check for abnormalities; administered twice (1 minute after birth, then 5 minutes after birth); given a third time at 10 minutes if the results are still poor
– Kaufman Assessment Battery - Intelligence test for young children
– Woodcock-Johnson III - Intelligence test; For detecting learning disabilities
– Bayley Scale - for current developmental level (more on motor skills)
In order to determine the concurrent validity of a test, the statistical tool to be employed is
a. Spearman-Brown formula
b. Kuder-Richardson 20
c. Point-biserial correlation
d. Pearson r
d. Pearson r
Validity - meaning and usefulness of results; if test is appropriate
Criterion Validity
- How well it corresponds to a particular criterion
- Types:
> Criterion test - a well-established test that is already known to be valid
> Criterion data - any data you can use that’s related to your test/as a basis (eg. performance appraisal, diagnosis, records)
- Predictive validity - forecasting function; the test predicts a criterion measured later
- Concurrent validity - relationship between test and criterion measured at about the same time; no significant time has passed
Content validity
- Adequacy of representation of conceptual domain the test is designed to cover
- Experts judge validity of test items, use critical/logical thinking skills
Construct validity
- Degree to which a test measures what it purports to measure
- Used if test measures abstract variables (exists but hard to measure)
- Based on theoretical perspective
- All encompassing; if you establish construct validity, you establish other validities
- Convergent Validity - correlates highly with measures of related constructs; the theory says the constructs are related
- Divergent/discriminant validity - low correlations with measures of unrelated constructs; the theory says the constructs are unrelated
Face validity
- The test is subjectively viewed as measuring what it purports to measure
- Physical appearance of test
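As a rough illustration of establishing concurrent validity with Pearson r (the scores below are invented; a real analysis would use actual test and criterion data gathered at roughly the same time):
```python
import numpy as np

# Hypothetical scores: a new test and an already-established criterion test,
# both administered to the same people with no significant time gap
new_test  = np.array([12, 18, 25, 30, 35, 41, 47, 52])
criterion = np.array([10, 20, 22, 33, 31, 45, 44, 55])

r = np.corrcoef(new_test, criterion)[0, 1]   # Pearson r = validity coefficient
print(f"concurrent validity coefficient: r = {r:.2f}")
```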
The split-half reliability is used to determine
a. If the test can disregard bias despite the number of factors being measured
b. If all the items in the test measures the same dimension
c. Whether consistent scores would be obtained regardless of the characteristics of the test taker
d. All of the above
b. If all the items in the test measures the same dimension
Split half - internal consistency
Reliability - Consistency, accuracy, dependability of test results
— Test-retest reliability
- Administering a test at 2 different times
- time sampling (consistency of test over time)
- Pearson r
— Parallel Forms reliability
- Compares 2 equivalent forms of a test that measure the same attributes
- Item sampling (Diff items, but same difficulty/# of items/content)
- Alternate forms/equivalent forms
— Internal consistency (measures only one construct)
– Split–half reliability
- Divide into halves then score separately (odd-even or random)
- Spearman-Brown formula (the more items, the higher reliability); good if you have limited # of items
– Kuder-Richardson 20
- Used for dichotomous items; only one correct answer
- Test has varying degrees of item difficulty
– Kuder-Richardson 21
- Used for dichotomous items; only one correct answer
- Test has same level of item difficulty (usually for speed test)
– Cronbach’s Coefficient Alpha
- Used for polychotomous (several possible answers); no one correct answer
- Used for likert scales
Dichotomous items: Only 1 correct answer; Polychotomous: No one correct answer; does NOT refer to number of choices
— Interrater Reliability
- Consistency of judges/raters evaluating the same behavior
- Observer differences
- Kappa statistics
The 16-PF is a personality test that could also determine unusual responses. Specifically, which of the following?
a. Impression Management
b. Infrequency
c. Acquiescence
d. All of the above
d. All of the above
16 PF can detect 3 kinds of unusual responses
-> Impression management: social desirability
------ IM score is 95% and above: person is faking good
------ IM score is 5% and below: person is faking bad
-> Infrequency: person is playing safe
------ IN score is 95% and above: person is playing safe
-> Acquiescence: tendency to agree with most questions
------ IAC score is 95% and above: person is agreeing to everything
When our code of ethics conflicts with the law, what is the best step to do?
a. Maintain our stance regardless of the law as our ethical code is formulated to protect the safety of our clients and patients
b. Adhere to the law, but at the same time try to minimize the inconvenience it might inflict to the people involved
c. Obey the law as it is the highest order of the land.
d. Resolve the conflict while being committed to the code of ethics
d. Resolve the conflict while being committed to the code of ethics
-> Do D (resolve conflict while adhering to ethics) first as much as possible
-> If you’ve tried everything and it truly cannot be resolved, ADHERE TO THE LAW
For individuals who are legally incapable of providing consent, we must
a. Nevertheless explain appropriately to the client
b. Obtain informed assent from them
c. Obtain appropriate permission from their legally authorized person
d. All of the above
d. All of the above
—> Informed assent: For minors (17 and below)
—> Informed consent: For adults (18 and above)
—> If the client is incapable of providing consent, obtain assent from the client and permission (consent) from the legally authorized person
Mark and Arvin are scheduled to have their height and weight taken at the clinic. They said it was unnecessary because they had just taken their measurements the day before. To their surprise, Mark’s weight went from 55 kg to 58 kg, and Arvin’s weight moved from 60 kg to 61 kg, while both of them maintained their height. This could imply that
a. The weighing scale has systematic error
b. The weighing scale has random error
c. Mark ate more than Arvin did the night before
d. None of the above
b. The weighing scale has random error
—-> Systematic error - the error is fixed, so it is still possible to recover the true score (e.g., consistently adding 5 kg to all readings)
—-> Random error - the error is not consistent, which makes it difficult to recover the true score
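A toy simulation of the difference (the weights, bias, and noise range are made up; only the contrast between a fixed offset and inconsistent readings matters):
```python
import random

true_weight = 55.0                                            # hypothetical true weight in kg

def biased_scale(w, bias=5.0):
    """Systematic error: a fixed bias, so the true score is still recoverable."""
    return w + bias

def noisy_scale(w, spread=3.0):
    """Random error: inconsistent noise, so repeated readings disagree."""
    return w + random.uniform(-spread, spread)

print([biased_scale(true_weight) for _ in range(3)])          # always 60.0
print([round(noisy_scale(true_weight), 1) for _ in range(3)]) # varies each run
```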
When releasing test data, we divulge which of the following?
a. Release raw and scaled scores
b. Release client’s responses to the test questions
c. Observation notes
d. None of the above
d. None of the above
—-> Don’t release raw and scaled scores because the client does not know the interpretation
—-> Don’t release responses because that’s confidential
—-> Don’t release observation notes because you’re not obligated to give them (they may also be messy or judgmental)
In conducting research, which one of the following types of statements is always false?
a. Analytical statement
b. Falsifiable statement
c. Contradictory statement
d. Hypothetical statement
c. Contradictory statement
> Analytical statement - always true
> Falsifiable statement - can be disproved by research
> Contradictory statement - always false / self-contradicting
—- We don’t want analytical and contradictory statements
—- We WANT falsifiable statements
One of the primary objectives in developing this scale was to provide an intelligence test suitable for adults, as previously available tests were all designed for school children.
a. Wechsler-Bellevue Intelligence Scale
b. Raven’s Progressive Matrices
c. Stanford-Binet Intelligence Scale
d. Culture Fair Intelligence Test
a. Wechsler-Bellevue Intelligence Scale
—-> The Stanford-Binet was first developed for schoolchildren (gifted or not) and was verbal
—-> Wechsler argued that a test was needed for adults, with a nonverbal component
—-> In later revisions, the SB added nonverbal content
States that measurement error is always random, and advocates standardization of tests.
a. Domain Sampling Model
b. Classical Test Score Theory
c. Item Discriminability Analysis
d. Item Response Theory
b. Classical Test Score Theory
In reliability, we have two theories: CTT and IRT
——– Classical test score theory ——
—-> Assumes that each person has a true score that would be obtained if there were no errors in measurement (the true score is possible to obtain, but it is hard to reach because random error is always present; therefore, we need to standardize the test to minimize the error)
—-> Total score = true score + error
—-> Advocates standardization - Minimize error by standardizing test
—-> Domain sampling model - considers the problem created by using a limited number of items (the more items, the higher the reliability)
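In symbols, the “more items, higher reliability” idea is usually carried by the Spearman-Brown prophecy formula, r_new = n·r_old / (1 + (n − 1)·r_old); a small sketch with made-up reliability values:
```python
def spearman_brown(r_old: float, n: float) -> float:
    """Predicted reliability when a test is lengthened by a factor of n
    (Spearman-Brown prophecy formula, in the spirit of the domain sampling model)."""
    return (n * r_old) / (1 + (n - 1) * r_old)

# Classical test theory: observed score = true score + error (X = T + E);
# adding parallel items averages out random error, so reliability rises.
print(round(spearman_brown(0.60, 2), 2))   # doubling the test: 0.60 -> 0.75
print(round(spearman_brown(0.60, 3), 2))   # tripling the test: 0.60 -> 0.82
```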
A company, in order to determine whether the new computer system will significantly reduce the amount of time spent in data processing, divided their Finance department into two groups. The first group used the new system, while the second group used the old one. However, upon learning of this set-up, the workers in the second group worked overtime to make sure that they would finish processing their data faster. This is an example of
a. Reactivity
b. Pygmalion effect
c. Rosenthal effect
d. John Henry effect
d. John Henry effect
Reactivity - altering behavior due to awareness of being observed
—-> Hawthorne Effect - They know they’re being studied/observed; therefore, they perform better.
—-> John Henry Effect - Control group in competition with experimental group
How do you establish Alternate Forms reliability?
a. You administer form A of your personality inventory to your sample. After they are finished, you administer form B.
b. You administer form A to your sample, wait a couple of weeks, and then administer form B to the same sample.
c. All of the above
d. None of the above
c. All of the above
Can do either
Subjects serve more than one condition of the independent variable.
a. Between-subjects design
b. Within-subjects design
c. Mixed design
d. Factorial design
b. Within-subjects design
———- Experimental designs ———-
> Between subjects - subjects serve only one condition of IV.
———-> Two/more diff groups, only one condition of IV each.
Ex. I have a total of 60 respondents; I gave white chocolate to 30 respondents and dark chocolate to the other 30
> Within-subjects - aka repeated measures; subjects serve more than one condition of IV
———-> One group, more than one condition of IV.
———-> Longitudinal studies
Ex. I have a total of 60 respondents; I gave both white and dark chocolate to all 60 respondents
> Mixed - one factor is within subject, the other is between subject; has two IV.
———-> Each group only experiences one condition of 1 IV, but also multiple conditions of the other IV.
Ex. Whitening Soap & Frequency of Bathing (2, 4, or 6 times)
- Kagayaku: Group 1 (30 respondents), tested under all three bathing frequencies
- Kayakukayamu: Group 2 (30 respondents), tested under all three bathing frequencies
- Kayanatinglahat: Group 3 (30 respondents), tested under all three bathing frequencies
- Total of 90 respondents all in all
- Between-subjects factor: the whitening soap (each group uses only one brand)
- Within-subjects factor: the frequency of bathing (every group experiences all frequencies)
Michelle developed a test with two sets. In order to identify its reliability, she should employ what statistical tool?
a. Pearson r
b. Kuder-Richardson 20
c. Cronbach’s alpha
d. Kappa statistics
a. Pearson r
- Two sets of the same test: correlate one set with the other (parallel/alternate forms)
(See the full reliability notes above.)
In reliability, what range estimate is acceptable in the clinical setting?
a. .50
b. .80
c. .70
d. .95
d. .95
It must be nearly certain/almost perfect because, in the clinical setting, the result determines a person’s fate
For basic research, .70 is acceptable
Harry wants to increase the reliability of his 35-item test. In order to do so, what should he do?
a. Conduct pilot testing
b. Add more items
c. find experts to validate the test
d. correlate it to similar tests
b. Add more items
The more the merrier!
(See the Classical Test Score Theory and Domain Sampling Model notes above.)
According to the code of ethics, which of the following is true when it comes to disclosing information?
a. We disclose information only when the client provides permission to do so
b. We disclose information to the source of referral even without the consent of the client
c. When the people need to be protected from harm
d. All of the above
d. All of the above
A - true; disclose with consent
B - true; need to give result to source of referral
C - true; duty to protect
Allen asked his professor for help regarding a sensitive case of a research subject he is currently handling in his study. He gave his professor the necessary information about the case, but not the name of the subject. Allen is protecting his subject’s
a. Anonymity
b. Confidentiality
c. Obscurity
d. Privacy
a. Anonymity
> Anonymity - Protect identity
Confidentiality - Protect information (test scores)
Alice was given an intelligence test on Monday and she obtained a score of 100. She took the same test on Wednesday, and she obtained a score of 130. Based on this, the intelligence test is therefore
a. Reliable but not valid
b. Valid but not reliable
c. Not reliable and not valid
d. Reliable and valid
c. Not reliable and not valid
A test can be reliable but not valid; but a test CANNOT be valid unless it’s reliable
Reliability limits the validity of the test
Obtained when the test measures what it purports to measure.
a. Criterion validity
b. Construct validity
c. Reliability
d. Concurrent validity
b. Construct validity
Construct validity - degree to which a test measures what it purports to measure
(See the full validity notes above.)
Which of the following personality tests does not score ambiguous responses?
a. Sack’s Sentence Completion Test
b. Rotter Incomplete Sentence Blank
c. Purpose in Life test
d. None of the above
d. None of the above
A and B are sentence completion tests; they have ambiguous responses
C also scores ambiguous responses
If there is evidence that the association between two variables is not significantly different from 0, then we
a. Reject the null hypothesis
b. Reject the alternative hypothesis
c. Accept the alternative hypothesis
d. Both a and b
b. Reject the alternative hypothesis
Not significantly different from 0 - there is NO significant difference
Null: There is NO significant difference
Alternative: There IS a significant difference
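A minimal sketch of that decision rule (scipy assumed; the paired data echo the earlier food-intake/sleep-quality example and are entirely invented):
```python
from scipy.stats import pearsonr

food_intake   = [2, 3, 5, 6, 8, 9]      # hypothetical evening food intake
sleep_quality = [8, 9, 6, 5, 4, 2]      # hypothetical sleep quality ratings

r, p = pearsonr(food_intake, sleep_quality)
if p < 0.05:
    print(f"r = {r:.2f}: the association differs from 0 -> reject the null")
else:
    print(f"r = {r:.2f}: no evidence the association differs from 0 -> keep the null, reject the alternative")
```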
At the very least, what should be the item difficulty of a multiple-choice item with four choices for it to be reasonable?
a. .25
b. .30
c. .20
d. .15
b. .30
> Item analysis - Set of methods to evaluate items
> Item difficulty - proportion of people who got the item correct; the higher the value, the easier the question/test; the lower the value, the harder the question/test
—– Technically a measure of item easiness (a higher value means an easier item)
—– Optimal difficulty: halfway between 100% and the level of success expected by chance alone (the item’s difficulty index should be higher than the chance probability); e.g., for a four-choice item, chance = .25, so optimal = (1.00 + .25) / 2 ≈ .63
—– .30-.70 to maximize information about the differences among individuals
> Item Discriminability - determines whether people who have done well on item have also done well on the whole test (can discriminate high scorers from low scorers)
—–> Extreme group model: compares those who have done well to those who have done poorly
—————In each item, check which group scored more (high scorers should have more to have good discriminability)
ex. If the group of top performers got the item correct and the low scorers did not, accept the item.
If the group of top performers got the item wrong and the low scorers got it correct, reject the item.
—–> Point-Biserial method: correlation between performance on the item and performance on the whole test; used when one of the variables is dichotomous/categorical and the other is continuous
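A rough sketch of these item-analysis steps with an invented 0/1 response matrix (numpy assumed; note the item-total correlation here leaves the item inside the total, which real analyses often correct for):
```python
import numpy as np

# Hypothetical 0/1 responses: 8 examinees x 5 items (1 = correct)
X = np.array([
    [1, 1, 1, 1, 1],
    [1, 1, 1, 1, 0],
    [1, 1, 1, 0, 0],
    [1, 1, 0, 1, 0],
    [1, 0, 1, 0, 0],
    [0, 1, 0, 0, 1],
    [1, 0, 0, 0, 0],
    [0, 0, 0, 0, 0],
])
total = X.sum(axis=1)

# Item difficulty: proportion answering correctly (higher value = easier item)
difficulty = X.mean(axis=0)

# Optimal difficulty: halfway between 1.00 and chance (four choices -> chance = .25)
optimal = (1.00 + 0.25) / 2                                  # 0.625

# Extreme-group discriminability: top scorers' difficulty minus bottom scorers'
order = np.argsort(total)
bottom, top = order[:3], order[-3:]
discrimination = X[top].mean(axis=0) - X[bottom].mean(axis=0)

# Point-biserial: Pearson r between each dichotomous item and the total score
point_biserial = [round(np.corrcoef(X[:, j], total)[0, 1], 2) for j in range(X.shape[1])]

print("difficulty:", np.round(difficulty, 2))
print("optimal difficulty (4 choices):", optimal)
print("extreme-group discrimination:", np.round(discrimination, 2))
print("point-biserial:", point_biserial)
```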
This is developed when the classical test score theory is deemed inadequate in identifying the true ability of the test takers.
a. Domain Sampling Model
b. Item discriminability
c. Item Response Theory
d. Item Analysis
c. Item Response Theory (IRT)
Item response theory (IRT)
—– Focuses on the range of item difficulty that helps assess an individual’s ability
—– Need an item bank, with each item having its own difficulty
—– Item branching - administering items based on the response to the previous item; can administer harder or easier items to gauge the ability of the test taker
—– Computerized Adaptive Testing (CAT)
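A toy sketch of the item-branching idea only (not any specific IRT model or CAT algorithm; the item bank and difficulty values are invented):
```python
# Hypothetical item bank: item id -> difficulty (higher value = harder item)
bank = {"i1": 0.2, "i2": 0.35, "i3": 0.5, "i4": 0.65, "i5": 0.8}

def next_item(current_id, correct, seen):
    """Branch to the closest harder unseen item after a correct answer,
    or to the closest easier unseen item after a wrong answer."""
    d = bank[current_id]
    pool = {i: diff for i, diff in bank.items()
            if i not in seen and (diff > d if correct else diff < d)}
    if not pool:
        return None                       # nothing suitable left -> stop testing
    return min(pool, key=lambda i: abs(pool[i] - d))

# Start in the middle of the bank and branch on a simulated answer pattern
seen, item = {"i3"}, "i3"
for correct in [True, False, True]:       # hypothetical responses
    item = next_item(item, correct, seen)
    if item is None:
        break
    seen.add(item)
    print(item, bank[item])               # i4 0.65, then i2 0.35, then i5 0.8
```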
Test-retest reliability only applies to
a. Overt behaviors
b. Covert traits
c. Stable traits
d. Dominant traits
c. Stable traits
> Test-retest: consistency of the test over time (only meaningful for traits that are stable across time)
(See the full reliability notes above.)
The researchers are in the middle of the experiment seeking to identify if noise affects concentration when suddenly the aircon turned off. If the experiment continues, what could be an extraneous variable in this case?
a. Noise
b. Concentration
c. Volume of the sound
d. Temperature
d. Temperature
Extraneous variables - variables that are not part of the experiment, but they do exist and affect the results
Jovie is a newly-hired psychometrician in a company. Just before her scheduled employee testing, her boss spoke to her and asked if she could finish the assessment, which normally takes 2 hours, within 30 minutes justifying that the testing procedures are just formality and the employee would be accepted no matter what. What should Jovie do?
a. Finish the assessment within 30 minutes, as apparently the test would not be used as a basis for hiring selection
b. Compromise with the boss to give her at least 30 minutes more
c. Agree, but inform the applicant about the change
d. Do not agree and explain the testing procedures to the boss
d. Do not agree and explain the testing procedures to the boss
Establishing this psychometric property requires good logical skills and intuition.
a. Construct validity
b. Content validity
c. Face validity
d. Interrater Reliability
b. Content validity
Content validity - experts judge the validity of test items using critical/logical thinking skills
(See the full validity notes above.)
He recognized the need for the rapid classification of recruits with respect to general intellectual level during World War I.
a. Alfred Binet
b. Robert Yerkes
c. Karl Pearson
d. Sir Francis Galton
b. Robert Yerkes
WWI: Army Alpha and Army Beta; Developed by Yerkes
Army Alpha - Literates; verbal
Army Beta - Illiterates; nonverbal
The more items a test has, the higher the reliability it will possess. The concept behind this is called
a. Domain Sampling Model
b. Item Response Theory
c. Classical Test Score Theory
d. Item Bank Analysis
a. Domain Sampling Model
See number 18
In test-retest reliability, carryover effects do not harm the reliability when
a. The changes in score happened on only a proportion of the test takers
b. The changes in score happened on all of the test takers
c. The changes in score affected only few test items
d. The changes in score affected all the test items
b. The changes in score happened on all of the test takers
Which of the following is an example of external consistency?
a. Interrater reliability
b. Alternate Forms reliability
c. Split-half reliability
d. None of the above
a. Interrater reliability
External Consistency - Interrater
Temporal Consistency - Test retest
Form Consistency - Alternate forms
Split-Half Consistency - Split-half