PSYC 523 Flashcards

Question

Reliability (types)

Answer 1

What: It is a measure of the trustworthiness or consistency of a measure. The degree to which the instrument is free of random error, yielding the same results across multiple applications. This is used to assess how reliable a testing method is to get the same results over different conditions and free of measurement error. Internal reliability is the extent to which a measure is consistent within itself (split half, KR20, Alpha) while external reliability is the extent to which a measure varies from one use to another (interrater, test-retest, and parallel forms). Test-Retest Reliability examines consistency of a measure from one time to another. Same test given at two points in time. Correlation between those scores obtained by the same person on 2 occasions Inter-Rater Reliability examines the degree of consistency between different raters' scores. Correlation between those scores Parallel Forms Reliability examines the consistency of the results of two tests constructed in the same way from the same content domain. Tests must be very very similar! Correlation between the equivalent forms of the test Why: EX: A teacher wants to assess if the kids in her class are learning the math material she is teaching. She will test them on the material and then again 3 weeks later. This will help assess if the measure is consistent.

Answer 2

What: A sample is a relatively small subset of the population that is selected to represent the population in a study; the sample must be representative of the population being studied. A population is all members of a group; the larger group of individuals from which a sample is selected. Random sampling is the primary method for obtaining samples and is the best way to mimic the population from which the samples were drawn Why: It’s important to ensure that the sample is representative of the population in research because it increases the potential that any findings of importance can be generalized back to the whole population EX: A researcher is studying the effects of CBT on depression. Out of the population of depressed people, the researcher will get a random sample of this population

Answer 3

rd error of estimate: What: This is part of regression and is the relationship between X and Y given by a regression equation as an index of how closely the predicted value of y for a specific value of x matches its actual value. The smaller the standard error of estimate, the more confident one can be in the accuracy of the estimated y value. How much the data points are spread around the regression line. Why: EX: A researcher finds that the standard error of the estimate for measuring depression scores is only .001 away from the true value of the sample's depression. This is a good standard error.

Answer 4

What: In simpler terms- an estimate of how much an individual's score would be expected to change if retested with the same/equivalent form of test. The smaller the SEM the more precise the measurement capacity of the instrument. The larger the SEM, the greater the score variation across administrations Why: The SEM provides an indication of how confident one may be that an individual's obtained score on any given measurement opportunity represents their true score. EX: An athlete takes a concussion test every year. Each time it is administered to get a baseline measurement, the athlete scores within the same 2 points. The standard error of measurement is low

Answer 5

What: Statistical calculation that informs, on average, how much deviation there is across groups in a study. Scores from the two groups are paired up, and the difference between each pair is calculated. These differences provide a distribution of deviations, of which the average is standardized giving the standard error of the difference. It's the estimate of error between the two groups. *A two-sample t-test compares the means of two samples to see if they came from the same population.* Why: EX: Anxiety between teenagers in the US and Germany. A researcher conducts a study on how caffeine affects test scores. They take the mean of scores from each group (with or without caffeine) and calculate the differences between the means. They then used the standard error of the difference to find the amount of error between the estimated and actual difference.

Answer 6

What: In the context of psychometrics; a systematic error in the measurement process that differentially influences scores for identified groups. OR. This is the tendency of scores on a test to systematically over or underestimate the true performance of individuals to whom that test is administered, particularly because they are members of specific groups like ethnic minorities or genders. Why: This bias is a systematic error and is important to keep in mind when adding cultural and ethnic factors into test making. EX: Researchers develop a test that examines depression levels. The test uses language and vernacular that is not easily recognized by non-white American populations. The test has bias.

Answer 7

What: A type I error is rejecting the null hypothesis when it is in fact true. It is detecting an effect or relationship that does not actually exist. “A false positive.” The probability of making a type I error is the alpha or significance level p-value. Type II error is failing to reject the null hypothesis when it is in fact not true. This is not detecting an effect or relationship exists when there actually is one. This can be assessed using the beta level Why: EX: A researcher is studying CBT for depression and gets a p-value of .06 and says that there were significant differences in the groups. This would be a type I error as there was not an effect discovered

Answer 8

What: The extent to which you're measuring the construct you intended to measure; in general a validity coefficient of 0.3-0.4 is considered adequate. Content Validity: degree to which a measure represents all aspects of a given construct ; how well a measure encompasses the full domain of what it is trying to measure Criterion Validity: extend to which the test corresponds with a particular criterion against which it is compared; how well one measure predicts outcome of another measure Concurrent Validity: extent to which a new measure correlates with a previously established/validated measure Construct Validity: the degree to which the test measures the construct or trait it intends to measure Face Validity: a logical rather than statistical quality; the extent to which a test is subjectively viewed as covering the concept it purports to measure In the context of research...are results trustworthy and meaningful? Internal Validity: whether the effects observed in a study are due to the manipulation of the independent variable and not some other factor External Validity: the extent to which the results of a study can be generalized to other situations and to other people Ecological validity, an aspect of external validity, refers to whether a study's findings can be generalized to the real world Why: EX: A researcher is coming up with a depression measure. She compares the validity with the BDI and it turns out to be valid, she is using concurrent validity.

Answer 9

What: This is the measure of the spread, or dispersion, of scores within a sample or population, whereby a small variance indicates highly similar scores, all close to the sample mean, and a large variance indicates more scores at a greater distance from the mean and possibly spread over a larger range. A measure of variability; the average squared deviation around the mean Why: Variance is helpful in research because it can be quantified using statistics and converted to a number that can be used to compare between samples or across samples in populations to see which has the most or least variance or to see how much variance may change due to an intervention or treatment applied. EX: A researcher is studying the effects of an SSRI on depression symptoms. The variance between the placebo group and the intervention group is high. This means that the SSRI works to treat depressive symptoms.

PSYC 523 Flashcards

(33 cards)