Midterm 1 Flashcards

Question

What are the types of reliability measures?

Answer 1

Inter-rater reliability (finding consistency between raters) Test-retest (re-taking the same test over and over) Internal consistency reliability includes split-half reliability, Cronbach Alpha, and item-total.

Answer 2

It can become biased. If someone is taking the same test, over and over, they may become better at it over time.

Answer 3

How well a test or tool (hypothesis, operational definition etc.) measures what it intended to actually measure.

Answer 4

1. Face validity 2. Content Validity 3. Predictive Validity 4. Concurrent Validity 5. Convergent Validity 6. Discriminant Validity

Answer 5

Does the test appear to measure what it is intending to measure. As face validity is a subjective measure, it’s often considered the weakest form of validity. However, it can be useful in the initial stages of developing a method.

Answer 6

Content validity assesses whether a test is representative of all aspects of the construct. To produce valid results, the content of a test, survey or measurement method must cover all relevant parts of the subject it aims to measure. If some aspects are missing from the measurement (or if irrelevant aspects are included), the validity is threatened.

Answer 7

A mathematics teacher develops an end-of-semester algebra test for her class. The test should cover every form of algebra that was taught in the class. If some types of algebra are left out, then the results may not be an accurate indication of students’ understanding of the subject. Similarly, if she includes questions that are not related to algebra, the results are no longer a valid measure of algebra knowledge.

Answer 8

This is the degree to which a test accurately predicts a criterion that will occur in the future. For example, a prediction may be made on the basis of a new intelligence test, that high scorers at age 12 will be more likely to obtain university degrees several years later. If the prediction is born out then the test has predictive validity. “Can our measure predict something in the future?” Can this selection test predict performance on the job?

Answer 9

This is the degree to which a test corresponds to an external criterion that is known at the time and is already valid. If the new test is validated by a comparison with a currently existing criterion, we have concurrent validity. These are both for the same construct.

Answer 10

For example, let’s say a group of nursing students take two final exams to assess their knowledge. One exam is a practical test and the second exam is a paper test. If the students who score well on the practical test also score well on the paper test, then concurrent validity has occurred.

Answer 11

Convergent validity is a supporting piece of evidence for construct validity. The underlying idea of convergence validity is that related construct’s tests should be highly correlated. For example, in order to test the convergent validity of a measure of self-esteem, a researcher may want to show that measures of similar constructs, such as self-worth, confidence, social skills, and self-appraisal are also related to self-esteem. - different methods of measuring the same construct, to see whether both are related

Answer 12

Discriminant validity tests whether concepts or measurements that are not supposed to be related are actually unrelated. - the same methods, measure different constructs, give scores that are NOT correlated.

Answer 13

1. gold standard | 2. other measures

Answer 14

an event, situation, behaviour, or characteristic. Something that has a quality or quantity.

Answer 15

A variable that measures a magnitude or quantity.

Answer 16

1. Interval - all quantitative variables are interval, but 0:00 is not defined. Celsius or Fahrenheit is not a ratio variable because 0°C does not mean there is no temperature 2. Ratio - takes into account a true zero. Such as time at 0:00, that is a meaningful time. Weight, age, pulse rate, etc. 3. Discrete variable - Variables that can only take on a finite number of values are called "discrete variables." For example, you can only use whole numbers when describing your siblings. You can't have HALF a sibling. 4. Continuous variable - Variables that can take on an infinite number of possible values are called "continuous variables." For example, height can be continuous as in 1.65 metres.

Answer 17

Variables that have different qualities (gender, colours, where you live etc).

Answer 18

Nominal - there is no obvious relationship between the levels Ordinal - takes on an order (i.e., pain level on a scale of 1-5).

Answer 19

You can usually take an average or use a subtraction test for a quantitative variable. Quantitative variables can be discrete or continuous. If we use a midway test, by taking two values and averaging them, if that new value has a meaning it is discrete. If not, it is continuous.

Answer 20

Non-monotonic means that the relationship does not always go in a single direction (i.e., non-linear).

Answer 21

1. No direct intervention 2. Observational or correlational 3. Both variables are measured 4. You can record record physiological responses, or observe behaviour. 5. Examples include self-reports, or using existing records. 6. Cannot determine causal relationships

Answer 22

1. At least one variable is manipulated, the IV. 2. One variable is measured, the DV. 3. Can determine causal relationships

Answer 23

Two variables that appear causal, when they are actually not.

Answer 24

Variables that influence both the dependent and independent variable. The confound makes it hard to determine which variables are actually causing the effect.

Answer 25

Extraneous variables, which are any factors that are in the experiment but not being studied, and confounding variables, which are related to the independent variable and affect the dependent variable.

Answer 26

Poor operational definitions, participant factors (age, intelligence, socio-economic status), order effects (fatigue, practice), and group factors

Answer 27

Random assignment. Random assignment normalizes the effects of confounds.

Answer 28

The extent to which a study establishes a cause-and-effect relationship between a treatment and an outcome.

Answer 29

With random assignment, each participant has an equal chance of being placed in any of the experimental groups or control groups.

Answer 30

Random sampling is different from random assignment. Random sampling involves recruiting random subjects from the population to ensure there is no bias etc.

Answer 31

They help us organize, summarize and describe data (usually based on samples)

Answer 32

They help us make generalizations from the sample to the population.

Answer 33

Shape, spread (variability), outliers, and central tendency.

Answer 34

A pie chart, bar graph, histogram, or frequency table.

Answer 35

1. Bar graph and pie chart | 2. Histogram

Answer 36

1. Deviation

Answer 37

As sample size increases, the statistics become less variable, and more closely estimate the true population.

Answer 38

Shape, central tendency, and variability.

Answer 39

Symmetric (each half is a mirror image of the other) | Skewed (positively skewed - tail is on the right, negatively skewed - the tail is on the left).

Answer 40

There can be unimodal, bimodal, multimodal, or uniform (no defined mode) distributions.

Answer 41

The mean, the median (50th percentile), and the mode (most frequently occurring data).

Answer 42

Mean, median and mode.

Answer 43

Nominal - mode | Ordinal - median, and mode

Answer 44

Pros - it can be used for all types of data | Cons - it only tells you the most frequently occurring data point, but ignores the rest of the data

Answer 45

Pros - Robust against outliers, it gives us a better summary of skewed data, and can be used with ordinal data Cons - Limits the use of many statistical tests

Answer 46

Pros - tells us the average Cons - cannot be used for categorical data, sensitive to outliers, and a poor measure of "central tendency" for highly skewed distributions

Answer 47

A survey/self-report that is administered through an interview or questionnaire.

Answer 48

To learn about attitudes and beliefs, facts, demographics, and behaviours.

Answer 49

Charles Darwin

Answer 50

1. open ended or closed ended questions 2. a methodology for asking people to tell them about themselves 3. can be used to study relationships between variables 4. can serve as an important complement to experimental research findings

Answer 51

1. unnecessary complexity - using unfamiliar technical terminology, or phrasing that overloads your working memory 2. vague questions - using imprecise terms, or poor grammatical structure 3. loaded/leading questions - embedding questions with misleading information, written in a way to bias the information 4. double-barrelled questions - asking two things at once 5. negative wording - the question should not have double negatives 6. yay-say or nay-say wording - hard to distinguish responses from a participant, especially if they are not actually doing the survey properly. To fix this, put a specific question saying "for this question, please put highly disagree."

Answer 52

How great is our hard-working customer service team?

Answer 53

Was the product easy to find and did you buy it?

Answer 54

Was the facility not unclean?

Answer 55

1. Likert scales 2. Use an odd number of levels 3. Rating scales 4. Non-verbal scales (i.e., use of a facial expression)

Answer 56

This is the human tendency to answer questions in ways that are the most complimentary, or flattering, to the respondent rather than telling the absolute truth. This includes demand characteristics, and social desirability bias.

Answer 57

1. Anonymity 2. Deception 3. Disguise the dependent measure by putting random questions so the participant cannot guess the true purpose of the story 4. Ask the participant what they thought the hypothesis was during the debriefing stage

Answer 58

1. Attractive and professional layout 2. Neatly typed, and free from errors 3. Consistent point scales 4. Ask interesting questions first 4. Keep the survey short as possible

Answer 59

1. in person to groups or individuals 2. Mail 3. Internet surveys 4. Apps

Answer 60

Pros - less costly than interviews, and can ensure anonymity | Cons - Boredom and distraction. Participants may also not answer correctly.

Answer 61

Population - all individuals of interest | sample - a sample of some of these individuals

Answer 62

If we study everyone in the population, it is referred to as a census.

Answer 63

A range of values that's likely to include a population value with a certain degree of confidence.

Answer 64

1. Greater confidence intervals, and larger variability.

Answer 65

In non-probability sampling, you do not need to use random sampling, but rather you can use convenience sampling. This is because the phenomena under investigation is expected to be relatively similar across the population. Ex: limit of short term memory. In probability sampling, the phenomena under investigation is expected to vary across the population. Ex: beliefs, values, political view etc. In this case, we need a technique that would be representative of an entire population.

Answer 66

1. Simple random sampling - every member of a population has an equal probability of being selected 2. Stratified random sampling - population is divided into subgroups, and random samples are taken from each strata. Ex: All students are grouped by their major, and then 50 students are randomly chosen from each major. 3. Cluster sampling - randomly selects clusters and uses all individuals belonging to those clusters. Ex: psych majors are identified at 100 schools. Then 10 of those clusters are chosen. All students in each cluster are sampled. 4. Multistage cluster sampling - clusters are identified, and then only some individuals from each cluster is chosen 5. Systematic sampling - choosing every nth person from a group

Answer 67

1. Convenience sampling - sampling whoever is most convenient 2. Purposive sampling - a sample of individuals that meet a pre-determined criterion (ex: age, gender etc.) 3. Quota sampling - sampling reflects the numerical composition of various groups

Answer 68

No, this is not necessary.

Answer 69

p>0.05 indicates that the results are not statistically significant, and that random assignment has failed.

Answer 70

It is used to test the difference between the population of two means.

Answer 71

We use this when we are interested in two variables by the same subject (ex: measuring the left hand and right hand)

Answer 72

Used to determine the difference between one factor, with at least two levels that are independent of each other.

Answer 73

We use this to find the significance between two or more categorical variables.

Answer 74

1. scale of measurement (nominal vs. ordinal, or ratio vs. interval) 2. how many levels of the IV are there? 3. Repeated measures vs. independent groups 4. are we looking for parametric or non-parametric tests?

Answer 75

Independent groups, and repeated measures

Answer 76

Different participants experience different levels of the IV.

Answer 77

The same participants experience all the different levels of the IV.

Answer 78

sometimes in animal studies we have transgenic mice vs. wild type or in humans we want to see the difference between male and female. In addition, sometimes one treatment requires surgery, while the other does not.

Answer 79

1. fewer participants needed 2. greater power 3. more likely to detect true differences 4. reduces confounds because it accounts for individual differences, since it it all the same participants

Answer 80

Order effects: fatigue, practice, contrast effect (dog photo example) Demand characteristics

Answer 81

Counterbalance - do treatment A and then B, and then switch to B and then A.

Answer 82

If you have 4 conditions, you have to have 24 variations. As a result, you would need a minimum of 24 people just to have 1 person in each condition.

Answer 83

1. We can use partial counterbalancing. This is the use of the latin square. Each condition appears in each position once. 2. Randomize the treatment conditions 3. Reverse counterbalancing ABC -> CBA 4. Have a time interval between conditions 5. Use independent group design

Answer 84

1. avoids order effects 2. avoids demand characteristics 3. treatments with relatively permanent effects 4. similar to real world setting

Answer 85

1. greater risk for confounds due to individual differences | 2. any true differences may not be detected due to lower power

Answer 86

1. random assignment 2. matched pairs 3. using spouses because the other individual is likely similar age, and living in the same environment

Answer 87

1. use of a placebo group to avoid confusion | 2. waitlist control group

Answer 88

Ask tasks get more difficult, if participants are uncertain about their answer in a task, they will give an answer they are not sure of. Sometimes they will answer incorrectly on purpose, so the questions stay at the same difficulty level.

Answer 89

Do not always increase difficulty after every correct answer.

Answer 90

Experimenters who know the treatments may treat participants in different conditions differently or interpret data differently.

Answer 91

1. Repeated measures design 2. Single-blind or double blind 3. Automated presentation of conditions and recording of data (i.e., use of a computer program)

Answer 92

IVS with more than two levels, or designs with more than one IV.

Answer 93

1. Detect non-linear relationships between the IV and DV. For a curved and non-linear relationship we need at least 3 levels. Rule out alternative explanations and eliminate confounds. Refer to music example

Answer 94

This experiment has 2 IV and 2 levels.

Answer 95

A marginal mean, shows the average of the main effects of the independent variable.

Answer 96

Main effect in A or B, have a difference in the averages Interactions have a difference between the cells. Main effects on graph you see parallel lines Interactions - non parallel lines (crossing lines, converging, diverging)

Answer 97

``` How many IVS? - 3 How many DVS? - 1 How many levels? - 2 + 3 + 4 = 9 How many main effects? - there can be three, one per IV How many interactions? AB,AC,BC,ABC ```

Answer 98

Measurement error: systematic error + random error The extent that a measure is unreliable Systematic error: Created by faulty equipment or bias (the error is always the same amount each trial) Decreases validity Random error: Errors are unpredictable and cannot be reproduced. Decreases reliability

Answer 99

Variability: Does the operational definition measure the concept it's supposed to? Reliability: Is the operational definition based on observable, objective behaviors?

Midterm 1 Flashcards

(123 cards)