RIP final Flashcards
How to proceed with answering the question: Is there a difference between the mean resting heart rate of men and women?
The first step is to calculate the difference between the two means. We then transform this distance into a relative distance (the t-statistic), which allows us to compare the difference against a standardized distribution (the t-distribution). We calculate the test statistic using the formula for t. Once we have the value of t, we use the p-value to measure how extreme the difference is.
What is the formula for the t-statistic?
observed difference/standard error for the difference in the two means
(M1 - M2) / SE(M1 - M2)
Once we have the value of t, what do we use to measure how extreme the difference is?
p-value
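As a sketch of these two cards in Python (the heart-rate values below are made up purely for illustration; `scipy.stats.ttest_ind` computes both t and the p-value):

```python
from scipy import stats

# Hypothetical resting heart rates (bpm) for two independent groups
men   = [68, 72, 75, 70, 74, 69, 71, 73]
women = [74, 78, 76, 80, 75, 79, 77, 81]

m1 = sum(men) / len(men)
m2 = sum(women) / len(women)

# Independent-samples t-test: returns the t-statistic and the p-value
t, p = stats.ttest_ind(men, women)
print(f"M1 - M2 = {m1 - m2:.2f}, t = {t:.2f}, p = {p:.4f}")
```

A p-value below the significance level would indicate that a difference this extreme is unlikely when H0 is true.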
conditions of causality
- covariance
- temporal precedence
- internal validity
internal validity
Alternative explanations for the relationship should be ruled out
randomized experiment
A research design where:
▪by randomization, groups can be assumed to be similar
▪one variable is manipulated (varied) by the researcher
▪the researcher measures the effect of this manipulation on another variable (the outcome)
confounding variable
A second variable that happens to vary systematically along with the intended independent variable. This variable is therefore an alternative explanation for the results
internal validity
asks if groups were comparable at the beginning of the experiment, with respect to the dependent variable and other relevant variables (observed and unobserved). If, for some reason, the groups turn out not to be comparable at the start of the experiment, we speak of a selection effect
selection effect
Crucial question: how were the groups created? If, for some reason, the groups turn out not to be comparable at the start of the experiment, we speak of a selection effect. To reduce selection effects, groups must be formed using random assignment.
goal of random assignment
making sure that: the mean and variance in scores, on all variables, measured and unmeasured, are similar for both groups at the onset of the study
randomization issues
contamination
contamination in randomization
▪Participants in the experimental group communicate with participants in the control group
▪Participants do not adhere to the treatment
▪Influence from researcher(s)
PICO
The identifier of an experimental research question
Population
Intervention
Comparison
Outcome
what do researchers use when comparing mean scores of two independent groups?
independent sample t test
standard error for difference in means
contains the group sizes (n1 and n2) and the spread in scores in both groups (SD1 and SD2)
With the t-test we consider the relative difference between the groups, using:
*The mean difference: M1 - M2
*The spread in scores in both groups: SD1 and SD2
*The group sizes: n1 and n2
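A minimal sketch of this relative difference, assuming the pooled-variance version of the standard error (all numbers below are hypothetical):

```python
import math

# Hypothetical summary statistics for two groups
n1, n2 = 30, 30
sd1, sd2 = 8.0, 10.0
m1, m2 = 72.0, 76.0

# Pooled SD: weighted average of the two group variances
sd_pooled = math.sqrt(((n1 - 1) * sd1**2 + (n2 - 1) * sd2**2) / (n1 + n2 - 2))
# Standard error of the difference in means
se_diff = sd_pooled * math.sqrt(1 / n1 + 1 / n2)
# t = observed difference relative to its standard error
t = (m1 - m2) / se_diff
print(f"SE = {se_diff:.3f}, t = {t:.3f}")
```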
the idea behind the test statistic t
When a lot of samples are drawn from a population in which H0 is true, the difference between the sample means will often be near zero. So, t will often be near zero, too. Values of t that are far from zero will be found less often.
what does the standard error (in the t formula) depend on?
*Group sizes (n1 and n2)
*Variation in scores in both groups (SD1 and SD2)
as standard deviation increases, standard error
also increases
as n increases, standard error
decreases
overall the test statistic is dependent on
- relative difference in means
- standard deviation pooled (weighted average of sd in sample 1 and sd in sample 2)
- and sample size per group
a larger difference in means means what for the t value
larger t
more variation in scores means what for the t value
smaller t
larger samples means what for the t value
larger t
randomization
- key of true experiment
- observed and unobserved factors are equally likely in both groups
- transparent, reproducible
- allows causal claims
between subject design
When participants are divided into different groups and each groups receives different treatment. The data is then compared between groups
within subject design
When all participants receive all different treatments (one after the other, possibly randomized in order). We first compare the data within each person
how does a pretest-posttest design compare to posttest
a pretest can serve as a randomization check, allow correction for pre-existing differences, and track changes over time. In a posttest-only design, we would not know if/how the groups differed at the beginning.
disadvantage of the pretest-posttest design
learning effect
solomon four group design and advantages/disadvantages
combines pretest-posttest and posttest-only groups. It can detect unequal groups at the beginning and check for a learning effect; however, it can be highly costly.
repeated measures design
where the same participants are measured multiple times under different conditions or at different time points. This allows researchers to examine changes within individuals, reducing variability and the need for a large sample size.
counterbalanced measures design
A research design used to control for order effects in repeated measures studies. Participants experience all conditions, but the order of conditions is varied across participants to prevent biases from practice, fatigue, or carryover effects.
quasi-experiment
Research designs that evaluate the effect of an intervention or treatment without random assignment. Instead, groups are naturally formed or pre-existing, making them useful in real-world settings where randomization isn’t feasible.
interrupted time series design
A quasi-experimental design that measures an outcome variable repeatedly over time, both before and after an intervention or event (the “interruption”). It evaluates changes in trends or levels caused by the intervention, making it useful for analyzing the effects of policies, treatments, or external events.
field experiment
An experiment conducted in a natural setting, or under a close simulation of the conditions in which the process under study occurs
threats to internal validity
design confounds
selection effect
design confounds
A second variable that happens to vary SYSTEMATICALLY along with the intended independent variable
▪This variable is therefore an alternative explanation for the results
threats to internal validity in experimental design
▪Design confounds
▪Selection effect
▪Contamination
▪Learning effect
▪Maturation
▪History
▪Regression to the mean
▪Attrition
▪Testing
▪Instrumentation
threats to internal validity in all research
▪Observer bias
▪Demand characteristics
▪Placebo effect
Observer bias
When the researcher has certain expectations and is influenced by this in assessing the participants/ interpreting the result
Demand characteristics
When the participants realize what the study is for and therefore start to behave differently (in the expected direction)
Placebo effect
When participants make progress because they believe they are receiving an effective treatment
Maturation
Is it the manipulation or the development (aging, maturing) that caused the differences?
Observed differences between the pre- and post-measurement could arise from natural developments of the participants, when participants’ characteristics change as part of a natural process.
History threats
Is it the manipulation or external events causing the differences?
Not only natural changes of participants are a source of influence, but external events as well - events that are not necessarily related to the study.
Regression threats
Is it the manipulation or the natural “shifting” that caused the differences?
Regression to the mean can occur when the participants show extreme values (on average) at the start of the experiment. At a later time, values are expected to shift towards the ‘normal’, less extreme, mean value.
Attrition threats
Is it the manipulation or the drop-out of a group of participants that caused the differences?
When participants drop out during a study, the outcome can be affected by this. This is primarily a problem when the people that quit the study are different from the people that do not.
Instrumentation threats
Is it the manipulation or the new instrument that caused the differences?
When the instrument measuring the dependent variable changes during the experiment, the results are affected.
What are possible explanations if no effect is found after an experiment
weak manipulations
power problem (there is an effect, but too few participants to detect it)
no effect (there really is no difference in the population)
how is the null hypothesis protected in NHST?
by making the chance of making a type one error small (the significance level)
complement of the chance of a type II error
power: 1 - β
power
chance of correctly rejecting H0. Measures the chance that an existing difference in the population will be found by the sample data and the statistical test
what happens to power when alpha increases?
power also increases. By increasing alpha (the threshold for rejecting the null hypothesis), it becomes easier to reject the null hypothesis, which increases the likelihood of detecting a true effect, thereby increasing power. However, the chance of making a type I error also increases. Researchers need to find a balance between a small value of alpha and high power
factors power is influenced by
The sample size
The size of the difference in the population
The level of significance
The spread (or variability) in the measured scores
The choice of the statistical technique
type two error
A type II error occurs when the null hypothesis is not rejected even though it is false.
when spread in scores decreases, what happens to power?
power increases
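The trade-offs in the cards above can be illustrated numerically. This sketch approximates the power of a two-sided independent-samples t-test via the noncentral t-distribution; the effect size and group sizes are arbitrary example values:

```python
from scipy import stats

def power_two_sample(d, n_per_group, alpha):
    """Approximate power of a two-sided independent-samples t-test
    for effect size d (Cohen's d) with equal group sizes."""
    df = 2 * n_per_group - 2
    ncp = d * (n_per_group / 2) ** 0.5          # noncentrality parameter
    t_crit = stats.t.ppf(1 - alpha / 2, df)     # rejection threshold
    # Chance that |t| exceeds the threshold when the true effect is d
    return (1 - stats.nct.cdf(t_crit, df, ncp)) + stats.nct.cdf(-t_crit, df, ncp)

# Larger alpha -> larger power (at the cost of more type I errors)
print(power_two_sample(0.5, 50, 0.01))
print(power_two_sample(0.5, 50, 0.05))
```

Increasing the sample size or the effect size raises power in the same way.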
4 principles which are the basis of integrity in research
Reliability, honesty, respect, accountability
major violations of scientific integrity
fabrication - making up data, deliberate
plagiarism - copying other people’s work, deliberate
data falsification - not reporting certain findings, adjusting data, or misrepresenting it, all deliberately
publication bias
absence of non-significant effects leads to bias towards large effects
Causes of questionable research practices (QRP)
Scientific journals would like to publish interesting/innovative results, which attracts more readers AND researchers need to publish enough to make a career
p-hacking
things like:
*Removing outliers to make a difference significant
*Adding a few more participants to make results significant
*Running a different analysis than planned
HARKing
hypothesising after results are known: in hindsight, formulating hypotheses and pretending that they were the main focus of the research all along
Solutions to questionable research practices
post-publication peer review
retraction
pre-registration of aims and intended methods and expectations
replication as a standard part of research
Cohen’s D
Used to describe the size of a difference
Measure of relevance; expresses the difference between two means in the number of standard deviations
(M2-M1)/SDpooled
SD pooled
Weighted average of SD1 and SD2
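A sketch of Cohen's d with the pooled SD, using made-up scores:

```python
import math
import statistics

# Hypothetical scores for two groups
group1 = [4, 5, 6, 5, 7, 6, 5, 4]
group2 = [6, 7, 8, 7, 9, 8, 7, 6]

m1, m2 = statistics.mean(group1), statistics.mean(group2)
sd1, sd2 = statistics.stdev(group1), statistics.stdev(group2)
n1, n2 = len(group1), len(group2)

# Pooled SD: weighted average of the two group variances
sd_pooled = math.sqrt(((n1 - 1) * sd1**2 + (n2 - 1) * sd2**2) / (n1 + n2 - 2))
# Cohen's d: mean difference in units of (pooled) standard deviations
d = (m2 - m1) / sd_pooled
print(f"Cohen's d = {d:.2f}")
```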
confidence interval
another way to describe the size of the difference between the two groups. a range of plausible values based on the sample data
width of confidence interval depends on
- Sample size (smaller standard error –> narrower interval)
- Spread/variation in scores in the population (greater spread of scores in the sample means more uncertainty –> wider interval)
- Chosen confidence level (95% confidence level widely used - more certainty, wider interval)
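A sketch of a 95% confidence interval for the difference in means, using hypothetical summary statistics and the pooled standard error:

```python
import math
from scipy import stats

# Hypothetical summary statistics
n1 = n2 = 40
m1, m2 = 72.0, 76.0
sd1, sd2 = 8.0, 9.0

df = n1 + n2 - 2
sd_pooled = math.sqrt(((n1 - 1) * sd1**2 + (n2 - 1) * sd2**2) / df)
se = sd_pooled * math.sqrt(1 / n1 + 1 / n2)

# Critical t-value for a 95% confidence level
t_crit = stats.t.ppf(0.975, df)
lo = (m1 - m2) - t_crit * se
hi = (m1 - m2) + t_crit * se
print(f"95% CI for M1 - M2: [{lo:.2f}, {hi:.2f}]")
```

Larger samples shrink the standard error and therefore the interval; a higher confidence level widens it.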
Four parts to evaluate statistical validity
- significance is determined based on test statistic t and the p-value
- relevance is assessed using a measure of effect size, such as cohen’s d
- accuracy is assessed using a confidence interval
- suitability of statistical test is assessed by checking the assumptions
how is effect size measured for regression analysis
R squared
how is effect size measured for chi squared
Cramer’s V
Three claims
Frequency claim
Association claim (correlation and regression studies)
Causal claim (best made in context of randomized experiments)
Four validities
Construct
Internal
External
Statistical
Assessing statistical validity
Significance (determined by the p-value)
Relevance (assessed using effect size)
Accuracy (assessed using confidence interval)
How is suitability of a statistical test assessed
- check assumptions
- check if hyp match expectations
- check if results match hypotheses
assumptions of t test
- random sample
- dependent variable is of interval or ratio measurement level
- two groups are independent
- scores in both groups are normally distributed
- scores in both groups have equal spread
Violating these assumptions leads to lower statistical validity
How can we check Assumption 1 of t-test
- read methods section of article; how did researchers select participants?
if sample is not random:
- be cautious interpreting results, because a random sample ensures independence of observations
How can we check Assumption 2 of t-test
- methods section
- how are constructs operationally defined? is it plausible we can interpret in interval/ratio level?
- if you have enough levels in an ordinal variable, people usually won't object to treating it as interval (e.g. an aggression scale)
What if DV of a t-test is not interval or ratio (or ordinal)? eg answers to yes/no questions
Solution: use a statistical test for categorical variables (the chi-squared test of homogeneity)
Chi-squared test of homogeneity
Q example: is the distribution of answers of people with treatment the same as the distribution of answers of people without?
- two independent samples (like t-test)
- DV is categorical (unlike t-test)
- Used to determine if the distribution of a categorical variable is the same in two groups, can be used with more than 2 groups
Chi-squared test of homogeneity hypotheses
H0: distribution of answers in control is equal to distribution of answers in treatment
H1: distribution of answers in control is different from the distribution of answers in treatment
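A sketch of the chi-squared test of homogeneity with hypothetical yes/no counts; `scipy.stats.chi2_contingency` does the work (and applies Yates' continuity correction for 2x2 tables by default):

```python
from scipy.stats import chi2_contingency

# Hypothetical yes/no counts in two independent groups
#            yes  no
observed = [[30, 20],   # treatment group
            [18, 32]]   # control group

# H0: the distribution of answers is the same in both groups
chi2, p, df, expected = chi2_contingency(observed)
print(f"chi2 = {chi2:.2f}, df = {df}, p = {p:.4f}")
```

A small p-value suggests the distributions differ between the groups.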
How can we check Assumption 3 of t-test
- Read Methods Section of an article
- Are the participants randomly assigned to two separate groups?
- Is there a link between the measurements in the two groups?
What if two groups are not independent (assumption 3 of t-test)
Solution: conduct a t-test for dependent samples
How can we check Assumption 4 of t-test
Independent sample t-test: two histograms, 1 of scores in control group and 1 of scores in experimental group
Paired sample t test: make 1 histogram of difference scores
How can we check Assumption 5 of t-test
Can use side-by-side box plot and observe the spread of the arms
- graphical checking is preferred
Can also use one of the formal tests for equal variances (a significant result means the variances are unequal)
- Levene's test
- Brown-Forsythe test
- F-max test
What to do if the equal variance assumption (assumption 5) is not satisfied?
Use an alternative called Welch’s test
The t-test we use under the assumption of equal variances has more power, so that option is preferred
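A sketch contrasting Levene's test with Welch's test in scipy, on made-up groups with clearly unequal spread:

```python
from scipy import stats

# Hypothetical groups: group2 has a much larger spread than group1
group1 = [10, 12, 11, 13, 12, 11, 10, 12]
group2 = [8, 20, 5, 25, 14, 3, 22, 18]

# Levene's test (scipy's default centers on the median, i.e. the
# Brown-Forsythe variant); a significant result suggests unequal variances
w, p_lev = stats.levene(group1, group2)

# Welch's test: the equal_var=False flag drops the equal-variance assumption
t, p = stats.ttest_ind(group1, group2, equal_var=False)
print(f"Levene p = {p_lev:.4f}, Welch t = {t:.2f}, p = {p:.3f}")
```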
what do we do to compare the distribution of a categorical variable between two (or more) groups
Use the chi-squared test of homogeneity to test if the distributions are homogeneous
the steps to measuring a theoretical concept
theoretical concept –> conceptual definition —> operational definition —> variable
correlation is used for
measuring strength and direction of linear relationship
regression is used for
describing the linear relationship with an equation and making predictions using this equation when only data on the independent variable is available
Least squares regression is a technique used for
finding the equation of the line best fitting to the data
Residuals
Residuals are the difference between the observed value of Y and the predicted value of Y (= point on the line). When a line fits the data well, the residuals will tend to be small. the equation with the smallest sum of squared residuals is the winner!
Root Mean Squared Error, or Standard Error of the Estimate, in linear regression
the standard deviation of the residuals.
roughly, the average error we make when using the regression equation to make predictions
coefficient of determination is
R squared
What does R squared tell us in a regression model
how much of the variation in Y can be explained by the linear relationship with X. percentage variance explained.
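A sketch tying these cards together: a least squares fit, its residuals, the RMSE, and R squared, with made-up data (study hours vs. exam score):

```python
import numpy as np

# Hypothetical data: X = study hours, Y = exam score
x = np.array([1, 2, 3, 4, 5, 6, 7, 8], dtype=float)
y = np.array([52, 55, 61, 60, 68, 70, 73, 75], dtype=float)

# Least squares: the slope and intercept minimizing the sum of squared residuals
slope, intercept = np.polyfit(x, y, 1)
predicted = intercept + slope * x
residuals = y - predicted                      # observed Y minus predicted Y

rmse = np.sqrt(np.mean(residuals**2))          # roughly the average prediction error
r_squared = 1 - residuals.var() / y.var()      # proportion of variance in Y explained by X
print(f"Y' = {intercept:.2f} + {slope:.2f}X, RMSE = {rmse:.2f}, R^2 = {r_squared:.2f}")
```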
What are the two tests we can use to find out if the linear relationship is a significant relationship in the regression model
option 1: test for the slope
- we can test if the slope is significantly different from 0, using the t-test
option 2: test for explained variance
- to test if the model explains a significant proportion of the variation, we can test whether the proportion of variation explained by the model is significantly greater than 0, using the F-test.
beta (standardized) coefficient
measures the change in Y with one SD increase in X
the assumptions of least squares regression
- linear relationship (check using scatter plot)
- interval or ratio measurement level
- no outliers (check using residual plot)
- residuals are normally distributed
- homoscedasticity (spread around regression line is independent of the value of X)
adding more independent variables to a regression model always….
- explains more of the variation in the DV (so higher R squared)
- reduces the average prediction error (so SE will decrease as accuracy increases)
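This can be demonstrated: with ordinary least squares, the in-sample R squared never decreases when a predictor is added, even a pure-noise one (all data here are simulated):

```python
import numpy as np

rng = np.random.default_rng(0)

def r_squared(X, y):
    """In-sample R^2 of an OLS fit with an intercept."""
    Xd = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(Xd, y, rcond=None)
    resid = y - Xd @ beta
    return 1 - resid.var() / y.var()

n = 50
x1 = rng.normal(size=n)
noise = rng.normal(size=(n, 1))        # a predictor unrelated to y
y = 2 * x1 + rng.normal(size=n)

r2_one = r_squared(x1.reshape(-1, 1), y)
r2_two = r_squared(np.column_stack([x1, noise]), y)
print(r2_one, r2_two)   # R^2 cannot go down when a predictor is added
```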
what to be careful of when removing variables from an MLR model
do it one at a time; never remove multiple variables at once based on the t-test!
Principle of p value in null hypothesis significance testing
Given that the null hypothesis is true, what is the chance of observing the data we observed?
Principle of Bayesian testing
Given the data we observed, what is the chance the null hypothesis is true?
what does the bayes factor measure
How much more does the observed data support the null hypothesis as compared to the alternative hypothesis
relative support for null hypothesis, as measured by
support in data for H0/support in data for H1
What does a Bayes factor of 5 mean
the support in the data for H0 is 5 times greater than the support for H1
what does BF01 measure
support in the data for H0/support in the data for H1
what does BF10 measure
support in the data for H1/support in the data for H0
How do we interpret a BF01 of 0.4
the support in the data for H0 is 0.4 times greater than for H1
but this doesn’t really make sense.
so in this case we flip the Bayes factor so that
BF10 = 1/0.4 = 2.5, so the support in the data for the alternative hypothesis is 2.5 times greater than for the null hypothesis
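The flip is just taking the reciprocal:

```python
# BF01 and BF10 are reciprocals of each other
bf01 = 0.4          # support for H0 relative to H1
bf10 = 1 / bf01     # support for H1 relative to H0
print(bf10)
```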
confidence interval in NHST
Interval estimate to give the reader an idea of the size of the effect
interpretation of credible interval in Bayesian testing
Given the evidence in the data, the mean score of condition A has a 95% chance of falling between x and y
Results of the reproducibility project
In almost all original studies the null hypothesis was rejected (had a p-value smaller than .05)
but only a third of the replication studies were able to reject the null
effect sizes were only half as large in the replications compared to original studies
mission of open science research
"increase the openness, integrity, and reproducibility of scientific research"
*everyone should have access to this scientific knowledge
*everyone should be able to use it for the benefit of science/society
in open science, researchers are…
working digitally
*collecting enormous amounts of data
*able to easily share data online
advantages of open science
Increases citations
increases visibility of academic research
increases reusability of academic research results
disadvantages of open science
the range of high-quality, fully open access journals is still limited
the number of available reliable journals and articles varies per discipline
the quality and reliability of open access journals vary
FAIR principles for how data should be stored
Findable
Accessible
Interoperable
Reusable
Following FAIR guidelines leads to
a greater efficiency of the research process, because new research questions do not always require the collection of new data; suitable data are often already available
*better reproducibility and greater reliability of research
A good data management plan leads to
FAIRness of data
adv and disad direct replication
adv: easy to compare
disad: problems with internal validity in the original research will still be present
adv and disad conceptual replication
adv:
- ability to improve design
- increase internal validity
disadvantage:
- not as easy to compare
adv and disad replication plus extension
adv: Possibility to examine additional research question
disad: Not as easy to compare