Research methods and statistics 1 (year one) Flashcards

Question

What are the 4 levels of data?

Answer 1

1. nominal 2. ordinal 3. ratio 4. interval

Answer 2

names, categories

Answer 3

data is ordered (e.g on a likert scale)

Answer 4

data points have a similar interval between them (e.g height)

Answer 5

same as interval, but x has a zero value

Answer 6

no set hypothesis explores opinions, experiences etc just from asking people e.g interviews

Answer 7

numerical data analysed with maths based methods | via surveys, tasks etc

Answer 8

``` - Nominal examples male/ female, smoker/non-smoker - Ordinal examples shoe size, position in race, subjective opinion (likert scales) - Interval examples voltage, temperature - Ratio examples (CANNOT go below 0) Height, weight, test scores ```

Answer 9

- describing data - see what data "looks like" - looks at central tendancy and measures of dispersion - only tells us about our sample, not population

Answer 10

- use sample to make inferences about the population | - help us reach conclusions beyond our data

Answer 11

1. - data level - distribution of data 2. - measures of central tendancy - measures of dispersion

Answer 12

- this is how "most" people behave | - measures : mean, median, mode

Answer 13

1. - most common value or score - mostly used with nominal data 2. - central value in a data set ordered from lowest to highest - mostly used with ordinal level data/ skewed data 3. - add up the two scores in the middle and divide by 2 4. - add up all the scores and divide by the number of values

Answer 14

1. - the frequency of the data 2. - scores average around the middle, very few extreme scores - bell-shaped curve - median 3. - mean is not central - extreme scores affect the mean - not normally distribution

Answer 15

1. - remove extreme results 2. - take off 5% of scores from each end equation = %required x no. scores 100

Answer 16

- average distance of scores from the mean

Answer 17

- range, interquartile range, variance and standard deviation

Answer 18

- difference between largest and smallest score

Answer 19

- difference between middle 50% of scores

Answer 20

- want to find the range of the middle 50 % of scores (second and third quartile) - % required x no. scores 100 - answer rounded gives how many scores to take from top and bottom, which you find the range from

Answer 21

- deviation of scores from the meaan - subtract the mean from each score, then find the average - add up and square all scores, take them away from sum of scores squared divided by n, divided by n-1 - n = how many values there are

Answer 22

- square root of the variance

Answer 23

Nominal = frequencies/ %/ mode Ordinal = median (+range) Interval/ ratio skewed = median (+range) ND = mean (+/-SD)

Answer 24

nominal data level | - association between categorical variables

Answer 25

(values taken from observed frequencies) (row total)x(column total) N

Answer 26

evaluate how well the data in your sample supports the null hypothesis

Answer 27

o High p value : data are likely with a true null | o Low p value : data are unlikely with a true null

Answer 28

level at which we accept result to be significant

Answer 29

- Φ .1 = small - Φ .3 = medium - Φ .5 = large

Answer 30

- Type 1 : rejection of true null hypothesis (false positive) - Type 2 : accepting a null hypothesis (false negative)

Answer 31

- Change the alpha level to prevent type 1 errors | o Divide alpha level by number of tests that will be conducted

Answer 32

: method of manipulating data to achieve significant results  Multiple analysis  Omitting other info  Controlling for variables  Analyse part way through then gather more data until a significant result is found  Changing DV

Answer 33

- Null hypothesis = statement of no difference o True until there is evidence against it - Alternative hypothesis = statement of difference or association

Answer 34

- One tailed hypotheses = state which direction the effect will be in (e.g those that subscribe to Zoella will be more likely to choose the unhealthy snack - Two tailed hypotheses = no direction stated

Answer 35

show medians, ranges, IQ ranges, skewness etc  Range = upper adjacent value – lower adjacent value  Uneven whiskers = skewness

Answer 36

IQ Range = upper hinge – lower hinge (whole box)

Answer 37

- Deviation from symmetry - Show a big difference between means, medians, and mode - Extreme scores affecting the mean - Positively skewed : scores greater than the mean skewing - Negatively skewed : scores lower than the mean skewing

Answer 38

- Refers to extent to which scores cluster at the tails of the distribution – changes pointiness - Positive kurtosis : leptokurtic distribution - Negative kurtosis : platykurtic  Flatter than normal

Answer 39

more than twice the standard error

Answer 40

: difference between parameters is significant

Answer 41

- Between subjects : independent, looks at performance between subjects/groups Disadvantages  High sample size  Individual differences

Answer 42

- Confounding variable : extraneous variable that influences results - Situation variables : variables in condition that could confound, e.g environment, temperature, time of day

Answer 43

- Expectancy effects : expecting an effect can cause that effect e.g expecting a substance rather than a placebo may lead to experiencing some of the effects

Answer 44

- Balancing and matching techniques 1. Random allocation 2. Matched group design 3. Natural group design

Answer 45

RANDOM ALLOCATION DESIGN - Participants randomly assigned to groups - Controls for participant variables - Sample size should be larger

Answer 46

MATCHED GROUP DESIGN | - Matches participants based on a certain characteristic (sometimes DV)

Answer 47

WITHIN SUBJECTS DESIGN - Repeatedly measure the same people on the same DV Disadvantages o Boredom/ fatigue o Order/practice effects o Individual differences o Time consuming conditions o Can’t use for experiments where task cannot be repeated (e.g first impressions) o Can’t be used if there’s differential transfer (effects of one condition affect performance in some conditions (e.g using cannabis then placebo)

Answer 48

effects of one condition affect performance in some conditions (e.g using cannabis then placebo)

Answer 49

 Learning  Fatigue  Habituation (leads to reduced response)  Sensitisation (leads to greater response)  Contrast (may lead to less effort if initially rewarded  Adaptation (e.g low light levels, drug effects)

Answer 50

- Each condition given to each participant once - Order of administration varied - Practice effects balanced

Answer 51

o All possible orders | o Selected orders

Answer 52

 Have to calculate the factorial based on levels of IV (result of multiplying number by all numbers less than it)  Used on 3-4 conditions or less

Answer 53

 Based on Latin square  Used for more than 3 conditions  Each condition occurs once in each position  Each condition precedes/ follows each other condition only once

Answer 54

- Each condition administers several times (different orders each time) - Practice effects balanced for each participant - 2 main counter balancing methods o Block randomisation o The ABBA design

Answer 55

1. BLOCK RANDOMISATION  Consists of all conditions  Participants complete the conditions several times, each time in a different order 2. ABBA DESIGN  Presents one random sequence of conditions, then the opposite sequence

Answer 56

``` o Naturalistic observation o Behaviour occurs naturally, experimenter is a passive recorder ADVANTAGES  High external validity  Can investigate complex social situations  Useful for developing theories DISADVANTAGES  Time consuming/ expensive  Description, not causation  Not useful for specific hypotheses ```

Answer 57

``` - PARTICIPANT OBSERVATION o Undisguised • Researcher part of group • In depth interviews/ observations  Advantages  No ethical problems  Natural setting  Openly record data  Disadvantages  Behaviour may change due to presence o Disguised • Those observed are unaware • Prevents observer influence  Advantages  Access to particular groups  Natural setting  Disadvantages  Ethical issues  Problems recording data  Researcher bias  Interaction : researcher may change the observeds behaviour ```

Answer 58

- Cause an event or set up a situation - Observe specific behaviour in a particular setting - No attempt to control for other variables - Uses behavioural checklist or code using mutually exclusive categories - Same procedures across other observers

Answer 59

- Well controlled in natural setting | - Manipulate IV to observe effect on behaviour

Answer 60

- Consistency in measuring between observers | - Correlations can be used to check reliability

Answer 61

- Reactivity : participant modifies behaviour when they know they are being observed  Socially normative behaviour to gain approval  Demand characteristics : change behaviour depending on what the expected objective of the research is - Controlling reactivity : unobtrusive measurement  Disguised participant observation  Adaptation : habituation, desensitisation  Indirect measurement : physical traces, archival data - Expectancy effects with observer bias : knowledge of hypothesis/ previous research  Can be controlled by blind observers

Answer 62

 Precipitate an uncommon/ difficult to observe event  Gain access to closed event/situation  Establish comparison by adding/manipulating IVs  Control antecedent events/ behaviour  Vary qualities of a stimulus event  3 kinds : participant observation, structured observation, field experiments

Answer 63

- Assess relationships between variables

Answer 64

- tells us about the strength of the association : range from -1 to +1 - negative values : negative correlation (-1 = positive) - positive value : positive correlation (+1 = positive)

Answer 65

positive correlation - as one variable increases so does the other negative correlation - as one variable increases the other decreases

Answer 66

- other explanations for correlations: 3rd variable, chance

Answer 67

- spearman rank correlation - ordinal level data - skewed ratio/ interval level data - pearson product-moment correlation - normally distributed interval/ratio level data

Answer 68

- spearman rank correlation - ordinal level data - skewed ratio/ interval level data

Answer 69

- pearson product-moment correlation | - normally distributed interval/ratio level data

Answer 70

- uses the ranks of the data, not the actual data - not influenced by skewed data - when we have two values that would get the same rank we add together the ranks and divide by how many tied scores there are

Answer 71

- shows a positive/negative/no correlation - spearman's : rs - degrees of freedom = n-2 - order = rs (degrees of freedom) = correlation coefficient, p

Answer 72

- one-tailed : direction is stated - we can halve the two-tailed p value to find one-tailed - alpha values : .025

Answer 73

- alpha value : level at which effect is significant | - typically .05, so p values below .05 are significant

Answer 74

- degrees of freedom : the number of observations in the data that are free to vary when estimating parameters

Answer 75

-pearson's symbol = r

Answer 76

- present lots of variables in a table - correlation matrix | - APA format

Answer 77

- predetermined questions - includes questionnaires and structured interviews - can be given online, mail etc

Answer 78

- mail : + convenient | - response rate/bias

Answer 79

``` - internet : + efficient/cheap + convenient + large/diverse sample - representativeness - ethics ```

Answer 80

- phone : + some questions easier to ask + large, diverse sample - sample bias - interviewer bias

Answer 81

- group : + captive audience + large amount of data quickly - privacy/anonymity - pressure

Answer 82

- interview + same questions/order + quantitative analysis - interview bias/ social context

Answer 83

- personally : + convenient/ large sample + good response rate - representativeness/demand characteristics/ questionnaire fatigue

Answer 84

- ability/ aptitude (e.g numerical/verbal reasoning | - personal qualities (personality/attitudes)

Answer 85

- 605-1905 : Chinese civil service exams used to recruit officials

Answer 86

- 1917 : army alpha and beta tests developed by Robert yerkes - evaluated intellectual/ emotional functioning - tested verbal/ numerical ability, (e.g following directions) - also tested capability of serving, job classification, leadership potential - beta test - non verbal equivalent - allowed intelligence classification as superior, average, inferior - highest to lowest score : white Americans, north/west European immigrants, south/ east European immigrants, black Americans - test was very amercio/eurocentric (e.g what is crisco, celebrities) and required cultural knowledge - actually measured level of education / acculturation

Answer 87

- woodworth personal data sheet - world war 1 by US army - a test of emotional stability (susceptibility to shell shock) - first personality test

Answer 88

- Stanford-binet IQ test - used to assess for learning disabilities - used today for clinical/ neurological assessment and educational placement

Answer 89

- need a topic, then draft, then reexamine/ revise, then do pilot study, then edit and specify procedures for administering

Answer 90

- questions must be simple - dont use double barralled questions - avoid using loaded/ guiding questions - avoid negative wording (e.g do you think students shouldnt pay tuition fees

Answer 91

- open ended questions + detailed answers +/- quick to design, long analysis time + participant led - subjective interpretation - partially open-ended (multiple answers given

Answer 92

- closed questions (e.g likert scales, true/false) - guessing - unsubtle - complex to design, quick to mark - theory led - questionnaire fallacy : people will find a box to tick, even if their opinion is not represented

Answer 93

- e.g yes or no, agree or disagree, graphic rating scale - likert scale : labelled statements of a varying strength (e.g strongly agree to strongly disagree - each measure given a score (positive question : strongly agree = 5, negatie question : strongly agree = 1 - semantic differential scale : connotative meaning between bipolar adjectives, and rating is placed on a scale inbetween

Answer 94

- order effects/ priming : detailed questions at first may influecne later general questions - thinking about how the answer to one question while answering another - counter balance questions and randomising can help

Answer 95

- demand characteristics : answer in a certain way to sabotage/ give "beneficial" answers/ look more desirable

Answer 96

acquiescent : always agreeing/ disagreeing, even if it contradicts previous answers - use a mix of positive/negative questions to overcome

Answer 97

- extreme/ neutral responses : may not be concentrating, sabotaging etc - raw data may need to be disregarded

Answer 98

cultural bias : language could be misunderstood, multiple interpretations of words, differing opinions between cultures, social desirability differs

Answer 99

- assumptions : attitudes can be verbally expressed - statements will have the same meaning for all participants - attitudes can be quantified - problems : consistency, social desirability, ambivalence, normative response bias (use a lie scale) - implicity : do the statements express what they should clearly or not?

Answer 100

- used in a clinical setting to determine complexes/ deficiencies used to predict things such as drug use - advantages : quick/ easy to administer - predict prospective drug use - self scoring improves validity - disadvantages : colloquialism - cant make standardised procedures - tests may not be implicit

Answer 101

- implicit cognitive tasks : infer attitude/ beliefs from performance on different tasks - often use reaction times - e.g IAT - automatic association between concepts, used for attitudes towards age/gender/ race etc - now computer based - categorised target concepts with an attitude as quickly as possible - faster association = stronger correlation - disadvantages : cultural values vs beliefs/ attitudes - ecological validity - may not act that way

Answer 102

- reliability : internal (all items measure the same thing) external (consistent across time and setting) - test-retest reliability and split-half reliability

Answer 103

- classical test analysis : assumes observed score (X) is made up of true score (T) and random error score (E) : X=T+E - random error : reading errors, social desirability bias, mood, tiredness - systematic error : characteristic of test e.g "how often do you go to the cinema" would be influenced by factors like wealth

Answer 104

- validity : content validity (covers all behaviour/ aspects) construct validity (measures theoretical construct) criterion- orientated validity (correlates with establishes measure)

Answer 105

- standardisation: standardised instructions and procedures

Answer 106

established population norms : should be able to compare results to an appropriate established tests/theories

Answer 107

- knowledge based : ability, aptitude, achievement e.g intelligence tests, clinical assessment instruments - person based : personality, mood, attitude to assess differences between people

Answer 108

- normative reference testing : scores compared to norm e.g mean/ median split - criterion reference testing : scores compared to pre-determined criteria e.g determine if someone is at risk - restrictive : doesnt take in to account individual differences or non-clinical samples

Answer 109

- focuses on predicting variance in an outcome (criterion or response variable) from predictors (IV) - creates a statistical model to find out whether model is a good fit for data and find whether there is a significant association/ direction of association

Answer 110

- linear relationship formula = Y = bX + a - Y : criterion/response variable - b: slope of the line (based on Pearson's r) - X: predictor variable (years of experience) - a: constant or intercept - calculates line of best fit for the observed data which can be used to make predictions for unobserved values

Answer 111

- Yi = (B0 + B1Xi) + ei

Answer 112

- two variables - X is predictor variable (IV/explanatory variable) - Y is criterion variable (response/outcome/criterion/DV)

Answer 113

- normally distributed continuous outcome - independent data - interval/ ratio predictors - nominal predictors with two categories (dichotomous)

Answer 114

- R square/ adjusted R square - how close data is to fitted regression line - proportion of variance explained by the model - presented as a percentage - coefficient of determination

Answer 115

- ANOVA | - measure of model fit : tells us how well regression fits the data

Answer 116

- beta coefficient | - number of SDs the criterion variable will change as a result of one SD change in the predictor variable

Answer 117

- we need to : - assess model fit (f value) - know how effective model is - R squared value - know whether an association is significant and direction - beta value

Answer 118

- Example : - a bivariate regression was conducted to investigate the association between years of experience and salary. The regression model predicted approximately 70% of variance in salary, adjusted R^2 = .70, F(1,8) = 22.34, P= .001. There was a positive association between years of experience and salary 𝜷 = .86, p = .001

Answer 119

- can increase the amount of variance explained by a model by including additional variables

Answer 120