Research Methods & Communication Flashcards

Question

In the standard equation y = ax + b what variables are y and x?

Answer 1

y is the dependent variable x is the independent variable

Answer 2

S = (sqrt)S^2 ?????

Answer 3

\* Stem-and-leaf plots \* Histograms \* Boxplots

Answer 4

1) Logical guess based on other people's results 2) Predictions tested 3) Results. Agree with hypothesis = win. If not, formulate new hypothesis.

Answer 5

Attempt to control for variation statistically. - take measurements of variables that might influence the result, and hope we can quantify their influence. - this generally requires replication - we lose some degrees of freedom in estimating the effect of these variables.

Answer 6

The correlation coefficient OR Pearson's Product-Moment Correlation Coefficient OR r. - falls between 1 and -1. - 1 = complete positive correlation - -1 = complete negative correlation - 0 = no correlation - Defined as the covariance divided by the product of their standard deviations.

Answer 7

All of them are strongly affected by discipline.

Answer 8

\* Non-independence of data points and pseudoreplication \* Sample size too small \* Confirmation bias & observer expectation \* Researcher degrees of freedom & 'p-hacking' \* Interpreting non-significant result as meaning something true \* Interpreting a significant result as meaning that something is true

Answer 9

**+** Allows direct statements about probability (eg the probability that one drug is better than another) **+** Can be used to calculate the probability of future observations. **-** It is subjective: because the posterior probability is affected by the prior probability, different people (with different priors) can reach different conclusions from the same data. - However, as more evidence is accumulated the posterior probabilities will converge on the same result, whatever the priors. Advocates of Bayesian statistics argue that since science is based on differences of opinion, methods of analysis should reflect this.

Answer 10

\* allows to identify outliers, errors and patterns in variance \* gives an impression of how the continuous variable is dependent on the categorical variable \* less useful when n is high

Answer 11

\* see relationships between two variables \* check for non-linearity \* check for outliers and errors \* check for change in variance \* check for structure in the data

Answer 12

Use lower significance levels (e.g, 0.01 or 0.001)

Answer 13

Main effect and interactions. A main effect is the effect of one factor in isolation. An interaction is the effect of one factor when the level of the other factors is taken into account.

Answer 14

\* to avoid selection bias \* control for temporal effects \* control for regression to the mean \* basis for statistical inference

Answer 15

In classical statistics, we ask what the probability of seeing our data is, given a particular hypothesis (the null hypothesis)

Answer 16

\* Normal errors \* Independence of data points \* Equal variances - R uses a version that is fine with unequal variances

Answer 17

The partitioning of variance in the data into that unexplained by the factor(s) & that which is explained.

Answer 18

(x-x̄)(y-ȳ) / n-1 ---------------------- SxSy

Answer 19

Used to assess the quality of an individual's scientific output

Answer 20

1. Data - ink ratio & graphical redesign 2. Chartjunk 3. Data - ink maximisation 4. Multi-functioning graphical elements 5. High resolution data graphics 6. Aesthetics & technique in graphical design

Answer 21

Two people will test positive. - one will have the disease - one is a false positive There is a 50:50 chance that you have the disease.

Answer 22

χ2 = Σ (obs - exp)^2 ----------------------- exp

Answer 23

Causes problems because it will lead to systemic underestimation of σ

Answer 24

When the outcomes of an event are mutually exclusive (cannot happen at the same time) i.e the probability of rolling a 2 OR a 5

Answer 25

\* Normal errors \* Variances must be similar across the relationship

Answer 26

χ2 = Σ ((obs-exp)-0.5)^2 ----------------------------- exp

Answer 27

ANOVA calculates the between group variance, or the factor variance. - This is compared with the within group variance, or error variance by using an f test.

Answer 28

Failing to reject a null hypothesis which is actually incorrect. FALSE NEGATIVE

Answer 29

In Bayesian statistics, we ask what the probability of different hypotheses are, given our data: we then pick the most likely hypothesis.

Answer 30

\* Normally distributed errors \* Homoscedasticity \* Observations are independent

Answer 31

S^2 = (x - x̄ )^2 ------------- n-1

Answer 32

95% PI ALWAYS exceeds CI

Answer 33

Single-factor; two-factor; Higher level factorial design; incomplete design; Nested design

Answer 34

With two CONTINUOUS variables

Answer 35

Bayes' rule = P(A|B) = P(B|A) x P(A) --------------------------------- P(B)

Answer 36

When a contingency table is 2x2

Answer 37

Replicate the treatment that it is applied to

Answer 38

\* Each set of measurements must be independent \* No sample must be s exact test instead )

Answer 39

By comparing our value of t with Student's t Distribution which takes account of this.

Answer 40

the difference between means was (or was not) statistically significant (t=X.XX, Ydf, P=Z.ZZ)

Answer 41

It is strongly affected by the length of a person's career

Answer 42

r^2 The coefficient of determination is an estimate of the % variability in one variable explained by the other variable.

Answer 43

\* Variance \* Standard deviation \* IQR

Answer 44

lm(dependent~independent) then use summary( ) to get more information.

Answer 45

(N-1)N/2 pairwise comparisons

Answer 46

COVARIANCE = Σ (x-x̄)(y-ȳ) ------------- n-1

Answer 47

cor.test( )

Answer 48

ANCOVA --\> Analysis of covariance - uses the independent variable as the covariate.

Answer 49

The strength and significance of the relationship between two variables.

Answer 50

\* Compare two means \* Compare a before and after

Answer 51

Use sum of squares (SS), do not use the variance (S^2) SS = Σ(x-x̄ )^2 OR SS = S^2 X df

Answer 52

1-(0.95^number of tests)

Answer 53

To fit a line to allow estimates of the dependent variable to be made from the independent.

Answer 54

Estimating dependent variables from a regression equation outside the range of your data

Answer 55

\* tell us about the shape of the frequency distribution \* helps to identify outliers \* helps to identify possible errors

Answer 56

Reject a null hypothesis that is actually correct. FALSE POSITIVE

Answer 57

number of times articles published in 2010 & 2011 were cited in 2012 ----------------------------------------------------------------------- citable articles in 2010/11

Answer 58

Your result is SIGNIFICANT, reject the NULL hypothesis

Answer 59

A person's H index is the highest number, h, for which they have h papers each with h citations.

Answer 60

x̄ = 1/n Σ

Answer 61

\* structure in the data \* Error distribution \* Variance structure \* Linearity

Answer 62

lm( ) or aov( )

Answer 63

hist(model$residuals)

Answer 64

By the method of least squares

Answer 65

Calculate a p value associated with r

Answer 66

If it is one-tailed then it will have one outcome, if it is two-tailed then it will have two outcomes.

Answer 67

With one CONTINUOUS variable and one CATEGORICAL variable

Answer 68

A prediction interval indicates a region we are 95% certain predictions of the dependent lie.

Answer 69

they are greater than 0.05

Answer 70

On 2 continuous variables

Research Methods & Communication Flashcards

(95 cards)