Definitions Flashcards
Statistical Inference
The process of drawing conclusions about the probability distribution function associated to one or more variables on a population from information obtained on a sample.
Population
A set about which we wish to draw conclusions.
Variable
A variable defined on a population is some characteristic of the elements of that population.
Census
A study where the variables in question are measured for every member of a population.
Survey
A study where the variables in question are measured on a SRS of the population.
SRS
A sample taken with replacement.
Subset
A sample taken without replacement.
Levels of a variable
The possible outcomes you consider for a variable.
Simple Random Sample (SRS)
An SRS of size N of a population is a vector of length N consisting of elements of the population, where every element of the population has an equal chance of being chosen for each entry of the vector.
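A minimal sketch of the SRS (with replacement) versus subset (without replacement) distinction, in Python with numpy; Python is not part of the flashcards and the population values are made up for illustration.

import numpy as np

rng = np.random.default_rng(seed=1)
population = np.array(["a", "b", "c", "d", "e", "f"])

N = 4
srs = rng.choice(population, size=N, replace=True)      # SRS: the same element can be drawn more than once
subset = rng.choice(population, size=N, replace=False)  # subset: each element appears at most once

print("SRS:   ", srs)
print("Subset:", subset)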
Observational study
The collection and analysis of data with the goal of determining the characteristics of a population.
Experiment
Occurs when a researcher is able to control which members of a sample receive one or more interventions or treatments (experimental group) and which do not (control group) or which receive some other comparison treatment (comparison group).
True experiment
An experiment where participants are randomly allocated to groups.
Quasi experiment
An experiment where participants are allocated to groups through some non-random process.
Response/dependent variable
The variable whose values are to be predicted from other values.
Predictor/independent variables
Variables whose values are used to predict values of another variable.
Lurking/confounding variables
Variables that are not measured in an observational study, but which influence both the predictor and response variables.
Nuisance/covariant variables
A variable that is recorded in a study because it may affect the response, but is not one of the primary variables of interest.
Factors
The nuisance and predictor variables in a study or experiment.
Probability Density Function (PDF)
A function that describes the likelihood of a random variable taking a given value.
Statistic
Any quantity that may be calculated from the values of a set of random variables on a random sample of a population.
Estimator
A statistic on a sample which is often taken to estimate some function of the parameters in a model for the random variables on the population.
Sampling distribution
The probability density function associated to a statistic calculated on samples of size n from a population.
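A short simulation sketch of a sampling distribution: draw many samples of size n from an assumed population model and record the statistic, here the sample mean. The normal(10, 2) model, the sample size, and the number of replications are illustrative assumptions.

import numpy as np

rng = np.random.default_rng(seed=2)
n, reps = 25, 10_000

# Each replication: draw a sample of size n, compute the statistic (the sample mean).
means = np.array([rng.normal(loc=10, scale=2, size=n).mean() for _ in range(reps)])

# The spread of these means (the standard error) should be close to sigma / sqrt(n) = 0.4.
print(means.mean(), means.std(ddof=1))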
Most likely value
Of a statistic under a given null hypothesis is the value at which the sampling distribution of that statistic under the null hypothesis takes its maximum.
Region of acceptance
Given a significance level, for the null hypothesis it is the interval of possible values for the statistic on a given sample that will not lead you to reject the null hypothesis.
P-value
Tells you how likely a result as extreme as, or more extreme than, the one obtained from a given study or experiment is to have occurred purely by chance if the null hypothesis is correct.
Categorical variable
A random variable whose possible values cannot be put in any meaningful order.
Quantitative variable
Any random variable whose values can be put in a meaningful order.
Ordinal variable
Variables that have word labels and can be put into order.
Model
For a random variable, a model is a choice of a standard form that we know, or assume, the probability density function associated to the variable takes.
Bernoulli trial
A random variable, X, with two possible outcomes and a single parameter p, representing P(X = 1).
Normal random variable
A random variable whose PDF is a normal distribution.
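A small sketch of simulating the two standard models above; the parameter values p, mu, and sigma are chosen purely for illustration.

import numpy as np

rng = np.random.default_rng(seed=3)

p = 0.3
bernoulli_draws = rng.binomial(n=1, p=p, size=10)   # Bernoulli(p) is Binomial(1, p)

mu, sigma = 5.0, 1.5
normal_draws = rng.normal(loc=mu, scale=sigma, size=10)

print(bernoulli_draws)
print(normal_draws.round(2))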
Exploratory data analysis
A set of techniques involving summary statistics and graphical methods for exploring data before you do formal inference.
Kth q-quantile
For a set of data or a distribution, it is the number below which k/q of the data or distribution lies.
Q-Q plot
For a data set with n points against a model distribution, it is the plot of (x, y) values where the kth y-value is the kth smallest datapoint in the set, and the kth x-value is the kth n-quantile of the model distribution.
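A minimal sketch of building a Q-Q plot against a standard normal model by hand; the data are made up, and the common k/(n+1) convention is used for the model quantiles so the largest point stays finite (conventions for the plotting positions vary).

import numpy as np
from scipy import stats

data = np.array([2.3, 1.9, 3.1, 2.7, 2.2, 2.9, 3.4, 2.5])
y = np.sort(data)                          # kth y-value: kth smallest datapoint
k = np.arange(1, len(data) + 1)
x = stats.norm.ppf(k / (len(data) + 1))    # kth x-value: quantile of the model distribution

# A roughly straight line of (x, y) points suggests the normal model fits.
for xi, yi in zip(x, y):
    print(f"{xi:6.3f}  {yi:5.2f}")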
Standard normal (Z)
A normal with mean = 0 and sd = 1.
5 number summary
{lowest datapoint, lower quartile, median, upper quartile, highest datapoint}
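A quick sketch of a five-number summary with numpy percentiles; the data are made up, and numpy's default quartile interpolation can differ slightly from hand-calculated textbook quartiles.

import numpy as np

data = np.array([3, 7, 8, 5, 12, 14, 21, 13, 18])
low, q1, median, q3, high = np.percentile(data, [0, 25, 50, 75, 100])
print(low, q1, median, q3, high)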
Robust
A statistic is robust if it is not strongly influenced by changes to only a few data points; the mean and standard deviation are strongly influenced by such changes, so they are not robust.
Interaction plot
Used when interested in studying the effect of two categorical predictor variables on a single response variable.
Effect size
The difference between the actual value of the parameter, on the population, and the value of the parameter under the null hypothesis.
Confidence interval
An X% confidence interval for a parameter theta is an interval (L,U) generated by some procedure that in repeated sampling has an X% probability of containing the true value of theta for all possible values of theta.
Confidence procedure
An X% confidence procedure is any procedure that generates intervals containing theta in X% of repeated samples.
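A simulation sketch of a 95% confidence procedure for a normal mean with known sigma: in repeated sampling the intervals should contain the true mu about 95% of the time. The population parameters, sample size, and number of replications are illustrative assumptions.

import numpy as np
from scipy import stats

rng = np.random.default_rng(seed=4)
mu, sigma, n, reps = 10.0, 2.0, 30, 5_000
z = stats.norm.ppf(0.975)                         # ~1.96 for a 95% interval

covered = 0
for _ in range(reps):
    sample = rng.normal(mu, sigma, size=n)
    half_width = z * sigma / np.sqrt(n)
    lo, hi = sample.mean() - half_width, sample.mean() + half_width
    covered += (lo <= mu <= hi)

print(covered / reps)                             # should be close to 0.95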
Unstandardised effect size
The difference in means, m1 - m2.
Type 1 error
Rejecting the null hypothesis when it shouldn't be rejected (i.e. when the null hypothesis is true).
Type 2 error
Not rejecting the null hypothesis when we should (i.e. when the null hypothesis is false).
Smallest relevant effect size
The smallest difference from the null hypothesis value of the parameter that we consider to be important.
Power
The power (1 - beta) of a statistical test is the probability of rejecting the null when the null is false with some effect size greater than epsilon (i.e. the probability of not making a type 2 error when the effect size is large enough to be of interest to us).
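A simulation sketch of power for a two-sided one-sample z-test with known sigma: generate data with the true mean shifted by the smallest relevant effect size epsilon, and count how often H0 is rejected. The parameter values are illustrative assumptions.

import numpy as np
from scipy import stats

rng = np.random.default_rng(seed=5)
mu0, sigma, n, alpha = 0.0, 1.0, 40, 0.05
epsilon = 0.5                                  # smallest relevant effect size
z_crit = stats.norm.ppf(1 - alpha / 2)

reps, rejections = 5_000, 0
for _ in range(reps):
    sample = rng.normal(mu0 + epsilon, sigma, size=n)
    z = (sample.mean() - mu0) / (sigma / np.sqrt(n))
    rejections += abs(z) > z_crit

print("estimated power:", rejections / reps)   # 1 - beta at effect size epsilon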
T-statistic
The statistic we get by replacing the population standard deviation by the sample standard deviation in the z-statistic.
Degrees of freedom
df = n - 1 (for a one-sample t-statistic based on n observations).
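A sketch of computing the one-sample t-statistic and its degrees of freedom by hand, then checking against scipy; the data and the null value mu0 are made up.

import numpy as np
from scipy import stats

data = np.array([5.1, 4.9, 5.6, 5.3, 4.8, 5.7, 5.2, 5.0])
mu0 = 5.0

n = len(data)
t_by_hand = (data.mean() - mu0) / (data.std(ddof=1) / np.sqrt(n))  # sample sd replaces sigma
df = n - 1
p_value = 2 * stats.t.sf(abs(t_by_hand), df)

t_scipy, p_scipy = stats.ttest_1samp(data, popmean=mu0)
print(t_by_hand, df, p_value)
print(t_scipy, p_scipy)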
ANOVA
A generalisation of the t-test to comparing the means of more than two groups.
Full model density
Mu + alpha(i) + epsilon, where mu is a reference level and alpha(i) represents the deviation of the mean for the ith treatment group from the reference level mu.
Reduced model density
Mu + epsilon
SS(R)
Residual sum of squares from the reduced model.
SS(F)
Residual sum of squares from the full model.
One-way ANOVA
Used when there is one categorical predictor variable and one continuous response variable.
F distributions
The distributions followed by the F statistic used in ANOVA; the further the observed F statistic is from 1, the stronger the evidence against the null.
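A sketch of the nested-model comparison behind one-way ANOVA: compute SS(R) from the reduced model (grand mean only) and SS(F) from the full model (one mean per group), then turn the reduction into an F statistic. The groups and data are made up for illustration.

import numpy as np
from scipy import stats

groups = {
    "A": np.array([4.1, 5.0, 4.6, 4.8]),
    "B": np.array([5.9, 6.3, 5.7, 6.1]),
    "C": np.array([5.2, 4.9, 5.5, 5.1]),
}
all_values = np.concatenate(list(groups.values()))
n, k = len(all_values), len(groups)

ss_r = np.sum((all_values - all_values.mean()) ** 2)              # SS(R): residuals from the grand mean
ss_f = sum(np.sum((g - g.mean()) ** 2) for g in groups.values())  # SS(F): residuals from group means

df_r, df_f = n - 1, n - k
F = ((ss_r - ss_f) / (df_r - df_f)) / (ss_f / df_f)
p_value = stats.f.sf(F, df_r - df_f, df_f)
print(F, p_value)    # matches stats.f_oneway(*groups.values())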
Non-parametric test
One that does not make any assumptions about the distribution of residuals.
Ranks
If you have a list of data from quantitative or an ordinal variable, you can put it in order. The position of the datapoint in this ordered list is its rank. If several datapoints are equal, then the rank of each one is the average of their positions on the list.
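A tiny sketch of ranking with ties averaged, which is scipy's default ('average') method; the data are made up.

from scipy.stats import rankdata

data = [7, 3, 3, 9, 5]
print(rankdata(data))   # [4.  1.5 1.5 5.  3. ] -- the two 3s share rank (1 + 2) / 2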
Wilcoxon signed-ranks test
Non-parametric version of a one-sample or paired t-test. For a one-sample test, it tests the null hypothesis H0: the median of the population is m0. For a paired test, it tests the null hypothesis H0: the medians of the two populations satisfy m1 - m2 = m0.
Mann-Whitney u test
Non-parametric version of the independent-samples t-test. It tests the null hypothesis that the probability of an element of the first group being greater than an element of the second group is exactly 0.5.
Kruskal-Wallis test
Non-parametric version of one-way ANOVA, carried out on ranks. H0: for any two groups you consider, the probability that a random element in the first group will yield a greater value of your variable than a random element in the second group is exactly 0.5. Ha: for at least two groups, the probability is different from 0.5.
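A hedged sketch of calling the three rank-based tests above via scipy with default settings; all data values are made up for illustration.

import numpy as np
from scipy import stats

before = np.array([12.1, 11.4, 13.0, 12.7, 11.9, 12.5])
after = np.array([11.8, 11.0, 12.6, 12.9, 11.5, 12.0])
print(stats.wilcoxon(before, after))          # Wilcoxon signed-ranks test (paired)

group1 = np.array([3.1, 2.8, 3.6, 3.3, 2.9])
group2 = np.array([2.4, 2.7, 2.5, 2.2, 2.6])
print(stats.mannwhitneyu(group1, group2))     # Mann-Whitney U test (independent samples)

group3 = np.array([3.0, 3.4, 3.2, 2.9, 3.1])
print(stats.kruskal(group1, group2, group3))  # Kruskal-Wallis test (three or more groups)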
Chi-squared test
Compares the expected counts in each cell of a contingency table with the observed counts. H0: response is independent of condition. Ha: response depends upon condition. If there is a big difference, we can conclude that it is unlikely that there is no difference in the population.
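A short sketch of the chi-squared test of independence on a made-up contingency table where rows are conditions and columns are response categories.

import numpy as np
from scipy.stats import chi2_contingency

observed = np.array([[30, 10],
                     [20, 25]])
chi2, p, dof, expected = chi2_contingency(observed)
print(chi2, p, dof)
print(expected)   # expected counts under H0: response independent of condition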
Publication bias
Refers to the idea that the scientific studies which end up getting published are a biased sample of the total population of scientific studies.
P-hacking
The practice of adjusting data collection or analysis, driven by incentives to find significant p-values, until a nominally significant result is obtained.