Statistics, SPSS, and Intro Methods (Field, Ch. 1–4) Flashcards
Between-subjects design vs. within-subjects design
In a between-subjects design, different individuals each experience only one condition, so scores are compared between groups; in a within-subjects design, the same individuals experience every condition, so each person’s scores are compared across conditions.
Boredom effect / Learning(practice) effect
Boredom effect: refers to the possibility that performance in tasks may be influenced (the assumption is a negative influence) by boredom or lack of concentration if there are many tasks, or the task goes on for a long period of time. Learning (practice) effect: refers to the possibility that participants’ performance in a task may be influenced (positively or negatively) if they repeat the task because of familiarity with the experimental situation and/or the measures being used.
Binary Variable, Categorical Variable
Binary variable: a categorical variable that has only two mutually exclusive categories (e.g., being dead or alive). Categorical variable: any variable made up of distinct categories of objects or entities (e.g., species or occupation); a binary variable is the special case with exactly two categories.
Nominal Variable, Ordinal Variable, Ratio Variable, Continuous Variable (see the PsycStats methods textbook; each also has its own entry below)
to be updated
What are the types of Validity? List and describe them
to be updated
Concurrent validity
a form of criterion validity where there is evidence that scores from an instrument correspond to concurrently recorded external measures conceptually related to the measured construct.
Spurious Relationship
A mathematical relationship in which two variables have no direct causal connection, yet it may be wrongly inferred that they do, due to either coincidence or the presence of a certain third, unseen factor (referred to as a “confounding variable”). Suppose there is found to be a correlation between A and B. Aside from coincidence, there are three possible relationships: A causes B, B causes A, or C causes both A and B.
Confounding variable
a variable (that we may or may not have measured) other than the predictor variables in which we’re interested that potentially affects an outcome variable.
Content validity
evidence that the content of a test corresponds to the content of the construct it was designed to cover.
Continuous Variable
a variable that can be measured to any level of precision. (Time is a continuous variable- no limit on how finely it could be measured.)
Correlational research
a form of research in which you observe what naturally goes on in the world without directly interfering with it. This term implies that data will be analysed so as to look at relationships between naturally occurring variables rather than making statements about cause and effect. Compare with cross-sectional research, longitudinal research and experimental research.
Compare and contrast: correlational research, cross-sectional research, longitudinal research and experimental research.
to be updated
Counterbalancing (***)
systematically varying the order in which participants in experimental conditions see the manipulations. In the simplest case of there being two conditions (A and B), counterbalancing simply implies that half of the participants complete condition A followed by condition B, whereas the remainder do condition B followed by condition A. The aim is to remove systematic bias caused by practice effects or boredom effects.
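The alternating AB/BA scheme described above can be sketched in a few lines of Python (an illustration, not from the text; the function name and participant labels are made up):

```python
def counterbalance(participants, conditions=("A", "B")):
    """Alternate the two possible orders (AB, BA, AB, ...) across
    participants so that roughly half complete each order."""
    orders = [tuple(conditions), tuple(reversed(conditions))]
    return {p: orders[i % 2] for i, p in enumerate(participants)}

assignment = counterbalance(["p1", "p2", "p3", "p4"])
print(assignment["p1"])  # ('A', 'B')
print(assignment["p2"])  # ('B', 'A')
```

With more than two conditions, a Latin square is typically used instead of simple alternation, because the number of possible orders grows factorially.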
Criterion validity
evidence that scores from an instrument correspond with (concurrent validity) or predict (predictive validity) external measures conceptually related to the measured construct.
Cross-sectional research
a form of research in which you observe what naturally goes on in the world
without directly interfering with it, by measuring several variables at a single
time point. In psychology, this term usually implies that data come from
people at different age points, with different people representing each age
point. See also correlational research, longitudinal research.
Dependent variable
another name for outcome variable. This name is usually associated with experimental methodology (which is the only time it really makes sense) and is used because it is the variable that is not manipulated by the experimenter and so its value depends on the variables that have been manipulated. To be honest I just use the term outcome variable all the time - it makes more sense (to me) and is less confusing.
Deviance
the difference between the observed value of a variable and the value of that variable predicted by a statistical model.
Discrete variable
a variable that can only take on certain values (usually whole numbers) on the scale.
Ecological validity
evidence that the results of a study, experiment or test can be applied, and allow inferences, to real-world conditions.
Experimental research
a form of research in which one or more variables are systematically manipulated to see their effect (alone or in combination) on an outcome variable. This term implies that data will be able to be used to make statements about cause and effect. Compare with cross-sectional research and correlational research.
Falsification
the act of disproving a hypothesis or theory.
Frequency distribution
a graph plotting values of observations on the horizontal axis, and the frequency with which each value occurs in the data set on the vertical axis (a.k.a. histogram).
Histogram
a frequency distribution.
Hypothesis
a prediction about the state of the world (see experimental hypothesis and null hypothesis).
Independent design
an experimental design in which different treatment conditions utilize different organisms (e.g., in psychology, this would mean using different people in different treatment conditions) and so the resulting data are independent (a.k.a. between-groups or between-subjects design).
Independent variable
another name for a predictor variable. This name is usually associated with experimental methodology (which is the only time it makes sense) and is used because it is the variable that is manipulated by the experimenter and so its value does not depend on any other variables (just on the experimenter). I just use the term predictor variable all the time because the meaning of the term is not constrained to a particular methodology.
Interquartile range
the limits within which the middle 50% of an ordered set of observations fall. It is the difference between the value of the upper quartile and lower quartile.
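As a concrete illustration (a Python standard-library sketch, not from the text):

```python
import statistics

scores = [1, 2, 3, 4, 5, 6, 7, 8]

# The three quartiles cut the ordered scores into four equal parts
q1, q2, q3 = statistics.quantiles(scores, n=4, method="inclusive")

iqr = q3 - q1  # limits containing the middle 50% of the scores
print(q1, q2, q3)  # 2.75 4.5 6.25
print(iqr)         # 3.5
```

Note that different software packages interpolate quartiles slightly differently, so values near these (not always identical) are normal.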
Interval variable
data measured on a scale along the whole of which intervals are equal. For example, people’s ratings of this book on Amazon.com can range from 1 to 5; for these data to be interval it should be true that the increase in appreciation for this book represented by a change from 3 to 4 along the scale should be the same as the change in appreciation represented by a change from 1 to 2, or 4 to 5.
Journal
In the context of academia a journal is a collection of articles on a broadly related theme, written by scientists, that report new data, new theoretical ideas or reviews/critiques of existing theories and data. Their main function is to induce learned helplessness in scientists through a complex process of self-esteem regulation using excessively harsh or complimentary peer feedback that has seemingly no obvious correlation with the actual quality of the work submitted.
Kurtosis
this measures the degree to which scores cluster in the tails of a frequency distribution. There are different ways to estimate kurtosis and in SPSS no kurtosis is expressed as 0 (but be careful because outside of SPSS no kurtosis is sometimes a value of 3). A distribution with positive kurtosis (leptokurtic, kurtosis > 0) has too many scores in the tails and is too peaked, whereas a distribution with negative kurtosis (platykurtic, kurtosis < 0) has too few scores in the tails and is quite flat.
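A minimal sketch of the idea (the function below computes the simple population excess kurtosis, so it will differ slightly from SPSS’s small-sample-corrected estimate):

```python
def excess_kurtosis(xs):
    """Population excess kurtosis: 0 for a normal distribution,
    > 0 leptokurtic (heavy tails), < 0 platykurtic (light tails)."""
    n = len(xs)
    mean = sum(xs) / n
    m2 = sum((x - mean) ** 2 for x in xs) / n  # second central moment
    m4 = sum((x - mean) ** 4 for x in xs) / n  # fourth central moment
    return m4 / m2 ** 2 - 3  # subtracting 3 makes the normal score 0

# An evenly spread set of scores is flatter than normal (platykurtic)
print(excess_kurtosis([1, 2, 3, 4, 5, 6, 7, 8, 9]))  # negative
```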
Leptokurtic
see Kurtosis.
Levels of measurement
the relationship between what is being measured and the numbers obtained on a scale.
Longitudinal research
a form of research in which you observe what naturally goes on in the world without directly interfering with it by measuring several variables at multiple time points. See also correlational research, cross-sectional research.
Lower quartile
the value that cuts off the lowest 25% of the data. If the data are ordered and then divided into two halves at the median, then the lower quartile is the median of the lower half of the scores.
Mean
a simple statistical model of the centre of a distribution of scores. A hypothetical estimate of the ‘typical’ score.
Measurement error
the discrepancy between the numbers used to represent the thing that we’re measuring and the actual value of the thing we’re measuring (i.e., the value we would get if we could measure it directly).
Median
the middle score of a set of ordered observations. When there is an even number of observations the median is the average of the two scores that fall either side of what would be the middle value.
Mode
the most frequently occurring score in a set of data.
Multimodal
description of a distribution of observations that has more than two modes.
Negative skew
see Skew.
Nominal variable
where numbers merely represent names. For example, the numbers on sports players’ shirts: a player with the number 1 on her back is not necessarily worse than a player with a 2 on her back. The numbers have no meaning other than denoting the type of player (full back, centre forward, etc.).
Noniles
a type of quantile; they are values that split the data into nine equal parts. They are commonly used in educational research.
Normal distribution
a probability distribution of a random variable that is known to have certain properties. It is perfectly symmetrical (has a skew of 0), and has a kurtosis of 0.
Ordinal variable
data that tell us not only that things have occurred, but also the order in which they occurred. These data tell us nothing about the differences between values. For example, gold, silver and bronze medals are ordinal: they tell us that the gold medallist was better than the silver medallist, but they don’t tell us how much better (was gold a lot better than silver, or were gold and silver very closely competed?).
Outcome variable
a variable whose values we are trying to predict from one or more predictor variables.
Percentiles
a type of quantile; they are values that split the data into 100 equal parts.
Platykurtic
see Kurtosis.
Positive skew
see skew.
Practice effect
refers to the possibility that participants’ performance in a task may be influenced (positively or negatively) if they repeat the task because of familiarity with the experimental situation and/or the measures being used.
Predictive validity
a form of criterion validity where there is evidence that scores from an instrument predict external measures (recorded at a different point in time) conceptually related to the measured construct.
Predictor variable
a variable that is used to try to predict values of another variable known as an outcome variable.
Probability density function (PDF)
the function that describes the probability of a random variable taking a certain value. It is the mathematical function that describes the probability distribution.
Probability distribution
a curve describing an idealized frequency distribution of a particular variable from which it is possible to ascertain the probability with which specific values of that variable will occur. For categorical variables it is simply a formula yielding the probability with which each category occurs.
Qualitative methods
extrapolating evidence for a theory from what people say or write (cf. quantitative methods).
Quantitative methods
inferring evidence for a theory through measurement of variables that produce numeric outcomes (cf. qualitative methods).
Quantiles
values that split a data set into equal portions. Quartiles, for example, are a special case of quantiles that split the data into four equal parts. Similarly, percentiles are points that split the data into 100 equal parts and noniles are points that split the data into 9 equal parts (you get the general idea).
Quartiles
a generic term for the three values that cut an ordered data set into four equal parts. The three quartiles are known as the lower quartile, the second quartile (or median) and the upper quartile.
Randomization
the process of doing things in an unsystematic or random way. In the context of experimental research the word usually applies to the random assignment of participants to different treatment conditions.
Range
the range of scores is the value of the smallest score subtracted from the highest score. It is a measure of the dispersion of a set of scores. See also variance, standard deviation, and interquartile range.
Ratio variable
an interval variable, but with the additional property that ratios are meaningful. For example, people’s ratings of this book on Amazon.com can range from 1 to 5; for these data to be ratio, not only must they have the properties of interval variables, but in addition a rating of 4 should genuinely represent someone who enjoyed this book twice as much as someone who rated it as 2. Likewise, someone who rated it as 1 should be half as impressed as someone who rated it as 2.
Reliability
the ability of a measure to produce consistent results when the same entities are measured under different conditions.
Repeated-measures design
an experimental design in which different treatment conditions utilize the same organisms (i.e., in psychology, this would mean the same people take part in all experimental conditions) and so the resulting data are related (a.k.a. related design or within-subject design).
Second quartile
another name for the median.
Skew
a measure of the symmetry of a frequency distribution. Symmetrical distributions have a skew of 0. When the frequent scores are clustered at the lower end of the distribution and the tail points towards the higher or more positive scores, the value of skew is positive. Conversely, when the frequent scores are clustered at the higher end of the distribution and the tail points towards the lower more negative scores, the value of skew is negative.
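The sign convention can be illustrated with a small sketch (simple population skew; SPSS uses a small-sample-corrected formula, so its values differ slightly):

```python
def skewness(xs):
    """Population skew: 0 for symmetric data, positive when the tail
    points toward higher scores, negative when it points toward lower ones."""
    n = len(xs)
    mean = sum(xs) / n
    m2 = sum((x - mean) ** 2 for x in xs) / n  # second central moment
    m3 = sum((x - mean) ** 3 for x in xs) / n  # third central moment
    return m3 / m2 ** 1.5

print(skewness([1, 1, 1, 2, 2, 3, 10]))  # positive: tail toward high scores
print(skewness([2, 4, 6]))               # 0.0: perfectly symmetric
```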
Standard deviation
an estimate of the average variability (spread) of a set of data measured in the same units of measurement as the original data. It is the square root of the variance.
Systematic variation
variation due to some genuine effect (be it the effect of an experimenter doing something to all of the participants in one sample but not in other samples, or natural variation between sets of variables). We can think of this as variation that can be explained by the model that we’ve fitted to the data.
Sum of squared errors
another name for the sum of squares.
Tertium quid
the possibility that an apparent relationship between two variables is actually caused by the effect of a third variable on them both (often called the third-variable problem).
Test-retest reliability
the ability of a measure to produce consistent results when the same entities are tested at two different points in time.
Theory
although it can be defined more formally, a theory is a hypothesized general principle or set of principles that explain known findings about a topic and from which new hypotheses can be generated.
Unsystematic variation
this is variation that isn’t due to the effect in which we’re interested (so could be due to natural differences between people in different samples such as differences in intelligence or motivation). We can think of this as variation that can’t be explained by whatever model we’ve fitted to the data.
Upper quartile
the value that cuts off the highest 25% of ordered scores. If the scores are ordered and then divided into two halves at the median, then the upper quartile is the median of the top half of the scores.
Validity
evidence that a study allows correct inferences about the question it was aimed to answer or that a test measures what it set out to measure conceptually (see also Content validity, Criterion validity).
Variables
anything that can be measured and can differ across entities or across time.
Variance
an estimate of average variability (spread) of a set of data. It is the sum of squares divided by the number of values on which the sum of squares is based minus 1.
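The “sum of squares divided by N − 1” recipe in the definition can be checked directly (an illustrative sketch; Python’s `statistics.variance` uses the same N − 1 formula):

```python
import statistics

scores = [1, 3, 5, 7]
n = len(scores)
mean = sum(scores) / n  # 4.0

sum_of_squares = sum((x - mean) ** 2 for x in scores)  # 20.0
variance = sum_of_squares / (n - 1)  # 20 / 3, roughly 6.67

# The hand-rolled value matches the library's sample variance
print(abs(variance - statistics.variance(scores)) < 1e-12)  # True
```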
Within-subject design
another name for a repeated-measures design.
z-score
the value of an observation expressed in standard deviation units. It is calculated by taking the observation, subtracting from it the mean of all observations, and dividing the result by the standard deviation of all observations. By converting a distribution of observations into z-scores a new distribution is created that has a mean of 0 and a standard deviation of 1.
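The calculation described above, as a short Python sketch (illustrative, not from the text):

```python
import statistics

scores = [10, 12, 14, 16, 18]
mean = statistics.mean(scores)
sd = statistics.stdev(scores)  # sample standard deviation

# Subtract the mean, divide by the standard deviation
z_scores = [(x - mean) / sd for x in scores]

# The resulting distribution has mean 0 and standard deviation 1
print(statistics.mean(z_scores))   # approximately 0 (floating-point error aside)
print(statistics.stdev(z_scores))  # approximately 1
```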
α-level
the probability of making a Type I error (usually this value is .05).
Alternative hypothesis
the prediction that there will be an effect (i.e., that your experimental manipulation will have some effect or that certain variables will relate to each other).
β-level
the probability of making a Type II error (Cohen, 1992, suggests a maximum value of .2).
Bonferroni correction
a correction applied to the α-level to control the overall Type I error rate when multiple significance tests are carried out. Each test conducted should use a criterion of significance of the α-level (normally .05) divided by the number of tests conducted. This is a simple but effective correction, but tends to be too strict when lots of tests are performed.
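The corrected criterion is just a division, sketched here (illustrative values; not from the text):

```python
alpha = 0.05    # desired overall Type I error rate
n_tests = 5     # number of significance tests on the same data

# Each individual test must beat the stricter corrected criterion
bonferroni_alpha = alpha / n_tests  # roughly 0.01

p_values = [0.001, 0.02, 0.04]
significant = [p < bonferroni_alpha for p in p_values]
print(significant)  # [True, False, False]
```

Note that 0.02 and 0.04 would both have passed the uncorrected .05 criterion, which is exactly the inflation the correction guards against.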
Central limit theorem
this theorem states that when samples are large (above about 30) the sampling distribution will take the shape of a normal distribution regardless of the shape of the population from which the sample was drawn. For small samples the t-distribution better approximates the shape of the sampling distribution. We also know from this theorem that the standard deviation of the sampling distribution (i.e., the standard error of the sample mean) will be equal to the standard deviation of the sample(s) divided by the square root of the sample size (N).
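The last claim, that the standard error is the sample standard deviation divided by the square root of N, is easy to sketch (illustrative numbers, not from the text):

```python
import math
import statistics

sample = [4, 6, 8, 10, 12]

# Standard error of the mean: sample standard deviation / sqrt(N)
se = statistics.stdev(sample) / math.sqrt(len(sample))
print(se)  # roughly 1.41: the mean varies far less across samples than raw scores do
```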
Cohen’s d
An effect size that expresses the difference between two means in standard deviation units. In general it can be estimated as the difference between the two group means divided by a standard deviation (for example, a pooled standard deviation): d = (mean1 − mean2) / s.
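As a concrete sketch (illustrative data; the pooled standard deviation used here is one common choice of standardizer for equal-sized groups):

```python
import statistics

group1 = [10, 12, 14, 16, 18]  # mean 14
group2 = [8, 10, 12, 14, 16]   # mean 12

# Pooled SD for equal-sized groups; both groups here have the same SD
s1, s2 = statistics.stdev(group1), statistics.stdev(group2)
pooled_sd = ((s1 ** 2 + s2 ** 2) / 2) ** 0.5

d = (statistics.mean(group1) - statistics.mean(group2)) / pooled_sd
print(d)  # roughly 0.63: the means differ by about 0.63 standard deviations
```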
Confidence interval
for a given statistic calculated for a sample of observations (e.g., the mean), the confidence interval is a range of values around that statistic that are believed to contain, with a certain probability (e.g., 95%), the true value of that statistic (i.e., the population value).
Degrees of freedom
an impossible thing to define in a few pages, let alone a few lines. Essentially it is the number of ‘entities’ that are free to vary when estimating some kind of statistical parameter. In a more practical sense, it has a bearing on significance tests for many commonly used test statistics (such as the F-ratio, t-test, chi-square statistic) and determines the exact form of the probability distribution for these test statistics. The explanation involving soccer players in Chapter 2 is far more interesting…
Deviance
the difference between the observed value of a variable and the value of that variable predicted by a statistical model.
Effect size
an objective and (usually) standardized measure of the magnitude of an observed effect. Measures include Cohen’s d, Glass’s g and Pearson’s correlation coefficient, r.
Experimental hypothesis
synonym for alternative hypothesis.
Experimentwise error rate
the probability of making a Type I error in an experiment involving one or more statistical comparisons when the null hypothesis is true in each case.
Familywise error rate
the probability of making a Type I error in any family of tests when the null hypothesis is true in each case. The ‘family of tests’ can be loosely defined as a set of tests conducted on the same data set and addressing the same empirical question.
Fit
how sexually attractive you find a statistical test. Alternatively, it’s the degree to which a statistical model is an accurate representation of some observed data. (Incidentally, it’s just plain wrong to find statistical tests sexually attractive.)
Linear model
a model that is based upon a straight line.
Meta-analysis
this is a statistical procedure for assimilating research findings. It is based on the simple idea that we can take effect sizes from individual studies that research the same question, quantify the observed effect in a standard way (using effect sizes) and then combine these effects to get a more accurate idea of the true effect in the population.
Method of least squares
a method of estimating parameters (such as the mean, or a regression coefficient) that is based on minimizing the sum of squared errors. The parameter estimate will be the value, out of all of those possible, that has the smallest sum of squared errors.
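The mean itself is the least-squares estimate of the centre of a distribution, which can be demonstrated directly (an illustrative sketch, not from the text):

```python
def sum_squared_errors(data, estimate):
    """Total squared deviation of the data from a candidate estimate."""
    return sum((x - estimate) ** 2 for x in data)

data = [2, 4, 6, 8, 10]
mean = sum(data) / len(data)  # 6.0

# Among these candidates, the mean gives the smallest sum of squared errors
candidates = [4.0, 5.0, 6.0, 7.0, 8.0]
best = min(candidates, key=lambda c: sum_squared_errors(data, c))
print(best)  # 6.0
```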
Null hypothesis
the reverse of the experimental hypothesis, it says that your prediction is wrong and the predicted effect doesn’t exist.
One-tailed test
a test of a directional hypothesis. For example, the hypothesis ‘the longer I write this glossary, the more I want to place my editor’s genitals in a starved crocodile’s mouth’ requires a one-tailed test because I’ve stated the direction of the relationship (see also two-tailed test).
Parameter
a very difficult thing to describe. When you fit a statistical model to your data, that model will consist of variables and parameters: variables are measured constructs that vary across entities in the sample, whereas parameters describe the relations between those variables in the population. In other words, they are constants believed to represent some fundamental truth about the measured variables. We use sample data to estimate the likely value of parameters because we don’t have direct access to the population. Of course it’s not quite as simple as that.
Population
in statistical terms this usually refers to the collection of units (be they people, plankton, plants, cities, suicidal authors, etc.) to which we want to generalize a set of findings or a statistical model.
Power
the ability of a test to detect an effect of a particular size (a value of .8 is a good level to aim for).
Sample
a smaller (but hopefully representative) collection of units from a population used to determine truths about that population (e.g., how a given population behaves in certain conditions).
Sampling distribution
the probability distribution of a statistic. We can think of this as follows: if we take a sample from a population and calculate some statistic (e.g., the mean), the value of this statistic will depend somewhat on the sample we took. As such the statistic will vary slightly from sample to sample. If, hypothetically, we took lots and lots of samples from the population and calculated the statistic of interest we could create a frequency distribution of the values we got. The resulting distribution is what the sampling distribution represents: the distribution of possible values of a given statistic that we could expect to get from a given population.
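The “lots and lots of samples” thought experiment can actually be run (a simulation sketch with made-up data, not from the text):

```python
import random
import statistics

random.seed(42)  # reproducible illustration
population = [random.uniform(0, 10) for _ in range(10_000)]

# Draw many samples and record each sample's mean; this collection of
# means approximates the sampling distribution of the mean
sample_means = [statistics.mean(random.sample(population, 30))
                for _ in range(2_000)]

# It is centred on the population mean, with far less spread (the standard error)
print(abs(statistics.mean(sample_means) - statistics.mean(population)) < 0.3)  # True
print(statistics.stdev(sample_means) < statistics.stdev(population))           # True
```

Plotting `sample_means` as a histogram would show a roughly normal shape, as the central limit theorem predicts.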
Sampling variation
the extent to which a statistic (the mean, median, t, F, etc.) varies in samples taken from the same population.
Standard error
the standard deviation of the sampling distribution of a statistic. For a given statistic (e.g., the mean) it tells us how much variability there is in this statistic across samples from the same population. Large values, therefore, indicate that a statistic from a given sample may not be an accurate reflection of the population from which the sample came.
Standard error of the mean (SE)
the standard error associated with the mean. Did you really need a glossary entry to work that out?
Test statistic
a statistic for which we know how frequently different values occur. The observed value of such a statistic is typically used to test hypotheses.
Two-tailed test
a test of a non-directional hypothesis. For example, the hypothesis ‘writing this glossary has some effect on what I want to do with my editor’s genitals’ requires a two-tailed test because it doesn’t suggest the direction of the relationship. See also One-tailed test.
Type I error
occurs when we believe that there is a genuine effect in our population, when in fact there isn’t.
Type II error
occurs when we believe that there is no effect in the population, when in fact there is.
Currency variable
a variable containing values of money.
String variables
variables involving words (i.e., letter strings). Such variables could include responses to open-ended questions such as ‘How much do you like writing glossary entries?’; the response might be ‘About as much as I like placing my gonads on hot coals’.
Bar chart
a graph in which a summary statistic (usually the mean) is plotted on the y-axis against a categorical variable on the x-axis (this categorical variable could represent, for example, groups of people, different times or different experimental conditions). The value of the mean for each category is shown by a bar. Different-coloured bars may be used to represent levels of a second categorical variable.
Boxplot (a.k.a. box-whisker diagram)
a graphical representation of some important characteristics of a set of observations. At the centre of the plot is the median, which is surrounded by a box, the top and bottom of which are the limits within which the middle 50% of observations fall (the interquartile range). Sticking out of the top and bottom of the box are two whiskers which extend to the highest and lowest extreme scores, respectively.
Chartjunk
superfluous material that distracts from the data being displayed on a graph.
Density plot
similar to a histogram except that rather than having a summary bar representing the frequency of scores, it shows each individual score as a dot. They can be useful for looking at the shape of a distribution of scores.
Error bar chart
a graphical representation of the mean of a set of observations that includes the 95% confidence interval of the mean. The mean is usually represented as a circle, square or rectangle at the value of the mean (or a bar extending to the value of the mean). The confidence interval is represented by a line protruding from the mean (upwards, downwards or both) to a short horizontal line representing the limits of the confidence interval. Error bars can be drawn using the standard error or standard deviation instead of the 95% confidence interval.
Line chart
a graph in which a summary statistic (usually the mean) is plotted on the y-axis against a categorical variable on the x-axis (this categorical variable could represent, for example, groups of people, different times or different experimental conditions). The value of the mean for each category is shown by a symbol, and means across categories are connected by a line. Different-coloured lines may be used to represent levels of a second categorical variable.
Regression line
a line on a scatterplot representing the regression model of the relationship between the two variables plotted.
Scatterplot
a graph that plots values of one variable against the corresponding value of another variable (and the corresponding value of a third variable can also be included on a 3-D scatterplot).