Introduction to Statistics Flashcards

Question

Normal Distribution

Answer 1

Bell Curve, Symmetric distribution, Mean in centre Area under the bell curve presents probabilities

Answer 2

One std deviation either side of the mean captures 68% of data, Two std deviations either side of the mean captures 95% of data, Three std deviations either side of the mean captures 99.7% of data.

Answer 3

Number of standard deviations away from the mean.

Answer 4

Value of interest, subtract the mean, divided by standard deviations.

Answer 5

When it is more than two std deviations from the mean.

Answer 6

Takes into account all of the data, not just the two end points. Variance looks at how much each individual score differs from the mean. Squaring them, then averaging them

Answer 7

The 50% point

Answer 8

The 25% percentile

Answer 9

The 75% percentile

Answer 10

Sample Size, sample proportion / percentage, 95% confidence interval, anything else of interest

Answer 11

Shape, centre (mean / median), Spread, Outliers

Answer 12

Taking information from a sample, inferring about a population from a sample.

Answer 13

Turning a research question into a statement. hypothesis is not a question. Hypothesis is to be tested

Answer 14

Looks at categorical data, specifically those with two categories, compares a percentage / proportion to a fixed value

Answer 15

For metric date, compares a mean to a fixed value.

Answer 16

Hypothesis - what is the sample being measured Sample - sample size, who is in the sample? Comparison - Name of test - Quote test statistics - if significance include 95% confidence Conclusion - use appropriate language

Answer 17

Standard deviation (s= )

Answer 18

When it's below 0.05 (<0.05)

Answer 19

t-value - Degrees of freedom (df) - P value - t(115) = 2.453, p = .016

Answer 20

p value is probability that our test statistic takes the observed value or a value more extreme. The smaller the p value, the stronger the evidence.

Answer 21

Not with the zero in front of the decimal, Always quote the tree numbers. p=.115 only with carrot when we are Say below .001 ( <.001)

Answer 22

The difference between sampling.

Answer 23

Normal distribution - The mean of the sample proportion ( or sample mean) equals population proportion (or population mean) Standard deviation of sample distribution depends on the size of the sample

Answer 24

The proportion in the population

Answer 25

The area outside the 95% markers. The 5% probability.

Answer 26

When the researcher is able to manipulate the IV. | We can then have causal conclusions.

Answer 27

We are just observing what happens, Not manipulating the IV. No causal conclusions.

Answer 28

When the researcher is unable to conduct experimental study, | Or it is unethical.

Answer 29

We cannot determine something for certain, | We cannot make definitive statements.

Answer 30

A variable that correlates (might effect) the dependant variable, The IV is NEVER a nuisance variable, Nuisance variable must vary

Answer 31

Associated with the participant, age, gender, driving experience etc.

Answer 32

Accociated with the conditions of the experiment.

Answer 33

When the same participants are used for both conditions.

Answer 34

Two separate groups with where the participants are matched as similar to one another as possible.

Answer 35

Groups are randomly separated.

Answer 36

To hold them constant.

Answer 37

A variable that alters the logic of the experiment by being correlated to the IV and DV.

Answer 38

A random sample with an arbitrary starting position. Random numbers are drawn to select the sample.

Answer 39

Where the population comprises subgroups.

Answer 40

Where we combine different sampling methods.

Answer 41

Population has some kind of natural (ideally homogenous) group (cluster), Eg: all Victorians = clutter would be local government area. Sample within the cluster.

Answer 42

From a random starting point, sample every Kth item.

Answer 43

Cause and effect

Answer 44

Hypothesis

Answer 45

Observation.

Answer 46

They mask or hide the effects of the independent variable, | They destroy the logic of an experiment.

Answer 47

Compares sample means for two groups, making inference

Answer 48

DV is metric, Independence of observations, Both samples must come from normal distribution, Equal Variance, both sample should have similar spread

Answer 49

The t value is rounded to two decimal places

Answer 50

In relation to the sample rounding.

Answer 51

Used to test the relationships when we have repeated measures or matched pairs research design.

Answer 52

Something about the population

Answer 53

With the mean for each group first, then the sample mean difference xd.

Answer 54

Infer about the population

Answer 55

The p value and the means

Answer 56

Metric data Independence of observations Normality

Answer 57

The probability in that the sample can say something about the population.

Answer 58

Looking at the relationship between two metric variables

Answer 59

On the x axis (horizontal)

Answer 60

On the y axis (vertical)

Answer 61

Direction Form Strength Outliers

Answer 62

The measure of the strength of a linear association between two metric variables

Answer 63

When the form is non linear (curved)

Answer 64

Tells us more about the relationship between two variables

Answer 65

Example: .123 x .123 | R squared

Answer 66

Example: .085 8.5% .123 12.3%

Answer 67

Indicates that in the population the strength of the linear relationship is between ...

Answer 68

Where we have strong positive correlation where it does not make sense, sometimes a third factor.

Answer 69

Rho, | Looks like a p

Answer 70

Upwards from left to right. | More of IV means more of DV

Answer 71

Downwards from left to right | More of IV means less of DV

Answer 72

An indication of how well you can predict the value of the DV when you know the value of the IV

Answer 73

That there is 5 chances In 1000

Answer 74

That there is 5 chances in 100

Answer 75

Y = a + b x X

Answer 76

Dependant Variable

Answer 77

A constant known as the vertical intercept

Answer 78

Slope or regression coefficiant.

Answer 79

Independent variable.

Answer 80

To calculate the linear relationship

Answer 81

The vertical intercept

Answer 82

What evidence we are looking for to draw a conclusion.

Answer 83

That a change in the IV will produce a change in the DV

Answer 84

Chi squared (pronounced ki) testing the relationship between two categorical variables.

Answer 85

The relationship between the two measured variables, not the difference like in some tests, categorical variables.

Answer 86

There is a specific population parameter that we are trying to estimate using the sample statistic

Answer 87

A test that does not measure the relationship between sample and population. Simply measures significance.

Introduction to Statistics Flashcards

(112 cards)