statistics Flashcards

Question

Why does linear and non linear matter in scatterplots when quantifying correlation?

Answer 1

-Linear relationship = Can measure correlations -Non linear relationship = Measuring correlation does not make sense, might need to transform data

Answer 2

- Calculated directly from the raw scores - interval or ratio data - Highly affected by outliers - Not suitable for skewed data

Answer 3

- Calculated from the ranking of the raw scores - ordinal data - Minimally affected by outliers - skewed data

Answer 4

Density curves

Answer 5

- generalising results to the population. - A density curve is a histogram distribution - Displays overall pattern (shape) of a distribution - always on or above the horizontal axis

Answer 6

- Curves are calculated so they have an area of exactly 1 (probability) underneath them - 100% of the scores under the curve - if you know certain values of the model (e.g. mean or SD) you can make predictions about the overall population Area above the mean = 0.6 60% of the scores will be above the mean

Answer 7

point that divides area into two equal parts

Answer 8

points that divide area under curve into quarters

Answer 9

positions at the peak of the curve

Answer 10

the balancing point of the curve

Answer 11

Symmetrical Single peaked Tails meet the x- axis at infinity

Answer 12

its Standard deviation

Answer 13

- Allows us to compare values from two data sets where two values can be made into a single score, this is called Z- scores (standard scores)

Answer 14

(Score) - (Mean) divided by (standard deviation)

Answer 15

- To compare data from two different normal distributions = Convert normal datasets into standard normal distributions by calculating the z- score

Answer 16

Using table entries Table entry always gives = area to the left of z score Can work out the percentage of population above or below our point of interest

Answer 17

1 (representing 100% of the participants)

Answer 18

- Non parametric (Makes no assumptions of population parameters so they are distribution free)

Answer 19

The goodness of fit test The test of independence - Both types of tests are there to test for significant differences between data sets

Answer 20

- Used on unrelated categorical data, where each person can only be in one category - Used to look at the proportions of a population - Looks at the categories of one variable

Answer 21

- The observed frequencies are the numbers of participants measured in individual categories e.g. number of men vs number of women - These frequencies are then compared to frequencies predicted by the null hypothesis (the expected frequencies)

Answer 22

Sample size x the proportion

Answer 23

- Looks at the categories of two variables - uses data in the form of frequencies in different categories which is compared to expected frequencies predicted by the null hypothesis - But instead of 2 categories there are 4 Data is presented in the form of a matrix displaying all categories

Answer 24

(number of rows R minus 1) x (number of column C minus 1)

Answer 25

A measure of how likely it is that some event will occur Probability can vary from 0 (never) to 1 (always)

Answer 26

- Assuming there is no difference and there is no relationship between the two conditions - Calculate how probable it is to get the score as extreme or more extreme than what we obtained - If the probability is very small, reject the null hypothesis (Thus accepting the alternative H) If the p- value is less than 0.05 (5%)

Answer 27

- A score that tells you if someone scores less or higher than this they are outside this 5% range - Essentially scores that are the cut off point for statistical significance (top 5%)

Answer 28

- To get a 5% score you need a z- score of 1.645 - That 5% cut of of significance is roughly 1.6 standard deviations below or above the mean -The same for every normal distribution

Answer 29

Rejecting the null hypothesis (and accepting the alternative) when we shouldn't - Deciding the score is statistically significant when its not

Answer 30

Accepting the null hypothesis (rejecting the alternative) when we shouldnt

Answer 31

By reducing the threshold of significance from 5% to 1% (0.01) - - However this could increase the possibility of a type 2 error. - And decreasing the likelihood of a type 2 error could also increase the likelihood of a type I error

Answer 32

Does not state the direction just states they will differ

Answer 33

States which direction its going on (e.g. higher lower, better worse)

Answer 34

Using probability theory to make inferences about a population from sample data Why do we do it? - to Make inferences from a sample to the population

Answer 35

(Estimated mean) divided (standard deviation from sample)

statistics Flashcards

(63 cards)