igd Flashcards

Question

what type of data iare histograms used for

Answer 1

interval shows the distribution of data or frequency

Answer 2

fairly level

Answer 3

how spread it is | can be +- or normal

Answer 4

shot data exploration visually between 2 variables

Answer 5

show trends

Answer 6

P(A) = F(A)/F(E) F= frequency of outcome A/E E usually larger number like a population of number in the sample

Answer 7

multiplication rule - just times the probabilities together eg) P(A+B) happening -> P(A) x P(B) 0. 04x0.03x0.05 = 0.00006 OR as percentage x100 so 0.006

Answer 8

ADDITION - | P(N) + P(C) + P(M)

Answer 9

P(A) + P(B) - P(A+B)

Answer 10

area under curve??

Answer 11

σ/ (Root)N its how far sample mean is likely to be from sample the larger the standard error the larger the SD smaller SError means more reliable and smaller SD

Answer 12

when n>30 z distribution used

Answer 13

tells us sample means are normally distributed around the POPULATION mean

Answer 14

degrees of freedom

Answer 15

in a t table or z table of just the tables

Answer 16

2 SD of error or 1.96

Answer 17

used to derivegeneral conclusions about our data and beyond

Answer 18

summarises what our data shows

Answer 19

central tendency dispersion, plots and charts to illustrate distribution of data, STANdard devEVIATION

Answer 20

THE SAMPLE IS LARGER AND MORE NORMAL THE DISTRIBUTION.

Answer 21

range of values around a sample

Answer 22

samples less than 30

Answer 23

- parsimonious - generalisable - testable - plausable - directional

Answer 24

when p < 0.05/ x if YOUR value is BIGGER than the critical value if Z <1.96

Answer 25

1) confidence level 95% (1.96) is narrower than 99% CI 2) variability as measured by the SD 3) sample size

Answer 26

more certainty

Answer 27

significance level

Answer 28

one directional relationship in one direction and disregards that it can go in the other direction (cant falls before -1.96) two tailed can fall in +/- 1.96

Answer 29

calculates the sample was drawn from a normal population.

Answer 30

Ho sample data are NOT SIGNIFICANTLY different than a normal distribution Ha sample data ARE Significantly different than normal population.

Answer 31

when data is a repeat measurement (eg over 2 years). or if samples are paired in same manner! must be as it violates the assumption that samples are independent from one another

Answer 32

- that it doesnt need to have assumption of normality of data values ASSUMES PAIRED DIFFERENCES ARE NORMALLY DISTRIBUTED (this is because we are using paired differences rather than actual observations)

Answer 33

lorgorhythms, square roots, for positive data reciprocals for non-zero data histograms

Answer 34

- test statitsics (the value calculated) - eg) Z -score | - probability p-values - eg) mean, SD, sample size

Answer 35

reject the null found using t-table

Answer 36

2 tests are exactly the same

Answer 37

when its non-parametric

Answer 38

Mann-whitney-u

Answer 39

(aka analysis of varients) -used when comparing more than 2 groups compares VARIATION within groups to the variation between groups greater the variations -> we reject null

Answer 40

1 - observations between samples are independent | 2- observations in each catagory are normally distributed

Answer 41

f-statistic - a ratio of variences which helps to answer whether "is the variation due to a group, greater than the residual variation

Answer 42

we can reject null

Answer 43

used to tell us which groups differ from the rest they arent used unless null hypothesis was rejected for the f test

Answer 44

- tukeys | - honestly significant different test (HSD)

Answer 45

tests the independence of TWO catagorical variables from a single sample the hypothesis either HAS effect = H0 or HAS NO effect= HA

Answer 46

NOMINAL data in the form of frequencies or counts

Answer 47

NO it does not assume normality

Answer 48

after a chi squared test measures degree of association of 2 variables

Answer 49

measures intensity of linear relationship

Answer 50

+1 perfect positive -1 perfect negative 0 no corrolation

Answer 51

non-parametric corrolation

Answer 52

- relationship is CAUSAL | - relationship is linear

Answer 53

- residuals are normally distributed - residuals have a mean of 0 - errors dont vary with x - residual errors are independent and dont influence others

Answer 54

increasing varience on a graph

Answer 55

- linear relationship - multivariate normality - normal distribution of errors - no multicolinearity - homoskedasticity, no clear distribution of residuals

Answer 56

dataset consisting of 2 variables x and y

Answer 57

comparing 2 variables eg petal width or speal width | - reversal of trend when you group data

igd Flashcards

(86 cards)