Test 2 Flashcards
Making a prediction steps
-compose hypothesis
-generate predictions
-test predictions
-evaluate hypotheses
MUST MAKE TESTABLE PREDICTIONS
Deductive reasoning
- starts with a theory, then tests and revises it
- top-down approach
- general → specific
Inductive reasoning
- starts with observations, then forms a theory
- specific → general
- conclusions can be falsified by contradictory evidence
Lakatos (1978)
- individual tests are risky and arbitrary
- should have multiple competing hypotheses
Kuhn paradigm (1970)
- not linear discovery, but series of paradigm shifts
- scientists aren’t objective but rather come to a consensus
Manipulative data
-data gathered after you deliberately change (manipulate) something in the system
Observational data
-when you observe what’s happening in a system
A priori
Ahead of time, before collection of data
measures of central tendency
mean
median
t test equation
t = (x̄ - µ) / SEM, i.e. t = (sample mean - comparison mean) / (standard error of the mean)
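For reference, a minimal Python sketch (scipy/numpy assumed; the sample values and comparison mean are made up) computing t by hand and checking it against scipy:

```python
import numpy as np
from scipy import stats

x = np.array([5.1, 4.8, 5.6, 5.0, 5.3, 4.9])  # hypothetical sample
mu = 5.0                                       # comparison (null) mean

sem = x.std(ddof=1) / np.sqrt(len(x))          # SEM = s / sqrt(n)
t = (x.mean() - mu) / sem
print(t, stats.ttest_1samp(x, mu).statistic)   # the two should match
```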
standard error of the mean
s / √n, i.e. the square root of (variance / n)
Confidence interval
- a desired confidence interval (margin of error) can be used to calculate the required sample size
- also need the variance, alpha, t, and df
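A hedged sketch of that sample-size calculation (scipy assumed; the variance, CI half-width E, and alpha are made-up example values). Since t depends on df = n - 1, iterate:

```python
import math
from scipy import stats

s2, E, alpha = 4.0, 0.5, 0.05   # assumed variance, CI half-width, alpha
n = math.ceil(stats.norm.ppf(1 - alpha / 2) ** 2 * s2 / E ** 2)  # z-based start
while True:
    t = stats.t.ppf(1 - alpha / 2, df=n - 1)  # t for the current df
    n_new = math.ceil(t ** 2 * s2 / E ** 2)   # n = t^2 * variance / E^2
    if n_new <= n:
        break
    n = n_new
print(n)                                      # required sample size
```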
t test assumptions
- Independent
- random sample
- normally distributed
- equal variances (homogeneity)
- must test these before any stats can be done!
How can we test for normality?
Shapiro-Wilk
Kolmogorov-Smirnov
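A minimal sketch of both tests in scipy (made-up data). Note: feeding K-S parameters estimated from the same data makes its p value conservative (the Lilliefors caveat):

```python
import numpy as np
from scipy import stats

x = np.random.default_rng(0).normal(10, 2, size=30)   # hypothetical sample

print(stats.shapiro(x))                               # Shapiro-Wilk: W and p
print(stats.kstest(x, "norm", args=(x.mean(), x.std(ddof=1))))  # K-S vs fitted normal
```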
Testing for variance?
Levene's test for equality of variances
- checks whether the groups have similar spread (similar bell-curve shape)
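Sketch of Levene's test in scipy (two made-up groups, one with wider spread):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
a = rng.normal(5, 1, 25)        # hypothetical group A
b = rng.normal(5, 3, 25)        # hypothetical group B, wider spread

stat, p = stats.levene(a, b)
print(stat, p)                  # small p -> variances likely unequal
```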
What if normal distribution, but unequal variances
independent t test with equal variances not assumed (Welch's t test)
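In scipy this is just equal_var=False on the independent t test (a sketch; data made up):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(2)
a = rng.normal(5, 1, 25)
b = rng.normal(6, 3, 25)                        # unequal spread
print(stats.ttest_ind(a, b, equal_var=False))   # Welch's correction to df
```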
not normally distributed, but similar variances
- non-parametric Mann-Whitney U test
- doesn't rely on a calculated mean parameter
- ranks the data and calculates a U statistic based on the difference in rankings
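Sketch of the U test in scipy on made-up skewed (non-normal) samples:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(3)
a = rng.exponential(1.0, 30)      # hypothetical skewed sample
b = rng.exponential(1.5, 30)
print(stats.mannwhitneyu(a, b))   # U statistic from the rank comparison
```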
data not independent
paired t test
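Sketch of the paired test in scipy (hypothetical before/after measurements):

```python
import numpy as np
from scipy import stats

before = np.array([12.1, 11.4, 13.0, 12.7, 11.9])
after = np.array([12.8, 11.9, 13.5, 13.1, 12.2])
print(stats.ttest_rel(before, after))   # tests mean paired difference = 0
```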
Steps for t tests
- identify question
- state H0 and Ha with respect to your samples
- set the alpha level and direction of the relationship
- choose the test after exploring the data to check it meets the assumptions
Statistical tests vary in:
- number of IVs and DVs
- levels of measurements (ordinal, continuous, category)
- form of the variables: scalars in univariate tests; vectors and matrices in multivariate tests
- role of variables: DVs, IVs, Covariates?
Univariate
single dependent variable
Multivariate
employ two or more dependent variables
Vectors and matrices
vectors: variables with magnitude and direction
matrices: 2D arrays of vectors
Power
-important to have high enough power to detect an effect
need to know:
-effect size
-alpha
-sample size
-data dispersion
power = the % chance you can detect an effect
OR the probability of not committing a type II error (a false negative)
Effect size
- power
- alpha
- n
- s (data dispersion)
all of the above must be known to solve for effect size
G*Power
-allows you to calculate the sample size needed for univariate and multivariate tests
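The same kind of calculation G*Power does can be sketched with statsmodels (effect size, alpha, and power below are assumed example values):

```python
from statsmodels.stats.power import TTestIndPower

# solve for n per group given Cohen's d = 0.5, alpha = 0.05, power = 0.8
n = TTestIndPower().solve_power(effect_size=0.5, alpha=0.05, power=0.8)
print(n)   # roughly 64 per group
```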
post hoc power calc
- usually when your results were almost significant
- often in poor taste
Linear relationship
-predictor and response
-bivariate = x and y
positive, negative, or no relationship (zero)
scatterplot
-scatter diagram is graphical method to display relationship between two variables
Fitting a line
-least squares method.
-distances from the candidate line (residuals) are squared and summed over all points; the fit minimizes this sum
-the fitted line always passes through the mean of x and the mean of y
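Sketch of a least-squares fit with scipy (made-up x, y roughly following y = 2x):

```python
import numpy as np
from scipy import stats

x = np.array([1, 2, 3, 4, 5, 6], dtype=float)
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1, 12.2])

fit = stats.linregress(x, y)
print(fit.slope, fit.intercept, fit.rvalue ** 2)   # slope, intercept, R^2
# the fitted line passes through (mean of x, mean of y):
print(np.isclose(fit.slope * x.mean() + fit.intercept, y.mean()))
```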
WHY fit a line?
to convert one value to another
to standardize: a calibration curve!
regression significance
can we distinguish a line with slope from a line with zero slope?
zero slope (no relationship) is our null
R^2
coefficient of determination
- how much of the variation in y is explained by x
- ranges from 0 to 1; want close to 1
Assumptions of regression
- each x and y pair is independent and random
- normal distribution of y values (residuals) at each x
- homogeneity of variance
- linear relationship
- measurements of x are free of error, or the error is small compared to y's (error in x makes the relationship hard to estimate)
Applications of Regression line
-can be used to predict
-r is a measure of the strength of the linear association between x and y
-|r| = √(R²), but √(R²) alone loses the sign (direction) of the relationship
-want values near 1 or -1
r > 0: direct (positive) linear relationship
r < 0: inverse (negative) linear relationship
Spearman Rank Correlation
- used when data don't meet normality or homogeneity-of-variance assumptions
- rank correlation is also used when one or both variables consist of ranks
- can also handle multiple values of y for each x
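Sketch of Spearman's rho in scipy on made-up data that is monotonic but not linear:

```python
import numpy as np
from scipy import stats

x = np.arange(1, 11, dtype=float)
y = x ** 3 + np.random.default_rng(4).normal(0, 5, 10)   # curved but monotonic

rho, p = stats.spearmanr(x, y)
print(rho, p)   # rho near 1: ranks agree even though the relation is curved
```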
Parametric tests
-independent and paired t tests
-correlation analysis
-linear regression
-ANOVAs
Non Parametric tests
(also have their own assumptions!)
- Mann Whitney u
- Spearman Rho
Transformation
-transforms a non-normal distribution toward normality
-there are a number of ways to do this depending on the original distribution
-WON'T MAKE UP FOR POOR SAMPLING, specifically non-random sampling; very sensitive to outliers
-KNOW YOUR LIT/FIELD
prepare to defend your choice
Log Transformation
used for heterogeneity of variance (base 10 or natural log)
Square Root Transformation
used for heteroscedastic data (non-constant variance); commonly used on count data
Arcsine transformation
-for binomial distributions (yes/no data)
-proportions or percentages
-takes the arcsine of the square root of the proportion
-input proportions range from 0 to 1; the result is in radians
Back transform
-transformed values mean little to readers, so back-transform the results to the original scale when writing up
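A small numpy sketch of the transform/back-transform round trip (data made up; note the back-transformed mean of logs is the geometric mean, not the arithmetic mean):

```python
import numpy as np

x = np.array([1.2, 3.5, 10.0, 31.0, 95.0])   # hypothetical skewed data
logx = np.log10(x)                           # transform toward normality
mean_log = logx.mean()                       # analyze on the transformed scale
print(10 ** mean_log)                        # back-transform for the write-up

# the other transforms named above, for reference:
# np.sqrt(counts)          square root, for count data
# np.arcsin(np.sqrt(p))    arcsine, for proportions p in [0, 1]
```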
Outliers
- data value different from majority
- need to report and justify why you threw them out if you trim your data set
- Need to think about them
- can’t discard due to inconvenience
- rerun the analysis without the outlier to see if the result is the same
- consider a rank-based test, or binning into categories
- transforming may help
Lost, corrupted, removed data
- reduces sample size
- a small sample size decreases power and increases the chance of extreme values distorting results
quantitative data
discrete data
3 of something
Continuous data
3.14579 of something
categories
I am a human
convert data into bins
Types of data can be divided into groups
race, age, sex
-put into contingency table
-categorical variables
-chi square analysis
must always use frequencies (counts) and compare observed to expected
expected values can come from models! e.g., Mendelian genetics uses Hardy-Weinberg
Chi square things
Odds ratio
ratio of the odds in one group to the odds in the other (each odds = successes / failures)
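Sketch of a chi-square test plus odds ratio on an assumed 2x2 frequency table (counts made up; rows = treatment, columns = success/failure):

```python
import numpy as np
from scipy import stats

table = np.array([[30, 10],    # hypothetical treatment group
                  [18, 22]])   # hypothetical control group

chi2, p, dof, expected = stats.chi2_contingency(table)
print(chi2, p)                 # observed vs expected frequencies

odds_ratio = (table[0, 0] / table[0, 1]) / (table[1, 0] / table[1, 1])
print(odds_ratio)              # odds of success, treatment vs control
```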
Mosaic plot
graphical way to look at frequencies
- column = “treatment”
- row variable = “response”
ANOVA
- statistical test that exploits variance (s^2)
- uses normally distributed data sets to compare differences between groups
Basic one way ANOVA
Two variables: categorical and quantitative
Question: do the means of the quantitative variable depend on which category the individual is in?
- if there are only 2 categories, use a 2-sample t test; ANOVA handles 3 or more :)
- determines the p value from the F statistic
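Sketch of a one-way ANOVA in scipy (three made-up groups):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(5)
g1 = rng.normal(10, 2, 20)
g2 = rng.normal(11, 2, 20)
g3 = rng.normal(13, 2, 20)

f, p = stats.f_oneway(g1, g2, g3)
print(f, p)   # small p: at least one group mean differs (not which one)
```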
What does ANOVA do?
Tests these hypotheses:
- means of the groups are equal (H0)
- not all means are equal (Ha)
* doesn’t tell us which differ, have to follow up with post hoc testing
ANOVA assumptions
-each group is approx. normal
check graphically or with normality tests; ANOVA can withstand some deviation but not extreme outliers
-standard deviations are approx. equal between groups
ratio of the largest to smallest group's st dev should be less than 2:1
Levene's test checks this
ANOVA notation
n = number of total individuals
I = number of groups
x = an individual value
x̄ (X bar) = mean of the entire data set
How does one way ANOVA work?
measures variation
- between groups (difference between each group mean and the overall mean)
- within groups (difference between each value and its group mean)
ANOVA f statistic
F = mean square variation between groups / mean square variation within groups
= between / within
= MSG / MSE
R^2 statistic
sum of squares between/ sum of squares total
SSB/SST
If ANOVA groups don’t have the same means
- compare in twos: pairwise two-sample t tests
- need to adjust the p value threshold because the multiple tests use the same data
Tukey's Pairwise comparisons
- if family error rate is 0.05 then
- individual alpha = 0.0199 w/ 95 % CI
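Sketch of Tukey's procedure via statsmodels (group labels and values made up; alpha is the family error rate):

```python
import numpy as np
from statsmodels.stats.multicomp import pairwise_tukeyhsd

rng = np.random.default_rng(6)
values = np.concatenate([rng.normal(10, 2, 20),
                         rng.normal(11, 2, 20),
                         rng.normal(13, 2, 20)])
groups = np.repeat(["A", "B", "C"], 20)

print(pairwise_tukeyhsd(values, groups, alpha=0.05))   # which pairs differ
```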
ANOVA data not normal?
Kruskal-Wallis Test
nonparametric procedure used to test the claim that 3+ indep samples come from pops with the same distribution
Kruskal-Wallis Test
- tests a STRONGER hypothesis than ANOVA, which only compares means
- samples are simple random samples from 3+ pops
- data can be ranked
- the principle is pooling all the data, ranking it, and checking whether the ranks disperse evenly across the samples
- large values of H indicate the Ri (sums of ranks of the samples) differ from what's expected
- If H is too large then we reject the null
- K-W is always right tailed
K-W test critical values
- with 3 populations and every sample size 5 or less: critical value from the K-W table
- with 4 or more populations, or any sample size greater than 5: critical value from chi^2
K-W hypothesis test steps
step 0: samples are indep random, data can be ranked
step 1: box plots to compare data
step 2: hypotheses.
H0: the data distributions are the same
H1: the data distributions are not the same
step 3: rank observations smallest to largest
step 4: level of significance; critical value from either the K-W table or chi^2
step 5: compute test stat
step 6: compare to the critical value
reject H0 if the test stat is bigger than the critical value
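Sketch of steps 5-6 in scipy (three made-up skewed samples; kruskal returns H and its chi²-based p value):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(7)
g1 = rng.exponential(1.0, 12)
g2 = rng.exponential(1.5, 12)
g3 = rng.exponential(3.0, 12)

h, p = stats.kruskal(g1, g2, g3)
print(h, p)   # large H (small p): reject H0 that the distributions are the same
```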