stats Flashcards

Question

test of correlation

Answer 1

correlation

Answer 2

chi square

Answer 3

chi square

Answer 4

cannot be represented by fractions or numbers; they are classes or categories. For example: gender vs. major
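Categorical variables like these are usually summarized as counts in a table, and a chi-square statistic (the test named on several cards in this deck) compares the observed counts with the counts expected if the two variables were unrelated. The sketch below is only an illustration: the counts and the function name `chi_square_statistic` are invented for the example, not taken from the cards.

```python
def chi_square_statistic(table):
    """Chi-square statistic for a table of observed counts (a list of rows)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Count expected in this cell if rows and columns were independent.
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    return statistic


# Hypothetical counts: rows are two genders, columns are two majors.
counts = [[20, 30],
          [30, 20]]
chi2 = chi_square_statistic(counts)
print(chi2)
```

With these made-up counts every expected cell is 25, so the statistic is 4.0. For a 2 x 2 table (1 degree of freedom) the 5% critical value is 3.841, so this hypothetical table would count as significant at that level.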

Answer 5

can be represented by numbers and/or fractions

Answer 6

chi square

Answer 7

correlation

Answer 8

when your null hypothesis is very wrong, and usually very small when your null hypothesis is correct.

Answer 9

the probability that your results, or results more extreme than yours, could have occurred by chance if the null hypothesis is actually true
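As a rough illustration of this definition, the sketch below computes an exact two-sided p-value for a coin-flip experiment. The scenario (16 heads in 20 flips, null hypothesis of a fair coin) and the function names are assumptions made up for the example. "More extreme" is taken here to mean "no more probable under the null than the observed outcome", which is one common convention.

```python
from math import comb


def binomial_pmf(k, n, p):
    """Probability of exactly k successes in n trials with success chance p."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)


def two_sided_p_value(successes, trials, p_null=0.5):
    """Total null probability of outcomes no more likely than the observed one."""
    observed = binomial_pmf(successes, trials, p_null)
    total = 0.0
    for k in range(trials + 1):
        prob = binomial_pmf(k, trials, p_null)
        # Small tolerance so ties with the observed outcome are included.
        if prob <= observed * (1 + 1e-9):
            total += prob
    return total


# Hypothetical data: 16 heads in 20 flips, null hypothesis "the coin is fair".
p_value = two_sided_p_value(16, 20)
print(round(p_value, 4))  # about 0.0118
```

Because about 0.0118 is below 0.05, this made-up result would lead to rejecting the null at the 5% level.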

Answer 10

reject the null

Answer 11

“There is a less than 5% chance that I could have gotten these results if the null hypothesis were true, so I would rather conclude that the null is not true than accept such an unlikely outcome.”

Answer 12

the two samples are paired, or dependent, because they contain the same subjects. Conversely, an independent-samples t test uses different subjects in the two samples.

Answer 13

used when the data of two samples are statistically independent
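The difference between the paired and independent-samples t tests can be sketched on one small data set. The before/after numbers are invented, and the independent version shown is Welch's unequal-variance t statistic, chosen here for simplicity rather than because the cards specify it.

```python
from math import sqrt
from statistics import mean, stdev, variance


def paired_t(first, second):
    """t statistic computed on the per-subject differences."""
    diffs = [b - a for a, b in zip(first, second)]
    return mean(diffs) / (stdev(diffs) / sqrt(len(diffs)))


def independent_t(first, second):
    """Welch's t statistic, treating the two samples as unrelated groups."""
    se = sqrt(variance(first) / len(first) + variance(second) / len(second))
    return (mean(second) - mean(first)) / se


# Hypothetical scores for the same five subjects, measured twice.
before = [10, 12, 9, 11, 13]
after = [12, 14, 10, 13, 15]

t_paired = paired_t(before, after)
t_independent = independent_t(before, after)
print(round(t_paired, 3), round(t_independent, 3))
```

On these numbers the paired statistic is about 9.0 while the independent one is about 1.6, because pairing removes the subject-to-subject variability from the comparison.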

Answer 14

explanations/hypotheses/theories

Answer 15

predictions

Answer 16

stats and probability are not intuitive: we tend to jump to conclusions; we tend to be overconfident; we see patterns in random data; we don't realize that coincidences are common; we find it hard to combine probabilities (e.g. the Monty Hall problem); we are fooled by regression to the mean

Answer 17

is the repetition of an experimental condition so that the variability associated with the phenomenon can be estimated.

Answer 18

The group of replicated measurements that is used to help estimate natural variability

Answer 19

the selection of a subset (a statistical sample) of individuals

Answer 20

all individuals in a population should have an equal probability of being selected so that the proportions sampled can help us estimate the probability that similar samples would occur in the future
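A minimal sketch of equal-probability sampling, assuming the whole population can be listed. The population labels, the sample size, and the helper name `simple_random_sample` are made up for the example.

```python
import random


def simple_random_sample(population, size, seed=None):
    """Draw `size` individuals without replacement, each equally likely."""
    rng = random.Random(seed)  # a seed makes the draw reproducible
    return rng.sample(population, size)


# Hypothetical population of 1000 labelled individuals.
population = [f"individual_{i}" for i in range(1000)]
sample = simple_random_sample(population, 50, seed=42)
print(len(sample), len(set(sample)))
```

`random.sample` draws without replacement, so no individual appears twice in the sample.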

Answer 21

If something about the sampling process causes a particular type of individual in the population to be more likely to be sampled

Answer 22

It is the amount by which samples will differ due to chance

Answer 23

techniques attempt to overcome this problem by “using information about the population” to choose a more representative sample

Answer 24

is the act of choosing the number of observations or replicates to include in a statistical sample

Answer 25

the power of a statistical test is the probability that the test will detect: 1) a pattern in the population if the pattern truly exists or 2) the effect of a specific condition on the population if the effect truly exists
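Power can be estimated by simulation: generate many data sets in which the effect truly exists and count how often the test detects it. The sketch below assumes a two-sided z test with a known standard deviation of 1 and a 5% significance level. The effect sizes, the sample size, and the function name are illustrative choices, not values from the cards.

```python
import random
from math import sqrt


def estimated_power(effect, n, simulations=2000, seed=0):
    """Fraction of simulated experiments in which the null (mean = 0) is rejected."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(simulations):
        sample = [rng.gauss(effect, 1.0) for _ in range(n)]
        z = (sum(sample) / n) * sqrt(n)  # z statistic with known sd = 1
        if abs(z) > 1.96:  # two-sided test at the 5% level
            rejections += 1
    return rejections / simulations


weak = estimated_power(effect=0.2, n=20)    # weak pattern in the population
strong = estimated_power(effect=0.8, n=20)  # strong pattern in the population
print(weak, strong)
```

With the sample size held fixed, the stronger effect is detected far more often than the weak one.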

Answer 26

A strong pattern

Answer 27

includes experimental designs in which treatments are not replicated (though samples may be) or replicates are not statistically independent resulting in an inflation of the reported number of samples or replicates.

Answer 28

dealing with two variables, usually an independent variable and a dependent variable

Answer 29

will deal with more than two variables (e.g. three or more dependent variables, or any combination of multiple independent and dependent variables)

Answer 30

the study of populations of two or more different species occupying the same geographical area at a particular time

Answer 31

the placement of species and/or sampled locations into groups; used to distinguish different kinds of communities from each other when there appear to be clear distinctions

Answer 32

the arrangement or ‘ordering’ of species and/or sample locations along environmental gradients. It can be more useful when there are no clearly distinct kinds of communities because they grade into one another with fuzzy boundaries

Answer 33

has taxa (usually species) as rows and samples as columns (Table 1) or vice versa

Answer 34

takes your cloud of data points, and rotates it such that the maximum variability is visible. Another way of saying this is that it identifies your most important differences.
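The "rotation" idea can be sketched for a two-dimensional cloud, where the principal axes have a closed form. This is a simplified stand-in for a full ordination, assuming only two variables. The four points and the function name `pca_2d` are invented for the example.

```python
from math import atan2, cos, sin, sqrt


def pca_2d(points):
    """Return centroid, eigenvalues (largest first), first axis, and axis-1 scores."""
    n = len(points)
    cx = sum(x for x, _ in points) / n
    cy = sum(y for _, y in points) / n
    centered = [(x - cx, y - cy) for x, y in points]

    # Sample covariance matrix [[sxx, sxy], [sxy, syy]].
    sxx = sum(x * x for x, _ in centered) / (n - 1)
    syy = sum(y * y for _, y in centered) / (n - 1)
    sxy = sum(x * y for x, y in centered) / (n - 1)

    # Eigenvalues of a symmetric 2 x 2 matrix: variance along each rotated axis.
    half_trace = (sxx + syy) / 2
    spread = sqrt(((sxx - syy) / 2) ** 2 + sxy**2)
    eigenvalues = (half_trace + spread, half_trace - spread)

    # Direction of greatest variance (the first principal axis).
    angle = 0.5 * atan2(2 * sxy, sxx - syy)
    axis1 = (cos(angle), sin(angle))
    scores = [x * axis1[0] + y * axis1[1] for x, y in centered]
    return (cx, cy), eigenvalues, axis1, scores


# Hypothetical cloud of four points.
cloud = [(1, 2), (2, 1), (3, 4), (4, 3)]
center, eigenvalues, axis1, scores = pca_2d(cloud)
explained = eigenvalues[0] / sum(eigenvalues)
print(center, eigenvalues, round(explained, 2))
```

For this cloud the first axis points along the diagonal and accounts for 80% of the total variance, which is the sense in which it shows the most important differences.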

Answer 35

DCA only represents the patterns of dependent variables (species abundance) but does not directly compare the species abundances to the possible independent variables that cause them. We could use the DCA to make hypotheses about the causes of the species distributions.

Answer 36

It is called a triplot because it simultaneously displays three pieces of information: samples as points, species as points, and environmental variables as lines.

Answer 37

include several different statistical methods in which the data are not assumed to come from prescribed models that are ‘custom fit’ to the data by a small number of parameters

Answer 38

use general model descriptions associated with one or more numerical parameters, which can be adjusted to allow the models to be applied to a variety of data sets. Examples: the normal distribution model, the Poisson distribution model, and the binomial distribution model

Answer 39

These permutations keep the actual data intact, but randomly associate the environmental data with the species data
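The permutation idea can be sketched with one environmental variable and one species. This is a deliberately simplified analogue of what an ordination program does, not its actual procedure. The values are kept intact, only their pairing is shuffled, and the observed correlation is compared with the shuffled ones. The data and function names are invented for the example.

```python
import random
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def permutation_p_value(env, abundance, permutations=2000, seed=0):
    """Share of random re-pairings with a correlation at least as strong as observed."""
    rng = random.Random(seed)
    observed = abs(pearson_r(env, abundance))
    shuffled = list(env)  # copy, so the original data stay intact
    at_least_as_extreme = 0
    for _ in range(permutations):
        rng.shuffle(shuffled)  # randomly re-associate env values with sites
        if abs(pearson_r(shuffled, abundance)) >= observed - 1e-12:
            at_least_as_extreme += 1
    # Count the observed arrangement itself as one of the permutations.
    return (at_least_as_extreme + 1) / (permutations + 1)


# Hypothetical measurements at eight sites.
env = [1, 2, 3, 4, 5, 6, 7, 8]
abundance = [2, 1, 4, 3, 6, 5, 8, 7]
p_value = permutation_p_value(env, abundance)
print(p_value)
```

A small value here means that few random pairings reproduce a relationship as strong as the observed one.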

Answer 40

Thus, DGA is best coupled with an ordination (multivariate) technique like CCA.

Answer 41

If we directly include environmental variables as independent variables we are changing our DCA into a CCA

Answer 42

basically the center of a cloud of points

Answer 43

a ‘best fit’ line for the cloud of points
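The center of a cloud and a line fitted through it can be computed in a few lines. The sketch below uses ordinary least squares as one common reading of "best fit"; in an ordination context the line may instead be a principal axis, so treat this as an assumption. The points and function names are invented for the example.

```python
def centroid(points):
    """Mean x and mean y of the cloud."""
    n = len(points)
    return (sum(x for x, _ in points) / n, sum(y for _, y in points) / n)


def least_squares_line(points):
    """Slope and intercept of the ordinary least-squares line y = slope * x + intercept."""
    cx, cy = centroid(points)
    sxy = sum((x - cx) * (y - cy) for x, y in points)
    sxx = sum((x - cx) ** 2 for x, _ in points)
    slope = sxy / sxx
    intercept = cy - slope * cx  # forces the line through the centroid
    return slope, intercept


# Hypothetical cloud of five points.
cloud = [(1, 2), (2, 3), (3, 5), (4, 4), (5, 6)]
center = centroid(cloud)
slope, intercept = least_squares_line(cloud)
print(center, round(slope, 2), round(intercept, 2))
```

Here the centroid is (3.0, 4.0) and the fitted line is roughly y = 0.9x + 1.3, which passes through the centroid.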

Answer 44

they are ranked from the highest to the lowest. These are related to the amount of variation explained by each axis

Answer 45

is a range of values that has a 95% chance of containing the true single value that you are trying to estimate.
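The "95%" can be checked by simulation: build an interval from many random samples and count how often it contains the true value. The sketch assumes normally distributed data with a known standard deviation, so a z interval applies. The true mean, standard deviation, sample size, and function names are illustrative assumptions.

```python
import random
from math import sqrt
from statistics import NormalDist


def z_interval(sample, sigma, confidence=0.95):
    """Confidence interval for the mean when the population sd is known."""
    z = NormalDist().inv_cdf(0.5 + confidence / 2)  # about 1.96 for 95%
    center = sum(sample) / len(sample)
    half_width = z * sigma / sqrt(len(sample))
    return center - half_width, center + half_width


def coverage(true_mean=10.0, sigma=2.0, n=25, repeats=2000, seed=1):
    """Fraction of simulated intervals that contain the true mean."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(repeats):
        sample = [rng.gauss(true_mean, sigma) for _ in range(n)]
        low, high = z_interval(sample, sigma)
        if low <= true_mean <= high:
            hits += 1
    return hits / repeats


observed_coverage = coverage()
print(observed_coverage)
```

The printed fraction should land close to 0.95, with some run-to-run variation if the seed is changed.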

Answer 46

The random distribution of numbers of sightings

Answer 47

two parameters; you are distinguishing between two (and only two) possible outcomes

Answer 48

not different

Answer 49

correlation

Answer 50

discrete variable

(77 cards)