Module 3 Flashcards

You may prefer our related Brainscape-certified flashcards:
1
Q

How does the binomial distribution work?

A

In a population, and a fixed proportion of individuals are assigned to 1/2 groups: successes or failures.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define “Binomial distribution” (2 things that it provides)

A

Provides the probability distribution for the number of successes in a fixed number of independent trials aka the proportion of successes.
-Has a Bernoulli trial (not many)

BD = X successes in n trials.
*look at formula (no need to mem)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the assumptions of the binomial distribution?

A
  1. The number of trials is fixed
  2. Separate trials are independent
    3 The probability of success is the same in every trial.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How does n affect the binomial distribution?

A

N is large = narrow distribution

N is small = broad distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the assumptions for a Goodness-of-fit test?

A
  1. Individuals in a data set are random samples from a population
  2. Individuals are chosen independently from one another.
  3. No category has expected frequency of <1.
  4. No more than 20% have a frequency of <5.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the purpose of the Poisson distribution?

A

Tests whether successes occur randomly in time or space. Describes the number of successes over time or space.
-Has many Bernoulli trials

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a goodness-of-fit test used for? (ex. Chi-squared)

A

To compare observed and expected variables to see if they’re significantly different from one another.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is a goodness-of-fit test used for? (ex. Chi-squared)

A

To compare observed and expected variables to see if they’re significantly different from one another.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Define “extrinsic data”

A

When expected info is derived from info other than the data you are analysing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Define “Intrinsic data”

A

When expected info is derived from the data you are analysing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What variables does the GOF test use?

A

1 categorical variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do you calculate the degrees of freedom for a GOF test?

A

df= k-p-1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do you calculate the degrees of freedom for an extrinsic data test?

A

df= k-p-1

  • k = # of categories
  • p = # of parameters estimated (columns-1)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the alternative distributions to Poisson?

A
  1. Clumped - successes occur closer together than expected by chance. Variance > mean.
  2. Dispersed - successes occur more evenly spread out than expected by chance. Mean > Variance.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a Bernoulli trial ??

A

A trial with only 2 possible outcomes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a contingency test used for?

A

Used to test the dependence of a (categorical) variable to another.

17
Q

How do you calculate the degrees of freedom of an intrinsic data test?

A

df=rc-[(r-1)+(c-1)]-1
r = row
c=column

18
Q

What type of test uses intrinsic data to generate expected values?

A

Contingency

19
Q

What type of test uses intrinsic data to generate expected values?

A

Contingency

20
Q

What are the conditions that need to be met for a Poisson distribution to occur?

A
  1. The probability of two or more occurrences in a single sample subdivision is negligibly small.
  2. The probability of one occurrence in a sample subdivision is proportional to the size of the subdivision (in time or space). Ex. 3 times area = 3 times probability.
  3. The outcome in one subdivision of the sample unit is independent of the outcome in all other Bernoulli.
  4. The probability of an occurrence is identical for all sample subdivisions.
21
Q

What is the purpose of a G-test?

A

Testing the null hypothesis (of no association) between two or more categorical variables.

22
Q

What are the assumptions of a G-test?

A
  1. Random samples
  2. No more than 20% of cells have a frequency of <5
  3. No cell has a frequency of <1.
23
Q

What are the degrees of freedom for a G-test?

A

(r-1)(c-1) ***not sure if this is correct