Chapter 2 Flashcards

1
Q

If we have sample data from just 1 sample, which 3 approaches can be used to obtain the sampling distribution?

A
  1. bootstrapping
  2. exact approaches
  3. theoretical approximations
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Bootstrap sample

A

A random sample drawn from the initial sample. This is done through sampling with replacement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How big should a bootstrap sample be?

A

Equal to the initial sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

In bootstrapping, what happens when we have an unrepresentative initial sample?

A

Results will be skewed if the proportion in the initial sample is very different from the proportion in the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Apart from the proportion in the initial sample, what else affects the accuracy of the sampling distribution?

A

The size of the initial sample. Bigger samples will yield better results, but will still not be perfectly accurate.

Note: The population should be much larger than the sample size!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the main limitation of bootstrapping?

A

We are never sure if our initial sample reflects the population well

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the main advantage of bootstrapping?

A

Every sample statistic can be bootstrapped

E.g. the median weight of candies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When do we use bootstrapping?

A
  • When the assumptions for theoretical approximation are not met
  • When we have a continuous variable
  • When SPSS does not have a test for the sample statistic we want to examine
  • When we know nothing about the population proportion
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

When do we use exact approaches?

A

When we (think we) know the proportion of the variable in the population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are limitations of exact approaches?

A
  • Only works for discrete or categorical variables

- Needs a lot of computing power

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What do we actually do in an exact approach?

A

We use this approach to calculate the probabilities for all possible outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Describe the binomial distribution

A

It is a frequency distribution of the possible number of successes (vs. failures) for N repeated trials in which there is the same probability of success. There are always 2 possible outcomes

E.g. number of heads in 3x coin toss. Either heads or tails.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What would we use an exact binomial test for?

A

Questions involving population proportions

E.g. β€œIs 50% of candies in the population sticky?”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What would we use an exact cross tabs test for?

A

To examine the association between 2 categorical variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What would we use an exact chi-squared test for?

A

To test whether 1 categorical variable follows an hypothesized population mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What do we mean by a theoretical approximation?

A

We then use a known mathematical function to approximate the sampling distribution

T-distribution / normal distribution

17
Q

Why would we use theoretical approximations?

A
  • Quick computations
  • They offer plausible arguments about chance and probabilities (e.g. we would expect a sampling distribution to be symmetrical, so using a normal distribution is then logical)
18
Q

When can we use a theoretical approximation?

A

sample size * population prop. = β‰₯ 5

We need a large sample size, and a population proportion of 0.5 is ideal

19
Q

Why do we pay attention to the tail ends of the probability distribution?

A

These are used for significance tests

20
Q

Repeated measures

A

Comparing one set of measurements with a second set of measurement from the SAME sample