WEEK 3: Limited samples and prediction of the main features of a variable, I Flashcards

1
Q

What is a population?

A

a collection of all possible individuals, objects or measurements of interest

2
Q

What is a sample?

A

a portion or part of the population of interest

3
Q

What is the goal of a sample?

A

representativeness - necessary if you want to generalise to a population + to allow the use of inferential techniques

4
Q

What is a probability sample?

A

A sample selected such that each item or person in the population being studied has a known likelihood of being included in the sample

5
Q

What are the criteria / methods to select a representative sample?

A

sample size
sampling methods
sampling error

6
Q

What 3 factors determine the size of a sample?

A

the desired level of reliability (Level of Confidence)

the margin of error the researcher will tolerate

the variation in the population being studied

7
Q

How many observations are regarded as the minimum for findings to be statistically reliable?

A

100 observations

8
Q

What four methods can be used to construct a probability sample?

A

Simple Random Sampling

Systematic Random Sampling

Stratified Random Sampling

Cluster Sampling

9
Q

What is simple random sampling?

A

every possible sample of size n has the same probability of being selected, so each individual has an equal chance of being included

eg Kent school has 845 students; a teacher wants to run a study and selects a random sample of 52 students. Each student writes their name on a slip of paper and puts it in a box. After the slips are mixed, the first name is drawn, and the process is repeated until the sample of 52 students is chosen

process = sampling without replacement, so the probability of any particular remaining name being drawn changes with each selection, eg 1/845, then 1/844, then 1/843
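
The slips-in-a-box procedure can be sketched in Python. This is only an illustration: the roster names and the fixed seed are assumptions, not part of the original example.

```python
import random

# Hypothetical roster standing in for the 845 Kent school students.
students = [f"student_{i}" for i in range(1, 846)]

# Fixed seed (an assumption) so the draw can be repeated exactly.
rng = random.Random(42)

# random.sample draws without replacement, like pulling slips from the box:
# no student can be selected twice.
sample = rng.sample(students, k=52)

print(len(sample))       # number of students drawn
print(len(set(sample)))  # number of distinct students drawn
```

Both printed counts are 52, which confirms that nobody was drawn twice.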

10
Q

How would you perform simple random sampling in Excel?

A

Data –> Data Analysis –> Sampling –> Fill boxes with info

eg fill in:
Input Range
Sampling Method = Random
Number of Samples
Output Range

11
Q

What is systematic random sampling?

A

items are taken at regular intervals, eg every 5th person in a list

the items/individuals of the population are arranged in some order, a random starting point is selected, and then every kth member of the population is selected for the sample

12
Q

How do you calculate k? (Systematic Random Sampling)

A

k = population size / sample size

13
Q

A population of students at Kent school is 845 students, a sample of 52 students will be selected from this, how would you do a systematic random sample?

A

1) work out k
–> k = population size / sample size = 845/52 = 16.25, rounded down to 16

2) use random sampling to select the first name from among the first 16 students on the list, then select every 16th name on the list after it
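
A minimal Python sketch of this procedure follows. The function name `systematic_sample` and the stand-in list of student numbers are hypothetical. Because k is rounded down, some starting points would give 53 names, so the sketch cuts the selection off at the required sample size.

```python
import random

def systematic_sample(population, n, rng=random):
    """Pick a random start among the first k items, then take every k-th item."""
    k = len(population) // n   # 845 // 52 = 16
    start = rng.randrange(k)   # 0-based position among the first k items
    # k was rounded down, so some starts yield one item too many;
    # keep only the first n selected items.
    return population[start::k][:n]

# Hypothetical ordered list: students numbered 1 to 845.
students = list(range(1, 846))
sample = systematic_sample(students, 52, random.Random(0))

print(len(sample))             # 52
print(sample[1] - sample[0])   # 16, the sampling interval
```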

14
Q

What is stratified random sampling?

A

a population is first divided into subgroups (strata) and a sample is selected from each stratum

(the population is divided into appropriate categories + sample is chosen so the proportions in each category are the same as the proportions in the population)

15
Q

Population size = 400
Sample size = 50
Companies are divided into strata; the number of firms in each stratum is:
1 = 24
2 = 43
3 = 208
4 = 119
5 = 6

How would you do a stratified random sample?

A

1) work out the relative frequency for each stratum (in a table) = number of firms in the stratum / population size (the total should equal 1 when added together)
2) work out the number sampled from each stratum = relative frequency x sample size, rounded to a whole number (when added together, the total should equal the sample size)
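
The two steps can be checked with a short Python sketch that uses the stratum counts from the question. The variable names are illustrative. Simple rounding happens to add up to 50 here, but that is not guaranteed for other data.

```python
# Number of firms in each stratum, taken from the question.
stratum_sizes = {1: 24, 2: 43, 3: 208, 4: 119, 5: 6}
population_size = sum(stratum_sizes.values())  # 400
sample_size = 50

allocation = {}
for stratum, firms in stratum_sizes.items():
    # Step 1: relative frequency of the stratum in the population.
    relative_frequency = firms / population_size
    # Step 2: number sampled, rounded to a whole firm.
    allocation[stratum] = round(relative_frequency * sample_size)

print(allocation)                # {1: 3, 2: 5, 3: 26, 4: 15, 5: 1}
print(sum(allocation.values()))  # 50
```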

16
Q

What is cluster sampling?

A

uses a natural division of the population into clusters
first chooses a random sample of clusters and then a sample from each chosen cluster, usually by random sampling
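
A two-stage cluster sample might look like the sketch below. The 20 clusters of 30 members, the number of clusters chosen and the number drawn per cluster are all made-up values for illustration.

```python
import random

rng = random.Random(1)  # fixed seed (an assumption) for a repeatable draw

# Hypothetical population: 20 clusters of 30 members each.
clusters = {c: [f"c{c}_m{m}" for m in range(30)] for c in range(20)}

# Stage 1: choose a random sample of clusters.
chosen_clusters = rng.sample(sorted(clusters), k=4)

# Stage 2: choose a random sample of members within each chosen cluster.
sample = [
    member
    for c in chosen_clusters
    for member in rng.sample(clusters[c], k=5)
]

print(len(chosen_clusters), len(sample))  # 4 20
```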

17
Q

What is the sampling error?

A

the difference between a sample statistic and its corresponding population parameter

sampling error = (sample statistic - population parameter)

18
Q

What is the sample statistics symbol for the mean?

A

X̄

19
Q

What is the population parameter symbol for the mean?

A

μ

20
Q

What is the sampling error of the mean?

A

X̄ - μ

21
Q

What is the sample statistics symbol for the standard deviation?

A

s

22
Q

What is the population parameter symbol for the standard deviation?

A

σ

23
Q

What is the sampling error of the standard deviation?

A

s - σ

24
Q

What is the sample statistics symbol for the Variance?

A

s^2

25
Q

What is the population parameter symbol for the Variance?

A

σ^2

26
Q

What is the sampling error of the variance?

A

s^2 - σ^2

27
Q

What is the sample statistics symbol for the Proportion?

A

p

28
Q

What is the population parameter symbol for the Proportion?

A

π

29
Q

What is the sampling error of the proportion?

A

p - π

30
Q

What is the sampling distribution of the sample mean?

A

a probability distribution consisting of all possible sample means of a given sample size selected from a population

31
Q

What will the mean of the distribution of sample means be exactly equal to?

A

the population mean

–> the mean of the distribution of sample means = the population mean

32
Q

What is σX̄ (the standard deviation of the sampling distribution of the sample mean)?

A

σX̄ = σ / √n
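
As a quick numerical illustration of the formula, the sketch below uses made-up values (σ = 10 with n = 25 and n = 100). The helper name `standard_error` is hypothetical.

```python
import math

def standard_error(sigma, n):
    """Standard deviation of the sample mean: sigma divided by the square root of n."""
    return sigma / math.sqrt(n)

# Made-up population standard deviation and sample sizes.
print(standard_error(10, 25))   # 2.0
print(standard_error(10, 100))  # 1.0
```

Quadrupling the sample size halves the standard deviation of the sample mean.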

33
Q

How do you work out the population mean (μ)?

A

μ = ∑Xi / N

34
Q

How do you work out the population standard deviation (σ)?

A

σ = √(∑(Xi-μ)^2/N)
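
The formulas for μ and σ can be applied directly in Python. The eight values below are a hypothetical population, chosen so the arithmetic comes out to whole numbers.

```python
import math

# Hypothetical population of N = 8 values.
values = [2, 4, 4, 4, 5, 5, 7, 9]
N = len(values)

# Population mean: sum of all values divided by N.
mu = sum(values) / N

# Population standard deviation: divide by N (not N - 1), then take the root.
sigma = math.sqrt(sum((x - mu) ** 2 for x in values) / N)

print(mu)     # 5.0
print(sigma)  # 2.0
```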

35
Q

How would you work this out

“What is the sampling distribution of the sample mean for samples of size 2?”

in Excel?

A

In Excel use the function =COMBIN(7, 2) to calculate the total number of combinations of size 2 (the second argument) from a population of 7 items (the first argument).

We first create all possible samples of size two, using all 21 possible combinations of the original 7 individuals, and then we calculate the mean of each of these 21 samples.
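
The same enumeration can be sketched outside Excel. In the Python version below the seven population values are hypothetical, since the original data are not given here. `math.comb` plays the role of COMBIN.

```python
from itertools import combinations
from math import comb
from statistics import fmean

# Hypothetical values for the 7 individuals in the population.
population = [1, 2, 3, 4, 5, 6, 7]

# All possible samples of size 2, and the mean of each one.
samples = list(combinations(population, 2))
sample_means = [fmean(s) for s in samples]

print(comb(7, 2), len(samples))  # 21 21
print(fmean(population))         # 4.0 (population mean)
print(fmean(sample_means))       # 4.0 (mean of the 21 sample means)
```

The last two lines agree, which illustrates that the mean of the distribution of sample means equals the population mean.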

36
Q

Why is sampling preferred to surveying the whole population?

A

Surveying the whole population can be very expensive.

It can be physically impossible to find all the items in the population.

Sample estimates can be very accurate.