Data collection Flashcards

1
Q

population

A

A population in statistics means all the individuals/objects you are interested in for a particular investigation

eg. all 6-year-old girls in the UK, all items manufactured by a factory or all the trees in a public park.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

census

A

A census observes or measures every member of a population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

sample

A

A sample is a selection of observations taken from a subset of the population which is used to find out information about the population as a whole. We then assume that the results for this sample are representative of the whole population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Census pros

A

Should give a completely accurate result

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Census cons

A

Time consuming and expensive
Can’t be used when testing process destroys the item
Hard to process a large amount of of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

sample pros

A

. Less time consuming and expensive than a census
. Fewer people have to respond
. Less data to process than in a census

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

sample cons

A
  • The data may not be as accurate
  • The sample may not be large enough to give information about small sub-groups of the population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

sampling units

A

Individual units of a population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

sampling frame

A

A list (or other representation) of the items available to be sampled

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

sampling fraction

A

The proportion of the available items that are actually sampled is called the sampling fraction. A 100% sample is called a census.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

sampling error

A

The difference between an estimate of a parameter (e.g. mean) derived from sample data and its true value. To reduce the sampling error, you want your sample to be as representative of the parent population as you can make it.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

bias

A

different types of people should be represented in the sample that is chosen. If the sample involves a more of certain group of people within the population, then it is said to be biased. To make good use of a sample we want to avoid bias.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

representative sample

A

A sample that is typical of the whole population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Random Sampling Techniques

A
  • Simple random sampling
  • Systematic sampling
  • Stratified sampling
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Simple random sampling

A

A simple random sample of size n is one where every possible sample of size n has an equal chance of being selected. This can be achieved by ensuring every member of a finite population has an equal chance of being selected as long as sampling is without replacement and selections are independent of each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

two methods of choosing the numbers in Simple random sampling

A
  • Using a random number generator (using a calculator, computer or random number table).
  • Lottery sampling – eg. writing members of the sampling frame on tickets and drawing them out of a bag.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Simple random sampling - pros

A
  • Free of bias
  • Easy and cheap to implement for small populations and small samples
  • Each sampling unit has a known and equal chance of selection
18
Q

Simple random sampling - cons

A
  • Not suitable when the population size or the sample size is large
  • A sampling frame is needed
19
Q

Stratified sampling

A

In stratified sampling, the population is divided into mutually exclusive strata and a random sample is taken from each.

Divide population into sub-groups or strata: e.g. low income, middle income, high income, male, female

20
Q

proportional stratified sampling

A

If we randomly sample from each group in proportion to the size of the group then it is called proportional stratified sampling.

21
Q

stratified sampling - pros

A
  • Sample accurately reflects the population structure
  • Guarantees proportional representation of groups within a population
22
Q

stratified sampling - cons

A
  • Population must be clearly classified into distinct strata
  • Selection within each stratum suffers from the same disadvantages as simple random sampling
23
Q

Systematic sampling

A

In systematic sampling, the required elements are chosen at regular intervals from an ordered list.

From a list, choose a random starting item, then sample, for example, every 5th item.

To determine the interval required you divide the population by the required sample.

24
Q

Systematic sampling - pros

A
  • Simple and quick to use
  • Suitable for large samples and large
  • populations
25
Q

Systematic sampling - cons

A
  • A sampling frame is needed
  • It can introduce bias if the sampling frame is not random
26
Q

Non-random sampling

A
  • Quota sampling
  • Opportunity sampling
27
Q

Quota sampling

A
  • In quota sampling, an interviewer or researcher selects a sample that reflects the characteristics of the whole population.
  • The population is divided into groups according to a given characteristic. The size of each group determines the proportion of the sample that should have that characteristic.
  • This is similar to a stratified sample but a specific number of people from each particular strata is sampled e.g. male/female, different age groups, etc.
28
Q

use of Quota sampling

A

This method is often used for market research and is usually used by interviewers. The interviewer would meet people, assess their group and then, after interview, allocate them into the appropriate quota. This continues until all quotas have been filled. If you then begin interviewing someone for whom your quota is full, you just move on. The actual selection of the sample members is up to the interviewer, whereas stratified samples are done at random.

29
Q

Quota sampling - cons

A

Non-random sampling can introduce bias
Population must be divided into groups, which can be costly or inaccurate
Increasing scope of study increases number of groups, which adds time and expense
Non-responses are not recorded as such

30
Q

Quota sampling - pros

A

Allows a small sample to still be representative of the population
No sampling frame required
Quick, easy and inexpensive
Allows for easy comparison between different groups within a population

31
Q

Opportunity sampling

A

Opportunity / convenience sampling consists of taking the sample from people who are available at the time the study is carried out and who fit the criteria you are looking for.

An example is interviewing passers-by on the street or the first 20 people you meet outside a supermarket on a Monday morning.

32
Q

Opportunity sampling - pros

A

Easy to carry out
Inexpensive

33
Q

Opportunity sampling - cons

A
  • Unlikely to provide a representative sample
  • Highly dependent on individual researcher
34
Q

Qualitative variables or qualitative data

A

Non-numerical data that come in classes or categories e.g. favourite colours, makes of car.

35
Q

Quantitative variables or quantitative data

A

Numerical data for which the numbers are meaningful e.g. times to run a race, height.

36
Q

Numerical data categories

A

Discrete
Continuous

37
Q

Numerical data categories - Discrete

A

The set of values taken by the data can be listed
e.g. shoe sizes (), number of goals, number of children in a family

38
Q

Numerical data categories - Continuous

A

Values can’t be listed because data can take any value in a particular range

e.g. heights, mass, time, etc.

39
Q

Bivariate data

A

Two variables are assigned to each item e.g. height and weight, age and mileage of cars

40
Q

Class boundaries

A

Class boundaries are the minimum and maximum values that belong in each class.

41
Q

midpoint

A

The midpoint is the average of the class boundaries.

42
Q

class width

A

The class width is the difference between the upper and lower class boundaries.