UNIT 1: The collection of data Flashcards

1
Q

Primary Data

A

Primary data is collected by, or for, the person who is going to use it.

2
Q

Secondary Data

A

Secondary data has been collected by someone else

3
Q

Advantages Primary Data

A
  • Accuracy is known
  • Collection method is known
  • Can find answers to specific questions
4
Q

Disadvantages Primary Data

A
  • Time-consuming
  • Can be expensive

5
Q

Advantages Secondary Data

A
  • Cheap
  • Quick
  • Can be reliable when taken from a trusted source (e.g. the Office for National Statistics or official sporting results pages)
6
Q

Disadvantages Secondary Data

A
  • Don’t know method of collection
  • May not be able to find answers to specific questions
  • Websites may be unreliable
  • May be out of date
7
Q

Qualitative Data

A

Non-numerical observations

8
Q

Quantitative Data

A

Numerical observation

9
Q

Continuous Data

A

Can take any numerical value on a scale

10
Q

Discrete Data

A

Can only take particular values (e.g. shoe size, number of words typed per minute)

11
Q

Ordinal Data

A

Data that can be put in order, such as data from a numerical rating scale

12
Q

Categorical Data

A

Data which can be sorted into non-overlapping categories or class intervals

13
Q

Bivariate Data

A

Data which involves pairs of related data values (each pair of data values refers to one item)

14
Q

Multivariate Data

A

Data which involves three or more related data values. Each set of data values refers to one item

15
Q

Population

A

Everything or everybody that could possibly be involved in an investigation

16
Q

Census

A

A survey or investigation with data taken from every member of the population

17
Q

Sample

A

Contains information about part of the population

18
Q

Census: Advantages

A
  • Accurate
  • Takes the whole population into account

19
Q

Census: Disadvantages

A
  • Time consuming
  • Expensive
  • Difficult to ensure the whole population is used
  • Lots of data to handle
20
Q

Sample: Advantages

A
  • Cheaper than a census
  • Quicker than a census
  • Less data to handle
21
Q

Sample: Disadvantages

A

  • May not be completely representative of the population

22
Q

Bias

A
  • If a sample is not representative of the population it is biased.
  • It could be selected unfairly.
  • It could be that the sample size is too small
23
Q

Sampling Frame

A

A list of all the people/things from which the sample is selected

24
Q

Sampling Unit

A

The people/things that are being sampled

25
Electoral Roll
A list of people who are registered to vote in the UK. Often the easiest way to get a list of adults in a geographical area.
26
Petersen Capture-Recapture Formula
M/N = m/n, where:
  • M = total tagged at start
  • N = population size (unknown)
  • m = number tagged in sample
  • n = sample size
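As a worked sketch of the formula M/N = m/n: rearranging gives N = (M × n) / m. The function name and the figures below are made up for illustration only.

```python
def petersen_estimate(total_tagged, sample_size, tagged_in_sample):
    """Estimate the population size N from M/N = m/n, i.e. N = M * n / m."""
    if tagged_in_sample <= 0:
        raise ValueError("need at least one tagged individual in the sample")
    return total_tagged * sample_size / tagged_in_sample


# Hypothetical figures: 40 fish are tagged at the start (M). Later a sample
# of 50 fish (n) is caught, and 8 of them (m) carry a tag.
estimate = petersen_estimate(total_tagged=40, sample_size=50, tagged_in_sample=8)
print(estimate)  # 250.0
```

The result is only an estimate of N, and it relies on the capture-recapture assumptions holding.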
27
Petersen Capture-Recapture Assumptions
  • Population is closed (state specifics in context)
  • Tagging doesn't affect survival rate
  • Tags don't get lost or removed and are easily recognisable
  • The sample size is large enough to represent the population
  • The probability of being caught is equal for all individuals in the population
28
Random Sampling
In a random sample, every member of the population has an equal chance of being selected.
29
Random Sampling Advantages
  • Provided it is large, the sample is likely to be representative of the population
  • Choice of members of the sample is unbiased
30
Random Sampling Disadvantages
  • Needs a full list of the whole population
  • Needs a large sample
31
Methods of random sampling
Always number your sampling frame first, then choose numbers by one of these methods:
(1) Pull numbers from a hat
(2) Use the RanInt function on your calculator
(3) Use a random number table, selecting the starting point on the table at random
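The same idea (number the sampling frame, then pick numbers at random) can be sketched in Python. This is a minimal illustration that uses Python's `random` module in place of a hat, calculator or table; the function name, the `seed` parameter and the student list are assumptions made for the example.

```python
import random


def simple_random_sample(sampling_frame, sample_size, seed=None):
    """Number the frame 1..N, draw distinct random numbers, return the matching members."""
    rng = random.Random(seed)  # a seed makes the draw repeatable
    numbers = rng.sample(range(1, len(sampling_frame) + 1), sample_size)
    return [sampling_frame[number - 1] for number in numbers]


# Hypothetical sampling frame of 30 students.
frame = [f"Student {i}" for i in range(1, 31)]
sample = simple_random_sample(frame, sample_size=5, seed=1)
print(sample)
```

Because the numbers are drawn without repeats, no member can be picked twice.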
32
Opportunity Sampling
Use the people or things that are available at the time
33
Quota Sampling
Group the population by characteristics (e.g. gender or age) and then ask a specific number of people from each group (the quota)
34
Judgement sampling
Use your judgement to select a sample that you think is representative
35
Cluster Sampling
The population falls into natural groups (clusters). Your sampling frame is the list of clusters; randomly select clusters to sample.
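A minimal sketch of cluster sampling, assuming that every member of each chosen cluster is then included in the sample. The tutor groups, names and function name are hypothetical.

```python
import random


def cluster_sample(clusters, number_of_clusters, seed=None):
    """Randomly choose whole clusters; return the chosen clusters with all their members."""
    rng = random.Random(seed)
    # The sampling frame is the list of cluster names, not the individuals.
    chosen = rng.sample(sorted(clusters), number_of_clusters)
    return {name: clusters[name] for name in chosen}


# Hypothetical clusters: four tutor groups in a school.
tutor_groups = {
    "7A": ["Ali", "Bea", "Cal"],
    "7B": ["Dev", "Eve", "Fay"],
    "7C": ["Gus", "Hal", "Ivy"],
    "7D": ["Jon", "Kit", "Lea"],
}
chosen_groups = cluster_sample(tutor_groups, number_of_clusters=2, seed=4)
print(chosen_groups)
```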
36
Systematic Sampling
Pick a random starting point and then select every nth item (e.g. every 10th) on your sampling frame. The sampling frame needs to be numbered.
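A minimal sketch of systematic sampling, assuming the random starting point is chosen from within the first interval (one common way of doing it). The function name and the numbered frame are made up for illustration.

```python
import random


def systematic_sample(sampling_frame, interval, seed=None):
    """Pick a random start within the first interval, then take every interval-th item."""
    rng = random.Random(seed)
    start = rng.randrange(interval)  # list index from 0 to interval - 1
    return sampling_frame[start::interval]


# Hypothetical numbered sampling frame: items 1 to 100.
frame = list(range(1, 101))
sample = systematic_sample(frame, interval=10, seed=3)
print(sample)
```

With 100 items and an interval of 10, the sample always contains 10 items spaced 10 apart.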
37
Four things to comment on for non-random sampling
  • Bias
  • Cost
  • Time
  • Sample size
38
Stratified sampling calculation
(Total in stratum / Total population) × sample size
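A worked sketch of the stratified sampling calculation. The year groups, their sizes and the sample size of 50 are hypothetical figures chosen so the arithmetic comes out to whole numbers.

```python
def stratum_allocation(stratum_total, population_total, sample_size):
    """(Total in stratum / Total population) x sample size."""
    return stratum_total * sample_size / population_total


# Hypothetical school of 300 pupils, from which a sample of 50 is wanted.
year_groups = {"Year 9": 120, "Year 10": 90, "Year 11": 90}
population_total = sum(year_groups.values())

allocation = {
    name: round(stratum_allocation(total, population_total, 50))
    for name, total in year_groups.items()
}
print(allocation)  # {'Year 9': 20, 'Year 10': 15, 'Year 11': 15}
```

When the calculation does not give whole numbers, the answers have to be rounded, and the rounded values may then need adjusting so that they still add up to the sample size.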
39
Pilot Survey - reasons
  • To check the response rate
  • To check the questions make sense
  • To check you collect the data you are expecting
40
Questionnaire key points
  • Open questions: free written answers
  • Closed questions: multiple choice or opinion scale (must include "other" or "don't know")
  • Always include a timeframe (e.g. yesterday, last week, last year)
  • Avoid questions where respondents would be tempted to lie
  • Don't ask leading questions like "Don't you agree….?"
41
Interview: Advantages
  • High response rate
  • Interviewer can explain the questions
  • Respondents can explain their answers
  • Interviewer can put people at ease
42
Interview: Disadvantages
  • Respondents may be less honest on personal questions, or may try to impress the interviewer
  • Time-consuming, therefore expensive
  • Bias in who the interviewer speaks to
  • Sample size is small
43
Anonymous Questionnaire: Advantages
  • More honest answers to personal questions
  • Quick and cheap
  • No interviewer bias
  • Sample size can be as large as you like
44
Anonymous Questionnaire: Disadvantages
  • Low response rate
  • Respondents may not understand the questions
  • Researcher may not understand the answers
45
Reliable data
Data which can be replicated
46
Valid data
Data which measures what you want it to measure
47
Cleaning Data
The process of identifying gaps, anomalies or errors in the data. Often done in Excel using the "sort" and "find" functions.
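The same kind of check can be sketched in Python rather than Excel. This minimal example flags gaps (recorded here as `None`) and values outside an expected range; the function name, the range limits and the heights are all hypothetical.

```python
def find_problems(values, lowest, highest):
    """Return (position, value) pairs for gaps (None) and values outside the expected range."""
    problems = []
    for position, value in enumerate(values, start=1):
        if value is None or not (lowest <= value <= highest):
            problems.append((position, value))
    return problems


# Hypothetical heights in cm: None marks a gap and 1720 looks like a typing error.
heights_cm = [165, 172, None, 158, 1720, 169]
print(find_problems(heights_cm, lowest=100, highest=220))  # [(3, None), (5, 1720)]
```

Flagged values still need a decision in context: correct them, remove them, or keep them if they turn out to be genuine.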
48
Extraneous variables
A variable you are not interested in that could affect your result
49
Control group
Select two groups randomly. Give the control group no treatment. Give the test group treatment. Compare the results. (Often used for medical trials)
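A minimal sketch of selecting the two groups randomly, assuming a simple shuffle-and-split. The function name and participant labels are hypothetical.

```python
import random


def allocate_groups(participants, seed=None):
    """Shuffle the participants and split them into a control group and a test group."""
    rng = random.Random(seed)
    shuffled = list(participants)  # copy so the original list is left unchanged
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return shuffled[:half], shuffled[half:]


# Hypothetical trial with 10 participants.
people = [f"P{i}" for i in range(1, 11)]
control_group, test_group = allocate_groups(people, seed=7)
print("Control (no treatment):", control_group)
print("Test (treatment):", test_group)
```

Random allocation means that neither the researcher nor the participants choose who gets the treatment.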
50
Matched pairs
Two groups where each individual in one group is paired with an individual in the second group. They should have everything in common except the factor being studied.
51
Hypothesis
An idea that can be tested by collecting and analysing data.