Official Statistics and Big Data Flashcards

1
Q

What is secondary analysis?

A

The analysis of data gathered by someone else, for their own purposes, by researchers who were not involved in its collection

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the advantages of secondary analysis?

A
  1. Saves money and time
  2. Offers high quality data
  3. Gives opportunity for longitudinal analysis
  4. Allows subgroup/ subset analysis
  5. Opportunity for cross-cultural studies
  6. More time for data analysis
  7. Enables application of recent theory to old data
  8. Gets more value from original data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the disadvantages of secondary analysis?

A
  1. Need to become more familiar with how the data was collected/ coded/ managed
  2. Data can be very large and complex
  3. Data quality should not be taken for granted
  4. Variables important for analysis may be missing
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What advantages do official stats have over other forms of quan data such as survey data?

A
  1. Data has already been collected so saves time and expenses
  2. The people who source the data aren’t being asked questions that are part of a research project so its unobtrusive, fewer issues with reactivity
  3. There is the prospect of analysing the data both cross-sectionally and longitudinally
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Are official statistics neutral?

A

No, the data does not just sit there and wait to be collected in that particular way, instead, conscious social actors decide what data is needed, what data to collected and how to do so

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Why has the use and analysis of official statistics been so controversial?

A

Due to the lack of objectivity and socially constructed research

  • no stats can be considered ‘objective facts’ as they are all paid for and collected by an organisation with a purpose
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the issues with crime and deviance statistics?

A
  1. Crimes have to go through a vigorous process before they can be recorded by the police
  2. Where police officers are put will affect which crimes are caught
  3. Reliability of official statistics often jeopardised because definitions and policies regarding the phenomena to be counted vary over time
  4. Reliability (and validity) can of course be compromised due to ‘fiddling’ of stats by police office
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is counted under the crime recording rules?

A
  1. Serious not minor crimes
  2. Real issue with classification - if something has occurred how do we know what category it is in?
  3. How we count certain crimes - per victim, per offender, per crime?
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why do changes in offence totals not necessarily represent changes in criminal behaviour?

A
  1. Policing priorities
  2. Police recording - notifiable offence list
  3. Rules on recording and counting
  4. Crime reporting behaviour of the public - campaigns to encourage reporting
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the political and ethical issues with crime and deviance statistics?

A

Police are collecting the data - this is difficult, time consuming and they are under pressure to ‘cook the books’.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What type of statistic are police stats?

A

In 2014 they were downgraded from a ‘national statistic’ to an ‘official statistic’. These stats are increasingly seen as ‘unfit for purpose’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is big data?

A

Usually refers to extremely large sources of data that are not immediately amenable to conventional ways of handling them, often focusing on social media but also consumer behaviour.

  • considered non-reactive and with potential that is not yet utilised
  • there are data, process and management challenges
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the issues with data designs and processes being hidden from data users? (particularly in official forms of data)

A

We must ask who or what is excluded from data sources, national surveys often have incomplete population coverage due to the systematic exclusion of certain ‘non-households’

  • this is important as the excluded groups are often the people who are already marginalised within society (by policy design)
  • some countries may choose to purposefully exclude marginalised groups for political reasons
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How can ‘non-household’ groups be included in data?

A
  1. Bespoke surveying, implies high costs and gatekeeper access
  2. ‘piggyback’ onto existing data collection infrastructure by adding new questions
  3. Combining multiple data sources
  4. Using action based sampling in which the focus is surveying people when they engage with key services
  5. Retrospective data collection, where people who have previously been classified as a non-household are surveyed as if they were still part of that non-household population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

When we have created data, how can we ensure we analyse it responsibly?

A

The overall aim is to be true to original data by using appropriate analysis techniques - not manipulating the data or misrepresenting peoples views

How well did you know this?
1
Not at all
2
3
4
5
Perfectly