L2 Where data comes from Flashcards

1
Q

what does one zettabyte = ….

A

a trillion gigabytes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

2024 there was how many volumes of data in zettabytes?

A

147

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

where does all of this data now come from?

A

electronics (phones / wearables / home electronics)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

by 2025: what is estimated?

A

that 80% of global data will be unstructured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is meant by unstructured data?

A

it wouldn’t be sorted in fixed known locations - data will be everywhere

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what are surveys good at collecting?

A

large amounts of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what kind of questions do surveys typically have?

A

standardised questions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

who are surveys typically asked to?

A

a sample of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

why are surveys often not asked to more people?

A

time consuming + expensive

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

more data = an increase in…

A

…data scepticism

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

more data = more space for…

A

…misreporting and misrepresenting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

what in recent years has increased the amount of data produced?

A

online tools and surveys

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

more bad data is being produced now because of what?

A

poor quality bad surveys

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what are less people also doing now and what does this lead to?

A

answering surveys - which causes issues for the data produced

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what happens every time we google something on our phones?

A

data is produced

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what percentage of the British people said that they believed television news readers to say the truth?

A

52%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

……. of British people trust journalists to tell the truth

A

28% (just over 1/4)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Less than …… of British people think that politicians and gov. ministers generally tell the truth

A

20%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

who is widely regarded as individuals who do tell the truth?

A

scientists and professors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

where is the UK ranked in trusting for our media?

A

very low

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

what media is trusted the least / the most

A

least = social media
most = radio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

what is administrative data originally used for?

A

keeping records (by governmental departments and agencies)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

administrative data covers…

A

…entire populations of registered people

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

what is an example of a record that is administrative data?

A

health / tax / benefits / car reg / work permits

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

administrative data is not………..avaliable

A

publicly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

why is administrative data not publicly available?

A

as much of it is very personal information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

what is administrative data often used for when helping with surveys?

A

as a sampling frame to get a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

will administrative data be the same as other surveys done?

A

no

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

administrative data is of high quality - true or false + why?

A

true - as the government collected it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

what is there when it comes to protecting peoples data?

A

strict security protocols

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
31
Q

how is big data generated?

A

digitally (through online and transactional data)

32
Q

big data is in huge………..and high………..

A

volume
velocity

33
Q

what are examples of different sources of big data?

A

clicks
shares
purchases

34
Q

is big data open access data? and why?

A

no - as it’s not collected for research purposes and often used commercially

35
Q

as humans we create d… t…..

A

data trails

36
Q

google trends example =

A

spike in “unemployment” google searches over the pandemic

37
Q

who was William Petty?

A

17th century demographer

38
Q

17th century demographer =

A

William Petty

39
Q

what did William Petty do? (3 parts)

A
  • began surveying
  • collecting numbers on people living in London
  • surveying GDP
40
Q

what does census in latin mean?

A

to estimate

41
Q

is census is essentially a big…

A

…survey

42
Q

how many people are in a census? + 2 examples

A

all people from a population under study
- i.e. all in country / all in education

43
Q

how often does a census happen in the UK? + why?

A

every 10 years
- as they are expensive and time consuming

44
Q

when did a census last not happen in the UK?

A

during peak of WW2

45
Q

when was the most recent UK census

A

2021

46
Q

when were census’ first conducted?

A

in BC / early AD

47
Q

who can census’ now be answered?

A

one household member for the whole house

48
Q

primary data =

A

data collected directly by researchers for a very specific purpose and used for that purpose

49
Q

example of creating primary data =

A

conducting a survey + using results in report for a dissertation

50
Q

primary data is directly for what?

A

ones own use

51
Q

pros of primary data =

A
  • up to date / current
  • specific to the question
  • researcher has full control (can pick the q’s asked)
52
Q

cons of primary data =

A
  • time consuming
  • sometimes impossible
  • can be expensive
53
Q

what happens if conducting primary data is impossible?

A

have to rely on and use secondary data instead

54
Q

secondary data =

A

data that has been previously by someone else for a different purpose - but is available for others to use

55
Q

example of using secondary data =

A

viewing others surveys and data collected + analysing it to answer Q’s for a report / coursework

56
Q

secondary data is the number 1 source of data in the UK - true or false?

A

true

57
Q

pros of secondary data =

A
  • affordable
  • easily accessible
  • longitudinal studies are possible
58
Q

what does the pro of longitudinal studies (in 2ndary data) mean?

A

can compare old data to more recent
- see how it has developed and changed over time

59
Q

cons of secondary data =

A
  • can be outdated
  • not specific to your question
  • can be time consuming to begin with
60
Q

why might secondary data sometimes be time consuming?

A

having to find specific data that relates to what you want

61
Q

how to tell good from bad sources of data 1 =

A

sources - who produced the data?

62
Q

how to tell good from bad sources of data 2 =

A

purpose - why was it produced?

63
Q

how to tell good from bad sources of data 3 =

A

time - when was it produced?

64
Q

we can trust what data sources?

A
  • public institutions (ONS)
  • respected research companies
65
Q

we should be sceptical of what data sources?

A
  • unknown institutions
  • sources with dubious reputations
66
Q

what data sources can we NEVER trust?

A
  • gov. sources known to use ‘fake news’ to influence opinions
  • info published by satirical newspapers
67
Q

lots of data has………… ………

A

underlying motives

68
Q

example of satirical news =

A

2012: Onion new paper said rural whites prefer the president of Iran to Obama - which some believed

69
Q

Trump running twitter polls =

A

2016: where he had underlying motives + leading questions = example of push polling

70
Q

are push polls real surveys?

A

no

71
Q

what is push polling aiming to do?

A

sway and influence voters

72
Q

when is push polling most often used?

A

during political campaigns

73
Q

surveys should never promote what ideas nor try to…

A

propaganda ideas / change the mind of respondents

74
Q

leading questions =

A

leads respondents to answer in a certain way (unbalanced)

75
Q

loaded questions =

A

forces respondents to answer in a way they might not agree (loaded with assumption)

76
Q

it is important to always consider …….. data was collected

A

…when…

77
Q

a good first critical question when looking at data is…

A

…where did the data come from?