L2 Where data comes from Flashcards
what does one zettabyte = …
a trillion gigabytes
in 2024, how much data was there (in zettabytes)?
147
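worked check of the zettabyte arithmetic above: a minimal sketch, assuming decimal (SI) prefixes, so 1 GB = 10^9 bytes and 1 ZB = 10^21 bytes. The function name is made up for illustration.

```python
# Assumption: decimal (SI) prefixes, not binary ones.
BYTES_PER_GB = 10**9   # 1 gigabyte
BYTES_PER_ZB = 10**21  # 1 zettabyte

def zettabytes_to_gigabytes(zettabytes):
    """Convert a whole number of zettabytes to gigabytes."""
    return zettabytes * BYTES_PER_ZB // BYTES_PER_GB

# One zettabyte is a trillion (10**12) gigabytes.
one_zb_in_gb = zettabytes_to_gigabytes(1)

# The 2024 figure from the card above, expressed in gigabytes.
data_2024_in_gb = zettabytes_to_gigabytes(147)
```

so 147 zettabytes is 147 trillion gigabytes.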
where does all of this data now come from?
electronics (phones / wearables / home electronics)
what is estimated to be the case by 2025?
that 80% of global data will be unstructured
what is meant by unstructured data?
data that is not stored in fixed, known locations (no set fields or format), e.g. free text, images, video - it can be anywhere
what are surveys good at collecting?
large amounts of data
what kind of questions do surveys typically have?
standardised questions
who are surveys typically given to?
a sample of the population
why are surveys not usually given to more people?
time-consuming + expensive
more data = an increase in…
…data scepticism
more data = more space for…
…misreporting and misrepresenting
what in recent years has increased the amount of data produced?
online tools and surveys
more bad data is being produced now because of what?
poor-quality surveys
what are fewer people also doing now, and what does this lead to?
answering surveys - which causes issues for the data produced
what happens every time we google something on our phones?
data is produced
what percentage of British people said they believe television newsreaders tell the truth?
52%
…… of British people trust journalists to tell the truth
28% (just over 1/4)
less than …… of British people think that politicians and government ministers generally tell the truth
20%
who are widely regarded as people who do tell the truth?
scientists and professors
where does the UK rank for trust in its media?
very low
which media are trusted the least / the most?
least = social media
most = radio
what is administrative data originally collected for?
keeping records (by governmental departments and agencies)
administrative data covers…
…entire populations of registered people
what is an example of a record that is administrative data?
health / tax / benefits / car reg / work permits
administrative data is not …… available
publicly
why is administrative data not publicly available?
as much of it is very personal information
what is administrative data often used for when helping with surveys?
as a sampling frame to get a sample
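how a sampling frame gives a sample: a minimal sketch, assuming simple random sampling without replacement. The frame here is a made-up list of IDs standing in for an administrative register; the function name, the IDs and the sample size are all hypothetical.

```python
import random

def draw_simple_random_sample(sampling_frame, sample_size, seed=None):
    """Pick sample_size distinct units from the frame, each equally likely."""
    rng = random.Random(seed)  # a seed makes the draw repeatable
    return rng.sample(sampling_frame, sample_size)

# Hypothetical frame: 1,000 registered people (real registers are not public).
frame = [f"person_{i:04d}" for i in range(1, 1001)]

# Survey 50 of them instead of all 1,000.
sample = draw_simple_random_sample(frame, 50, seed=42)
```

the frame lists everyone who could be picked; the survey is then only sent to the 50 people drawn from it.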
will administrative data be the same as data from surveys?
no
administrative data is of high quality - true or false + why?
true - as the government collected it
what is in place to protect people's data?
strict security protocols