2 - Innovating with Data Flashcards
Data is any [what] that is useful to an organisation?
Such as?
Information.
Such as; Spreadsheets, emails, audio, images, ideas
What are 2 challenges of legacy systems when it comes to data?
- processing volumes and varieties of new data (batch or realtime)
- Finding cost effective solution for setting up and maintaining data centres
- scaling resource capacity up or down
- accessing historical data
- driving insights from old and new data
How did budget airlines transform by unlocking the power of data?
- Using past data to predict how many meals will be purchased on certain flights to ensure no wastage or customer dissatisfaction
- Used destination, time of day, flight connections before and after
- this uncovered actionable insights to predict accurate amount of meals.
Data Mapping for retail. The below makes up a [what] data bucket?
- Transactions data set, Item returns data set, Footfall data set = [what] Data Bucket?
- Staffing levels data sets, delivery (stock Delivery dates), sales performance, staff structure = [what] data bucket?
User
Corporate
What is structured and unstructured data? Examples for both
Structured
- highly organised (customer info with names, address etc) easily stored and managed in databases
Unstructured
- no organisation (word processing docs, audio files, videos, images)
Example of a used car dealership requiring and using both structured and unstructured data?
Used car dealership built an ML model to predict the price of a new car coming in
photo of the car (unstructured)
the pricing of previous similar cars (structured)
Used this combined data to predict the price.
Time to value a car dropped from 20mins - 3mins
The key benefits of using cloud technology to unlock value from data, especially for traditional Enterprises?
- Businesses can process [how much data?] of data in real-time
- Businesses can query their data and [get what] instantly.
terabytes
retrieve results
To get the most value out of data, you need what 3 things?
- to know what you have
- to find it easily
- to use it while keeping it secure
What are the 3 key terms around data storage?
Databases
Data Warehouses
Data Lakes
What are googles 2 fully managed DB options? What kind of data does a DB store
CloudSQL
Cloud Spanner
Transactional Data
What is Googles Data Warehouse option? What is it good for?
BigQuery
Assembles data from multiple sources to make it useful for analysis.
Can transform unstructured to semi-structured and use this with structured data for analysis
Enables rapid analysis of multi-dimensional datasets
What data does a data lake generally hold? What is googles solution?
back-up data
Cloud Storage
Match the below in regards to the Cloud Storage Classes:
nearline - best for data accessed X per month,
coldline - accessed once per X days
Archive classes - once per X
- nearline - best for data accessed once per month,
- coldline - accessed once per 90 days
- archive classes - once per year
What is GCP’s Business intelligence solution? What does it do?
Looker
A data platform that sits on top of an analytics database and makes it simple to describe your data and define business metrics
With a reliable source of truth for business data, anyone can analyse and explore it and share insights with a simple link
What is ML?
Most dashboards you use as a company probably use backward-looking data (reports etc) to look at whats happened in the past
To create value in your business you need to use that backward data to do what?
Establish trends with backward data and use ML to predict insights to help with future decisions