Article: Big Data Flashcards
Which issues are related to Big Data? Give examples.
- privacy and security risks (e.g. data sent to third-parties, data not securily stored)
- Not enough regulations
Why are customers in a disadvantaged position when it comes to firms using BD to analyze the customer´s preferences?
- They lack the knowledge to understand the processes
List the 5 dimensions of Big Data
- Volume, Velocity, Variety, Variability, Complexity
Describe the dimension ‘Volume’. What does it refer to? Why is it a appealing target for hackers? What difficulties exist when it comes to high data volumes?
- Storage of a high volume of data.
- Hackers try to get to all this data (e.g. in a cloud environment)
- Storage has to be outsourced
- Concern: How to store securily (false data injections, data manipulation)
What does predictive privacy harm mean?
- Customers are scared/shocked of/by highly customized offerings
What can be a problem of high variety of data?
- Firms can uncover hidden connections between seemingly unrelated pieces of data
- Makes it more difficult to detect security breaches and to react to them
- Most organizations ‘struggle’ to manage unstructured data which can contain more sensitive data
Why is the complexity of data so problematic?
- Data comes from partly not identifiable sources and there is no process to request the consent of a person for the resulting data
- Anonymization of data is nearly impossible
How does 5G impact BD?
- Faster (near real-time and higher amounts of data)
- Hackers can download data much faster
- Amplified technical impacts
What is edge computing?
- Moving storage and networking functions closer to where the data is generated rather than to more central locations
- Edges can have lower level of security as the central locations
Describe the dimension ‘Velocity’.
- Speed is more important than volume (time-sensitive data)
- Concerns around real-time profiling and tracking technologies
What does data variety mean?
- Data comes in multiple formats, e.g. structured (numeric), unstructured (text)
What does data variability mean?
- Data flows vary with periodic peaks related to holidays, trends, …
- Companies struggle to handle data flows
- More attractive for hackers during peak season
Make an example of a company that has made effective use of BD in MR -> explain in use of dimensions
- E.g. Netflix, Youtube, Google, Instagram