Data Architecture Flashcards
What is data architecture?
The approach an organisation takes to managing its data - from collection, through transformation, storage and use
What are the key components of data architecture?
Data sources: where the data comes from
Data storage: how the data is stored
Data processing: how the data is transformed or manipulated
Data access: how users interact with the data
Name some of the key factors which affect DArch?
Scale
Geographic distribution
Data sensitivity
What are the benefits of well designed DArch?
Reduce redundancy (<redundancy, easier to store and manage (normalisation))
Improve data quality (avoid data swamps through good data governance)
Enable integration (should facilitate data integration eg. via APIs. Enable systems to communicate with each other and integrate with new sources coming in/out to new external sources)
Lifecycle Mgmt & Security (adapt as data ages, enable flexible data security)
Name the four data mgmt systems
Database
Data warehouse
Data mart
Data lake
What can the cloud offer other than data storage
It can also comput, network and provide software
From a DArch perspective, what are the advantages of hosting in the cloud
Fast set up
Scalable
No upfront costs
No hardware to manage and maintain
Lower staffing costs
Hardware updates and software patches are taken care of
What are the advantages of hosting on premise?
May already have the hardware and staff
Usage is predictable
Very high control over costs (vs. being put on the wrong cloud tier where costs can rocket very quickly)
Very high control over security (eg. gov orgs)
Outline the cloud deployment models
Public cloud
Private cloud
Hybrid cloud
Multi cloud
Community cloud
Describe public clouds
- Internet accessible
- owned & managed by cloud service provider
- Quick & easy to set up
- handing over security
- no physical access to data centre & hardware
Describe private clouds
- Accessible over private connection
- Up-front costs required
- Direct control
- Requires mgmt of data, hardware & software
Describe hybrid clouds
Combination of public and private. Potentially using private to store sensitive information or run certain applications
Describe multi cloud
Use a variety of different public providers to spread cost & risk
Describe community clouds
Part-way between public and private, but sharing the cloud within similar sectors or with similar aims (eg. banks)
Name the different types of cloud service models
IaaS Infrastructure as a Service
PaaS Platform as a Service
SaaS Software as a Service