Week 4 Quiz PREP Flashcards
Tell me what are the 3 layers of data requirements??
1) OrganisatiOnal
2) OperatiOnal
3) Technical
What is a stakeholder BARRIER to obtaining requirements?
- LIMITED participation (stakeholders too busy for focus groups/workshops -> need senior managers for incentivising/prioritising participation)
What is metadata mannn
dAtA about DaTa // data dicTIONARY, e.g.
- datatype & size
- which apps produce & use the data
- what business rules are imposed on it
- has it been transformed from source system
- last time it was loaded into data warehouse
What is MASTER data?
“REAL” data like LiSt of customers/products (names, contact deets, job, age, demographic info / IDs, dimensions, colour, cost, country of origin etc.
WHEN is metadata created?
whenever we WORK with data (build, store, clean, share, use data)
What is a conflict BARRIER to obtaining requirements?
- Politics that prevent consensus (department conflict with each other, hard to reach agreement of shared objectives for analytics systems)
What is a system BARRIER to obtaining requirements?
New analytics system = employees being unsure what data/analysis they can ask for or may receive
= resist using new system
Tell me what are the 3 DIMENSIONS of data requirements??
1) Functional
2) Corporate Governance
3) MeasureMent
Can you list the types of data there are?
- unstructured
- transactional (data related to (non)monetary interactions)
- metadata
- hierarchal (stores relationships between other data)
- master (four groupings: people, things, places, concepts - sub-categorised: subject areas, domain areas, entity types)
Why, oh why, would an organisation use EXTERNAL memory instead of INTERNAL memory?
ex extends in memory
steps in ETL process, pls
- extract (pull data from existing systems, ongoing process, time consuming/complex)
- transform (standardise data - make it consistent/change formats)
- load (archival - historical data loaded once, current - data from current systems, ongoing - continual revision with operations)
What’s a data mart yo
- local SINGLE-subject database used to create a smALLER database than data warehouse for specific needs of group of analysts
- often used in tiered/federated data warehouse
role of people in a DATA MANAGEMENT system??
- create
- maintain
- evolve
- use
organisational memory
what are some desirable ATTRIBUTES of data in orgs.
shareable transportable secure accurate timely relevant
what does SOP stand for
Standard Operating Procedures - ways to perform ROUTINE activities
probs with data management systems
redundancy (same data in many systems) lack of data control (poorly managed) poor interface delays lack of reality lack of data integration (multiple data formats)
What is DATA?$?$?
raw, unsummarised, unanalysed facts
What is INFORMATION??
data processed into MEANINGful form
What is KnOwLEdGe??
capacity to use information
DATA WAREHOUSE??
repository (?) of organisational data (in one collection), measured in petabytes
What’s the process of managing the data warehouse my dude
ETCLSM
extraction, transformation, cleaning, loading (archival, current, ongoing), scheduling, metadata
types of data warehouses?
centralised, federated, tiered
examples of functional dimension
strategy, finance, hr, marketing, sales etc.
examples of governance dimension
policy, compliance, risk, audit
examples of measurement dimension
measures, metrics, indicators, indices, references, trends
tell meee why metadata is important
- gives context to data in warehouse
- corrects errors/misunderstandings
- allows changes to be traced
three types of metadata??
1) business
2) technical
3) process
what’s the master data life cycle ??
CRUD
created, read, updated, deleted, searched
what is MDM
tools and processes to help - gather - identify - classify relevant data to be used as MASTER data throughout an entire organisation
challenges of mdm??
1) is it valuable (provide timely, accurate, consistent info)
2) effort+structure = complicated+costly
3) good project management skills to overcome HESITATION at effort needed
how has the move to CLOUD COMPUTING affected data management
- SECURITY (physical, access control, data transport encryption, firewalls, storage encryption)
- SLAs, QoS (quality of service), national laws
what is a database?
highly STRUCTURED data file that allows data input, retrieval, organisation
flat files??
- two dimensional tables usually stored as DELIMITED text files (csv, tsv)
hierarchical databases
parent-child relationships between tables (each child can only have one parent)
relational databases
no limit on number of parent/child connections
what is blob
binary large OBject
- data format for storing images, pdfs, audio or other multimedia objects, executable code.