Unit 4: Data Resource Management Flashcards
Database
set or collection of data, organized in tables and easily searchable
DBMS
Database Management System:
software program that organizes, manages, and accesses data. Enables users to create, manage, retrieve, and update data in a database. (ex. healthcare system data)
Database server
powerful computer that stores the databases and runs the DBMS used to access and administer them
Relational Databases
most common type; based on the relational model of data, with data stored in tables of rows and columns and relationships defined between the tables
Object databases
used when multimedia or computer-aided design data are required; such data are not easily organized into columns and rows.
hierarchical database
early database style, inflexible data relationship, treelike structure
Database administrator
- person responsible for the creation, management and integrity of the database
- monitors performance
- maps out the conceptual design for a planned database
Data administrator
person responsible for the access, privacy, security, and retention of data within the database.
entity
a data component (such as a person, place, thing, or event) about which data are stored; entities have various properties and relate to each other in various ways
field
attribute of an entity (name, address, dollar amount); for example, one cell in a table
record
single row of related fields, one field per column; for example, a customer record.
table
collection of related records
primary key
field that uniquely identifies a record (ex. Social Security number)
foreign key
field in a relational table that refers to the primary key of another table
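The primary key and foreign key cards above can be tried out with a small sketch using Python's built-in sqlite3 module. The adviser and student tables, their columns, and the sample names are assumptions made up for illustration; they are not part of the flashcard set.

```python
import sqlite3

# In-memory database; table and column names are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite checks foreign keys only when enabled

conn.execute("""
    CREATE TABLE adviser (
        adviser_id INTEGER PRIMARY KEY,   -- primary key: uniquely identifies each record
        name       TEXT NOT NULL
    )
""")
conn.execute("""
    CREATE TABLE student (
        student_id INTEGER PRIMARY KEY,
        name       TEXT NOT NULL,
        adviser_id INTEGER NOT NULL REFERENCES adviser(adviser_id)  -- foreign key
    )
""")

conn.execute("INSERT INTO adviser VALUES (1, 'Dr. Lee')")
conn.execute("INSERT INTO student VALUES (100, 'Ana', 1)")

# A student that points at a nonexistent adviser is rejected by the DBMS.
try:
    conn.execute("INSERT INTO student VALUES (101, 'Ben', 99)")
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

# Follow the foreign key back to the adviser's record with a join.
rows = conn.execute("""
    SELECT student.name, adviser.name
    FROM student JOIN adviser ON student.adviser_id = adviser.adviser_id
""").fetchall()
```

Declaring adviser_id as NOT NULL in the student table is one way to model a mandatory "one and only one" relationship: every student record must point at exactly one adviser record.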
schema
the complete definition of the database, including fields, relationships, etc.; the blueprint for how the database is constructed
zero to many
optional
0- advisor and instructor
one or more
mandatory
I- student has one or more courses
One and only one
Mandatory
II- student has one adviser, course has one instructor
flowcharting
used during the design phase to avoid inconsistencies in the database
normalization
- a database must go through normalization during the design process to reduce complexity
- removes redundant or unnecessary duplication
- correctly defines keys, tables, columns and relationships
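As a rough illustration of the normalization points above, the sketch below splits one redundant table into two related tables. The order and customer data are hypothetical, and plain Python lists and dictionaries stand in for real database tables.

```python
# Hypothetical unnormalized order rows: the customer's name and city repeat on every order.
flat_orders = [
    {"order_id": 1, "customer_id": 10, "customer_name": "Ana", "city": "Austin", "amount": 25.0},
    {"order_id": 2, "customer_id": 10, "customer_name": "Ana", "city": "Austin", "amount": 40.0},
    {"order_id": 3, "customer_id": 11, "customer_name": "Ben", "city": "Boston", "amount": 15.0},
]

# Customers table: one record per customer, keyed by customer_id (the primary key).
customers = {}
for row in flat_orders:
    customers[row["customer_id"]] = {"name": row["customer_name"], "city": row["city"]}

# Orders table: keeps only the order facts plus customer_id as a foreign key.
orders = [
    {"order_id": r["order_id"], "customer_id": r["customer_id"], "amount": r["amount"]}
    for r in flat_orders
]
```

After the split, a customer's name and city are stored once, so a change of address is made in one place instead of on every order row.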
Data Migration
- extract
- transform
- load
- validate
ETL
Extract, transform, load.
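The extract, transform, load, and validate steps can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the CSV text, the field names, and the dictionary used as the target store are all hypothetical, and a real migration would read from an operational system and write to a database.

```python
import csv
import io

# Hypothetical source export; in practice this would come from an operational system.
SOURCE_CSV = """id,name,amount
1, ana ,10.50
2,BEN,7
3,cy,not-a-number
"""

def extract(text):
    """Extract: read raw rows from the source."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Transform: standardize formats and set aside rows that cannot be converted."""
    clean, rejected = [], []
    for row in rows:
        try:
            clean.append({
                "id": int(row["id"]),
                "name": row["name"].strip().title(),
                "amount": float(row["amount"]),
            })
        except ValueError:
            rejected.append(row)
    return clean, rejected

def load(rows, target):
    """Load: write the cleaned rows into the target store, keyed by id."""
    for row in rows:
        target[row["id"]] = row

def validate(source_rows, target, rejected):
    """Validate: every source row is either loaded or accounted for as rejected."""
    return len(target) + len(rejected) == len(source_rows)

warehouse = {}
raw = extract(SOURCE_CSV)
clean, rejected = transform(raw)
load(clean, warehouse)
ok = validate(raw, warehouse, rejected)
```

Keeping the rejected rows instead of silently dropping them is a design choice that lets the validate step reconcile row counts between the source and the target.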
data warehouse
data vault: static data (same all day)
dynamic data
operational systems (marketing, sales, ERP, SCM, etc.); changes all day
data warehouse and data marts
- both are used for analytical purposes.
- a data mart is a subset of the data warehouse that holds information for a specific area. ex: finance.
data governance
policies and procedures that define the data governance program, such as who has access and who has update capabilities
1. Recorded
how data are recorded may be critical to the operation of a business. Inputting data consistently, using the same descriptions, standards, units of measurement, and formats, can mean the difference between success and failure
2. Stored
where data are stored can have a lasting effect on an organization's success. Data stored on systems not controlled by the organization, or stored in a way that gives inappropriate users access, may contribute to heightened risk of loss or theft
3. Retrieved
how data are retrieved and who can retrieve them are questions central to the design of a database
4. Retained and destroyed
Even when data are no longer being used, there may be laws or policies that govern how long they should be kept
Data mining
- query that mines for information
- discovers patterns in large data sets such as a data warehouse.
- used in business strategies
OLAP
Online analytical processing
- is software for performing multidimensional analysis at high speeds on large volumes of data from a data warehouse, data mart, or some other unified, centralized data store.
- contains data that are used for making decisions at many levels
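Multidimensional analysis can be pictured as summing one measure along different combinations of dimensions. The sketch below uses hypothetical sales facts and two helper functions, roll_up and slice_, written only for this illustration; real OLAP software performs the same kind of aggregation at much larger scale.

```python
from collections import defaultdict

# Hypothetical sales facts: three dimensions (year, region, product) and one measure.
facts = [
    ("2023", "East", "Widget", 100),
    ("2023", "West", "Widget", 150),
    ("2023", "East", "Gadget", 80),
    ("2024", "East", "Widget", 120),
    ("2024", "West", "Gadget", 60),
]
DIMENSIONS = ("year", "region", "product")

def roll_up(rows, by):
    """Sum the measure grouped by the chosen dimensions (a roll-up)."""
    idx = [DIMENSIONS.index(d) for d in by]
    totals = defaultdict(int)
    for row in rows:
        totals[tuple(row[i] for i in idx)] += row[-1]
    return dict(totals)

def slice_(rows, dimension, value):
    """Keep only the rows where one dimension has a fixed value (a slice)."""
    i = DIMENSIONS.index(dimension)
    return [row for row in rows if row[i] == value]

by_year = roll_up(facts, ["year"])                      # coarse level
by_year_region = roll_up(facts, ["year", "region"])     # more detailed level
east_by_product = roll_up(slice_(facts, "region", "East"), ["product"])
```

Moving from by_year to by_year_region is a drill-down to a finer level of detail, and going the other way is a roll-up.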
OLTP
Online transaction processing
- ex: ATM transactions
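The ATM example can be sketched as a single database transaction that either completes fully or is undone. The account table, the withdraw function, and the balances are assumptions for illustration, using Python's built-in sqlite3 module.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE account (id INTEGER PRIMARY KEY, balance INTEGER NOT NULL)")
conn.execute("INSERT INTO account VALUES (1, 100)")
conn.commit()

def withdraw(conn, account_id, amount):
    """Withdraw cash as one transaction: it fully succeeds or leaves no trace."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute(
                "UPDATE account SET balance = balance - ? WHERE id = ?",
                (amount, account_id),
            )
            (balance,) = conn.execute(
                "SELECT balance FROM account WHERE id = ?", (account_id,)
            ).fetchone()
            if balance < 0:
                raise ValueError("insufficient funds")
        return True
    except ValueError:
        return False

first = withdraw(conn, 1, 60)    # enough money: committed
second = withdraw(conn, 1, 60)   # would overdraw: rolled back
final_balance = conn.execute("SELECT balance FROM account WHERE id = 1").fetchone()[0]
```

OLTP systems handle many short transactions like this one, while OLAP systems read large volumes of data for analysis.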
Big Data
- unstructured
- Variety: texts, emails, web mining.
- Volume: great amount of transaction data
- Velocity: batch and streaming data (for marketers)
cloud database advantage:
- more flexible
- available and scalable
- data can be stored in multiple locations
cloud database disadvantage:
- more expensive to firms
- because web-based databases are connected to the internet, security is a significant issue.
Software as a service SaaS:
sometimes referred to as on-demand software; the software is licensed on a subscription basis and is used online rather than downloaded.
ex: Dropbox
Platform as a Service PaaS:
hardware and software that allow customers to develop, run, create, and customize software without writing extensive code; easy to run, so customers can focus on the creative side. ex: Amazon
Infrastructure as a Service IaaS
provides storage, networking, and virtualization, and handles low-level details of the underlying network infrastructure such as physical computing resources, location, and backup. Ex: data center
Hypervisor
creates and runs virtual machines. A computer on which a hypervisor runs one or more virtual machines is called a host machine, and each virtual machine is called a guest machine
CTO - Chief Technology Officer
- responsible for implementing technology strategies
- responsible for identifying technology risks in the company