Chapter 6 – Data Design Flashcards
Database Management System (DBMS)
• Collection of tools, features and interfaces that enables users to: - Add - Update - Manage - Access - Analyze ~ The contents of a set data • Interface between a: - Database - Users ~ Who need to access the data • Components: - Interface for: ~ Users ~ Database administrators ~ Related systems - Data manipulation language - Schema - Physical data repository
Schema
• The complete definition of a database including descriptions of all:
- Fields - Tables - Relationships
Subschema
- Can be defined one or more
- View of the database used by one or more systems or users
- Defines only those portions of the database that a particular system or user needs is allowed to access
Physical Data Repository
• Data dictionary is transformed into a:
- Physical data repository also contains the schema and subschemas
• Physical repository:
- Centralized or distributed at several locations
• Database standards:
- Open database connectivity (ODBC)
- Java database connectivity (JDBC)
Characteristics of Web-Based Design
• Global access - The internet enables worldwide access using existing infrastructure • Ease of use - Web browsers provides a familiar interface that is: ~ User-friendly ~ Easily learned • Multiple platforms • Cost effectiveness • Security issues • Adaptability issues
Internet Terminology
- Intranet
- Extranet
- Clients
- Servers
- Protocols
Connecting a Database to the Web
• Database must be connected to the internet or intranet
- Middleware
- Example:
~ Adobe ColdFusion
Data Security
• Web-based data:
- Totally secure - Easily accessible to authorized users
Data Design Terminology (*)
• Definitions - Entity - Table or file - Field or Attribute ~ Single characteristic about an entity - Record or Tuple
Key Fields
- Primary
- Candidate
- Foreign
- Secondary keys
Primary key
Uniquely identifies a particular member of an entity • Combination key • Composite • Key • Concatenated key • Multi-valued key
Candidate key
• Could serve as the primary key
- Nonkey field
~ Not a primary key and not a candidate key
Foreign key
- Primary key in one table
* Foreign key in another table
Secondary key
Can be used to access or retrieve records
Referential Integrity
• Validity checks:
- Can help avoid data input errors
• In a relational database:
- Means that a foreign key value cannot be entered in one table unless it matches an existing primary key in another table
• Example:
- Preventing you from entering a customer order in an order table unless that customer already exists in the customer table
• Orphan
Entity-Relationship Diagrams (ERD)
• An entity is a person, place, thing or event
- For which data is collected and maintained
• ERD:
- Provides an overall view of the system
- A blue print for creating the physical data structure
Normalization
• A process of creating table designs by: - Assigning specific fields or attributes • Table design • Involves 4 stages: - Unnormalized design (UNF) - First normal form (1NF) - Second normal form (2NF) - Third normal form (3NF) • Standard Notation Format
Database Models
• Relational Databases
- New entities and attributes can be added at any time
~ Without restructuring the entire database
• Object-Oriented Databases
- Natural extension of the object-oriented analysis process
- Object Database Management Group (ODMG)
- Each object has a unique object identifier
Data Storage and Access
- Involves strategic business tools
- Strategic tools for data storage and access
- Logical and Physical Storage (*)
- Data Storage Formats
Data warehouse - Dimensions of characteristics (*)
• Integrated collection of data that can support:
- Management analysis - Decision making
Data mart (*)
• Serve the needs of a specific department
• Each data mart includes only:
- The data that users in that department require to perform their jobs
Data mining (*)
• Looks for:
- Meaningful data patterns and relationships
• Help companies identify:
- Potential customers based on their prior purchases
• Focuses on:
- Methods of obtaining valuable business knowledge from a data warehouse
Goal of data mining
• Increase: - Average pages viewed per session - Checkouts per visit - Number of referred customers - Average profit per checkout • Reduce: - Clicks to close • Clickstream storage – market basket analysis
Logical and Physical Storage (*)
• Logical storage - Data that a user can: ~ View ~ Understand ~ Access \+ Characters \+ Date element / Data item \+ Logical record • Physical storage - Hardware-related ~ Physical record / block ~ Buffer ~ Blocking factor
Data Storage Formats
• Binary digits - Binary storage format - Integer format - Long integer ~ Exists for efficient storage of exceedingly long numbers • Bit • Byte • EBCDIC & ASCII • Unicode