Slide 2 Flashcards

1
Q

Database Fundamentals

A

 Database Fundamentals & Advantages
 Relational Databases
 Database Management Systems (DBMS)
 Practice designing databases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Understanding Data as the Backbone of Information

A

 Data Quality
 Granularity, redundancy
 Data integrity constraints
 Access controls

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The data hierarchy

A

Database - file- record-field-byte-bit

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Database

A

Collection of data organized to serve many applications by centralizing data and controlling redundant data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Database management system (DBMS) - software through which users and application programs interact with a database (e.g. Microsoft Access)

A

 Software that manages one or more databases  Interfaces between application programs and physical data files  Separates logical and physical views of data ( How a data looks to the user (logical)  How data is actually stored (physical) )
 Enables organization to centralize data management and security

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Files are maintained separately by different departments (or apps) - if every app manages its own data

A

 Data redundancy: Presence of duplicate data in multiple files
 Data inconsistency: Same attribute has different values in different applications
 Lack of flexibility
 Poor security
 Lack of data sharing and availability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Database examples:

A

 Web applications like Wordpress use a database (typically MySQL)
 Search engines use a (massive) database to store pages and indexes
 iPhone Apps can use the SQLite database to store and manage their data
 Enterprise Applications used by accounting, marketing, and operations use databases to centralize and share information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Database fundamentals

A

 Database – maintains information about various types of objects (inventory), events (transactions), people (employees), and places (warehouses)

 Relational database (dominant model) – stores information in the form of logically related twodimensional tables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Table
Record(row)
Field(columns)

A

 Table – a collection of similar records

 Record (row) – a person, place, thing, transaction, or event about which information is stored

 Fields (columns) – characteristics or properties of an record  The columns in each table contain the fields

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Database Structure

A
  • Represent data as two-dimensional tables called relations or files
  • Each table contains data on entity and attributes
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Primary key

A

Field in table used for key fields

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Foreign key

A

Primary key used in second table as look-up field to identify records from original table
• (When a field is used in one table to create a relationship, but it is NOT a primary key, it is then called a foreign key)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Row (tuples)

A

records for different entities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

fields (columns)

A

Represents attribute for entity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

The big three in database

A

table, queries, reports

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Parts of table

A

 Column  aka Field, Attribute, or Property  Row  aka Record or Tuple
 Primary Key  Foreign Keys

17
Q

Relationships

A

 Use Foreign Key to link

One-to-One, One-to-Many  (Many-to-Many)

18
Q

Database advantages from a business perspective includ

A

 Increased flexibility
 Increased scalability (Reduce and Increase)and performance
 Reduced information redundancy (Repeating information)
 Increased information integrity (quality)
 Increased information security

19
Q

Information granularity尺度

A

refers to the extent of detail within the information

20
Q

Factors affecting granularity

A

 Level of abstraction
(fine or detailed, coarse or summarized)
 Frequency of gathering data items

21
Q

Characteristics of high quality Information:

A
 Accuracy 
 Completeness 
 Consistency 
 Uniqueness 
 Timeliness
22
Q

The four primary sources of low quality information include

A
  1. Online customers intentionally enter inaccurate information to protect their privacy 2. Information from different systems have different entry standards and formats 3. Call center operators enter abbreviated or erroneous information by accident or to save time 4. Third party and external information contains inconsistencies, inaccuracies, and errors
23
Q

Potential business effects resulting from low quality information include:

A

 Inability to accurately track customers
 Difficulty identifying valuable customers
 Inability to identify selling opportunities
 Marketing to nonexistent customers
 Difficulty tracking revenue due to inaccurate invoices
 Inability to build strong customer relationships

24
Q

A well-designed database should: (increased flexibility)

A

 Handle changes quickly and easily
 Provide users with different views
 Have only one physical view (Physical view – deals with the physical storage of information on a storage device)
 Have multiple logical views (Logical view – focuses on how users logically access information)

25
Q

Databases reduce information redundancy

A

Inconsistency is one of the primary problems with redundant information

26
Q

Databases offer several security features including

A

 Password – provides authentication of the user
 Access level – determines who has access to the different types of information
 Access control – determines types of user access, such as read-only acces

27
Q

Data quality audit:

A

 Structured survey of the accuracy and level of completeness of the data in an information system

28
Q

Data cleansing :

A

 Software to detect and correct data that are incorrect, incomplete, improperly formatted, or redundant
 Enforces consistency among different sets of data from separate information system

29
Q

Ensuring data quality:

A

 More than 25% of critical data in Fortune 1000 company databases are inaccurate or incomplete

 Most data quality problems stem from faulty input

 Before new database in place, need to:
 Identify and correct faulty data
 Establish better routines for editing data once database in operation

30
Q

How do organizations use data? Data management is always a work-in-progress!

A

 As senses and alerts to identify threats and opportunities - Relevant and timely
 As evidence and input for decisions and analysis - Standardized and legitimate  As common language for collaboration and communication - Consistent across different users and departments
 As organization of work and responsibilities - Clear roles and authorizations for access and manipulation

31
Q

Integrity constrain

A

rules embedded in the database management system that help ensure the quality of information

 Relational integrity constraint – rule that enforces basic and fundamental information-based constraints
 Business - critical integrity constraint – rule that enforce business rules vital to an organization’s success and often require more insight and knowledge than relational integrity constraints

32
Q

Increased scalability and performance

A

A database must scale to meet increased demand, while maintaining acceptable performance levels
 Scalability – refers to how well a system can adapt to increased demands
 Performance – measures how quickly a system performs a certain process or transaction

33
Q

Data standardization/centralization allows for consistency, but it is not a trivial task

A

 Different systems may use different data standards (e.g. SCM partners)  Data from third parties may not be standardized or aligned with an organization’s data standards (e.g. sources on the Internet)  Online customers may feed the system with incorrect data due to privacy concerns  Employees may seek shortcuts in data entry, such as abbreviations  Different parties may wish to defend their own standard

34
Q

Most DBMS enforce (or at least give the option to enforce) rules to achieve consistency

A

 Validation rules for data entry

 Business rules for entity relationships

35
Q

Validation rules

A

work to standardize data structure (formatting, character limitations, mandatory data input, etc.)

eg. “Phone number of a customer must include area code”

36
Q

Business rules

A

enforce logical relationships across entities.

eg.“Each part can be provided by only one supplier”