My Addition Flashcards
What is data?
Data can exist in various forms such as numeric, textual, visual, or audio, and is used to describe real-world systems.
Data is essential for understanding and analyzing different aspects of reality.
Based on scope, how can data be characterized?
- global data: accessible to everyone within a system
- scoped data: restricted to specific users or groups, allowing access only to a defined subset of information within the system
Data varies in scope, format, and access.
How can data be represented?
Data can be represented as numbers, text, images, audio, or video.
Different formats allow for diverse applications of data in various fields.
What does data access refer to?
- Data can be private to an organization, or
- Data can be publicly available due to regulations.
This distinction is crucial for understanding data privacy and sharing policies.
What was the historical context of data?
- Historically, data was mostly analog, encoded as continuous variations on various physical media.
- Today, data is mostly digital, encoded as zeros and ones on electronic and magnetic media.
This transition has enabled the development of large databases.
What is an information management system?
An information management system is a software application that manages corporate data for a specific business function.
These systems are vital for efficient data handling in organizations.
What components typically comprise an information management system?
comprise: consist of
An information management system usually includes:
* A database system
* A user interface
* Business logic
* Interfaces to other systems
These components work together to facilitate data management.
What are the primary roles in database management?
Roles in database management include:
- Database Administrator
- Database Designer
- Database Programmer
- Database User
Each role has distinct responsibilities crucial for effective database operation.
What is the role of a Database Administrator?
A Database Administrator secures the database and manages user access.
This role is critical for protecting sensitive data.
What does a Database Designer do?
A Database Designer structures the data and database format for quick access.
Effective design is key to database performance.
What is the responsibility of a Database Programmer?
A Database Programmer writes applications using databases.
This role bridges the gap between users and data.
Who are Database Users?
Database Users consume data: they request or update information.
Users play a vital role in leveraging data for decision-making.
What defines a Database?
Databases are defined as organized collections of data.
This organization enables efficient data retrieval and management.
What is a Database Management System (DBMS)?
A DBMS is software that manages databases, ensuring security and consistency.
It acts as an intermediary between users and the database.
What is a Query Language?
A query language is a specialized programming language used to retrieve or manipulate data in databases.
SQL is one of the most commonly used query languages.
What is a Database Application?
A Database Application is software for users to interact more easily with databases.
These applications enhance user experience and accessibility.
What are examples of public data sources?
Examples of public data sources include:
* data.gov (U.S. government data)
* cancer.gov/research (cancer research data)
* kaggle.com (community-driven data sets)
* data.nasa.gov (NASA’s data)
* opendata.cityofnewyork.us (data from NYC)
Public data sets are valuable for research and analysis.
True or False: Data is primarily analog in today’s context.
False
Today, data is predominantly digital.
What are the key requirements of database systems?
- Performance
- Authorization
- Security
- Rules
- Recovery
These requirements ensure effective management of data and user access.
What types of database systems are most commonly used?
- Relational databases (using SQL)
- Non-relational databases (NoSQL)
NoSQL databases have emerged to handle large volumes of less structured data.
Why are database systems more suitable than file systems for large, complex databases?
Database systems provide better performance, security, and data recovery.
What are the characteristics of transaction management in database systems?
- Ensure complete transaction processing
- Prevent conflicts between concurrent transactions
- Ensure transaction results are saved
These characteristics maintain data integrity and consistency.
What does ACID stand for in the context of relational databases?
- Atomicity
- Consistency
- Isolation
- Durability
These properties ensure reliable transaction processing.
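As a rough illustration of atomicity, the sketch below uses Python's built-in sqlite3 module (an assumption for demonstration; the cards do not name a host language). The Account table, its CHECK constraint, and the transfer helper are hypothetical. A transfer that would violate the constraint is rolled back as a whole, so the partial credit never becomes visible.

```python
import sqlite3

# Hypothetical two-account table, used only for illustration.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Account ("
    "ID INTEGER PRIMARY KEY, "
    "Balance INTEGER NOT NULL CHECK (Balance >= 0))"
)
conn.execute("INSERT INTO Account VALUES (1, 100), (2, 50)")
conn.commit()


def transfer(conn, src, dst, amount):
    """Move amount from src to dst as one all-or-nothing transaction."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute(
                "UPDATE Account SET Balance = Balance + ? WHERE ID = ?",
                (amount, dst),
            )
            conn.execute(
                "UPDATE Account SET Balance = Balance - ? WHERE ID = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        return False


ok = transfer(conn, 1, 2, 30)        # both updates are committed together
failed = transfer(conn, 1, 2, 500)   # debit violates the CHECK, credit is undone
balances = dict(conn.execute("SELECT ID, Balance FROM Account"))
```

After the failed transfer, the balances are the same as after the first, successful one.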
Fill in the blank: Relational databases store data in _______.
tables
Each table represents a specific entity and maintains relationships with other tables.
What are some advantages of relational databases?
- Structured data organization
- Flexible querying and scalability
- Data redundancy control
- Support for transactions
- Data security & integrity
- Widely supported and standardized
These advantages make relational databases effective for various applications.
What role do NoSQL databases serve in modern data management?
- Handling a wide variety of real-time data
- Flexible schema
- High availability and scalability
- Integration with cloud systems
- Support for various applications
NoSQL databases complement relational databases by providing flexibility and performance for unstructured data.
What are the main components of a database system?
- Query processor
- Storage manager
- Transaction manager
- Log
- Catalog (data dictionary)
Each component has specific functions that contribute to the overall operation of the database system.
What is the function of a catalog in a database system?
It serves as a directory containing metadata about database objects.
This metadata helps other components understand how to access and manipulate data effectively.
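One way to see a catalog in practice, assuming SQLite accessed through Python's sqlite3 module: SQLite exposes its catalog as a table named sqlite_master. The Book table and BookTitle index below are hypothetical examples. Other database systems expose their catalogs differently.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Book (ID INTEGER PRIMARY KEY, Title TEXT)")
conn.execute("CREATE INDEX BookTitle ON Book (Title)")

# sqlite_master is SQLite's catalog: one row of metadata per database object.
catalog = conn.execute(
    "SELECT type, name, tbl_name FROM sqlite_master ORDER BY name"
).fetchall()
```

Each row of catalog describes one object: its kind (table or index), its name, and the table it belongs to.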
What is a limitation of file systems compared to database systems?
- Inability to manage concurrent access effectively
- Lack of security features
- No enforcement of data integrity
These limitations can lead to data corruption and inconsistencies.
How does a database avoid issues arising from simultaneous access?
Databases provide mechanisms that manage reading and writing of shared data, which prevents conflicts and ensures data integrity.
What might happen if two programs access a text file simultaneously?
- One program may miss updates made by the other
- Data corruption may occur
- Inconsistent data may be read
These issues highlight the limitations of file systems in managing concurrent data access.
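The first issue, a missed update, can be simulated deterministically. In this sketch two hypothetical "programs" A and B are simply interleaved steps in one Python script, and counter.txt is a made-up file. Both read the file before either writes, so one update is lost.

```python
import os
import tempfile

# A made-up counter file in a temporary directory.
path = os.path.join(tempfile.mkdtemp(), "counter.txt")
with open(path, "w") as f:
    f.write("10")


def read_count(path):
    with open(path) as f:
        return int(f.read())


def write_count(path, value):
    with open(path, "w") as f:
        f.write(str(value))


# Both programs read the same starting value before either one writes.
seen_by_a = read_count(path)
seen_by_b = read_count(path)

write_count(path, seen_by_a + 1)  # program A adds 1
write_count(path, seen_by_b + 5)  # program B adds 5 and overwrites A's result

final = read_count(path)  # 15 rather than 16: A's update was lost
```

Nothing in the file system stopped B from overwriting A's work, which is the gap that a database system's concurrency control is meant to close.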
What are the four common CRUD operations?
Create, Read, Update, Delete
What does the Create operation do in CRUD?
Inserts new records into a database
What does the Read operation do in CRUD?
Retrieves existing records from a database
What does the Update operation do in CRUD?
Modifies existing records in a database
What does the Delete operation do in CRUD?
Removes records from a database
What is the primary focus of CRUD operations?
Direct manipulation of data within the database
Which SQL command is used for the Create operation?
INSERT
Which SQL command is used for the Read operation?
SELECT
Which SQL command is used for the Update operation?
UPDATE
Which SQL command is used for the Delete operation?
DELETE
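The four commands above can be tried together in a short sketch. It assumes Python's sqlite3 module with an in-memory database; the Book table and its rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Schema setup (not itself a CRUD operation); Book is a hypothetical table.
conn.execute("CREATE TABLE Book (ID INTEGER PRIMARY KEY, Title TEXT, Price REAL)")

# Create: INSERT adds new rows.
conn.execute("INSERT INTO Book (ID, Title, Price) VALUES (1, 'Dune', 9.99)")
conn.execute("INSERT INTO Book (ID, Title, Price) VALUES (2, 'Emma', 7.50)")

# Read: SELECT retrieves existing rows.
titles = [row[0] for row in conn.execute("SELECT Title FROM Book ORDER BY ID")]

# Update: UPDATE modifies existing rows.
conn.execute("UPDATE Book SET Price = 8.25 WHERE ID = 2")
new_price = conn.execute("SELECT Price FROM Book WHERE ID = 2").fetchone()[0]

# Delete: DELETE removes rows.
conn.execute("DELETE FROM Book WHERE ID = 1")
remaining = conn.execute("SELECT COUNT(*) FROM Book").fetchone()[0]
```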
What does ACID stand for in the context of database transactions?
Atomicity, Consistency, Isolation, Durability
What do CRUD operations not include?
schema management commands
What are schema operations in database management?
Commands that manage database structure, such as CREATE TABLE, ALTER TABLE, or DROP TABLE
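A minimal sketch of schema operations, again assuming SQLite through Python's sqlite3 module. The Customer table and the columns helper are hypothetical, and PRAGMA table_info is SQLite-specific. Note that these statements change structure, not the rows themselves.

```python
import sqlite3

conn = sqlite3.connect(":memory:")


def columns(conn, table):
    # PRAGMA table_info is SQLite-specific; field 1 of each row is the column name.
    return [row[1] for row in conn.execute(f"PRAGMA table_info({table})")]


conn.execute("CREATE TABLE Customer (ID INTEGER PRIMARY KEY, Name TEXT)")
before = columns(conn, "Customer")

conn.execute("ALTER TABLE Customer ADD COLUMN Email TEXT")
after = columns(conn, "Customer")

conn.execute("DROP TABLE Customer")
tables = [
    row[0]
    for row in conn.execute("SELECT name FROM sqlite_master WHERE type = 'table'")
]
```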
What do access control commands like GRANT and REVOKE manage?
User permissions and access to data
What is the SQL CREATE TABLE statement used for?
To define a new table and its columns
SQL is the primary language used for writing _______ in relational database systems.
queries
What data type is used to store integer values in SQL?
INT
What data type is used to store textual values in SQL?
VARCHAR
What is the main function of SQL in database management?
To interact with relational databases through commands that manage data.
What does the SELECT statement enable users to do?
Retrieve specific data from one or more tables
What is the purpose of database administration in SQL?
Control database management tasks, including backup, restore, and overall administration
What does the DECIMAL data type store in SQL?
Fractional numeric values
What does the DATE data type in SQL store?
Year, month, and day
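The data types on the preceding cards can appear together in one CREATE TABLE statement. The sketch below assumes SQLite through Python's sqlite3 module, and the Sale columns are hypothetical. One caveat: SQLite accepts these type names but applies its own looser "type affinity" rules, so a stricter system such as MySQL would enforce lengths, precision, and date validity more rigorously.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE Sale (
        ID        INT,            -- integer values
        Customer  VARCHAR(50),    -- textual values
        UnitPrice DECIMAL(8, 2),  -- fractional numeric values
        SaleDate  DATE            -- year, month, and day
    )
    """
)
conn.execute("INSERT INTO Sale VALUES (1, 'Ada', 12.50, '2024-03-15')")

row = conn.execute(
    "SELECT ID, Customer, UnitPrice, SaleDate FROM Sale"
).fetchone()

# In SQLite the date comes back as ISO-formatted text.
year, month, day = row[3].split("-")
```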
Fill in the blank: While some NoSQL databases might offer some level of _______, it’s not a defining feature.
- SQL compatibility; NoSQL databases are named for their distinction from traditional SQL databases.
- NoSQL databases often use different query languages or data models.
What is an Attribute?
- In essence, an attribute is a characteristic of something.
- An unnecessary attribute is one that can be removed without losing the ability to uniquely identify each row.
How was the relational model developed?
- Developed by E. F. Codd of IBM in 1970, the relational model is based on mathematical set theory and represents data as independent relations.
Refer to the ER diagram.
CustomerID is:
Columns that refer to a foreign key have an empty bullet prefix.
Refer to the ER diagram.
The arrow from Sale to Book corresponds to which column of Sale?
The BookID
Refer to the ER diagram.
What element of the ER diagram does the BookAuthor table implement?
Writes
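The ER diagram itself is not reproduced here, so the following is only a sketch of how a table like BookAuthor might implement a many-to-many Writes relationship. The column names, sample rows, and the use of SQLite through Python's sqlite3 module are all assumptions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when asked
conn.executescript(
    """
    CREATE TABLE Author (AuthorID INTEGER PRIMARY KEY, Name TEXT);
    CREATE TABLE Book (BookID INTEGER PRIMARY KEY, Title TEXT);

    -- One row per (book, author) pair: each row is one instance of Writes.
    CREATE TABLE BookAuthor (
        BookID   INTEGER REFERENCES Book (BookID),
        AuthorID INTEGER REFERENCES Author (AuthorID),
        PRIMARY KEY (BookID, AuthorID)
    );

    INSERT INTO Author VALUES (1, 'Author A'), (2, 'Author B');
    INSERT INTO Book VALUES (10, 'Shared Book');
    INSERT INTO BookAuthor VALUES (10, 1), (10, 2);
    """
)

# Follow the relationship from a book to its authors.
authors = [
    row[0]
    for row in conn.execute(
        """
        SELECT Author.Name
        FROM BookAuthor
        JOIN Author ON Author.AuthorID = BookAuthor.AuthorID
        WHERE BookAuthor.BookID = 10
        ORDER BY Author.Name
        """
    )
]
```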
What two things can the term logical design refer to?
- Either a specification for tables, keys, and columns, or
- The process that creates the specification
A schema is depicted in:
A table diagram. A schema specifies the result of logical design, not the result of analysis.
Refer to the ER diagram.
What is ‘Writes’?
- Author-Writes-Book is a relationship between the Author and Book entities.
- Relationships are depicted as lines on ER diagrams.
Refer to the ER diagram.
What is ‘Category’?
Category appears inside the Book rectangle. Therefore Category is an attribute.
Refer to the ER diagram.
What is ‘Sale’?
Entities are depicted as rectangles on ER diagrams.
A sale is a thing tracked in the database, with the properties unit price, quantity, and date.
What are the three phases of the database design process for large, complex databases?
- Analysis
- Logical Design
- Physical Design
In ER diagrams, what do rectangles represent?
Entities
What do lines between rectangles in ER diagrams represent?
Relationships
What is depicted inside rectangles and below entity names in ER diagrams?
Attributes
What is SQL identified as?
The standard relational query language
Why is SQL not generally considered a full programming language?
While SQL is a powerful language for managing data, it’s not generally considered a full-fledged programming language like Python or Java.
What is an API in the context of database programming?
A library of procedures or classes that connects a host programming language to a database
What is an example of an API for Java?
Java Database Connectivity (JDBC), an API that allows Java programs to connect to databases
What role do cursors play in database programming?
They help bridge the gap between SQL and host languages
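The cards use JDBC as the API example. As a comparable sketch in a different host language, Python's standard sqlite3 module plays the same role, and its cursor objects hand query results to the program one row at a time. The Book table and its rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Book (ID INTEGER PRIMARY KEY, Title TEXT)")
conn.executemany(
    "INSERT INTO Book VALUES (?, ?)",
    [(1, "Dune"), (2, "Emma"), (3, "Ulysses")],
)

# The cursor holds the position within the result set of the SQL query.
cursor = conn.cursor()
cursor.execute("SELECT ID, Title FROM Book ORDER BY ID")

first = cursor.fetchone()                    # one row, as a Python tuple
rest = [title for _id, title in cursor]      # iterate over the remaining rows
exhausted = cursor.fetchone()                # None once every row is consumed
```

SQL returns a whole set of rows, while the host language works with one value at a time; the cursor is the piece that steps through the set.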
What does the logical design phase implement?
Database requirements in a specific database system
In a relational database, what do keys identify?
Individual rows of a table
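A small sketch of what identifying individual rows means in practice, assuming SQLite through Python's sqlite3 module; the Customer table and names are hypothetical. Because CustomerID is declared as the primary key, a second row with the same key value is rejected.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Customer (CustomerID INTEGER PRIMARY KEY, Name TEXT)")
conn.execute("INSERT INTO Customer VALUES (1, 'Ada')")

try:
    # Same key value as an existing row: the database refuses the insert.
    conn.execute("INSERT INTO Customer VALUES (1, 'Grace')")
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

# A key value therefore selects at most one row.
names = conn.execute(
    "SELECT Name FROM Customer WHERE CustomerID = 1"
).fetchall()
```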
What is a database schema?
The logical design specified in SQL, depicted in a table diagram
What do solid bullets (●) indicate in table diagrams?
Key columns
What is the overall process of determining and implementing database requirements called?
Database design
What does the analysis phase of database design involve?
Gathering and documenting database requirements
What principle states that physical design affects query performance but not query results?
Data independence
What is the difference between logical design and physical design?
- Logical design structures data requirements
- Physical design focuses on storage details and indexing
What is information independence often used interchangeably with?
Data independence
What is the purpose of the physical design phase?
To determine how the database will be stored physically, including indexing strategies
Fill in the blank: A database design is defined as the specification of database objects such as tables, columns, data types, and _______.
Indexes
How does data independence benefit relational databases?
It allows changes to the physical design without impacting application programs
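Data independence can be checked with a quick experiment. The sketch assumes SQLite through Python's sqlite3 module, with a hypothetical Book table and BookCategory index. Adding the index changes the physical design only; the same query text returns the same rows before and after.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE Book (ID INTEGER PRIMARY KEY, Title TEXT, Category TEXT)")
conn.executemany(
    "INSERT INTO Book VALUES (?, ?, ?)",
    [(1, "Dune", "Fiction"), (2, "Cosmos", "Science"), (3, "Emma", "Fiction")],
)

query = "SELECT Title FROM Book WHERE Category = 'Fiction' ORDER BY Title"

before = conn.execute(query).fetchall()

# Physical design change: may affect how fast the query runs, not what it returns.
conn.execute("CREATE INDEX BookCategory ON Book (Category)")

after = conn.execute(query).fetchall()
```

The application code holding the query string did not need to change when the index was added.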
True or False: Logical design is more important for database users and programmers than physical design.
True
What is the visual representation of database requirements in the analysis phase?
Entity-Relationship (ER) diagrams
What does the logical design phase result in?
Tables, keys, and columns
What affects the speed of query processing in a relational database?
Physical design
What is MySQL?
A leading relational database management system (RDBMS) sponsored by Oracle
What does RDBMS stand for?
Relational Database Management System
What is SQL?
Structured Query Language, a standard programming language for managing databases