PA and OA set Flashcards
Organization of data for the purpose of producing desired information involves:
recording, classifying, sorting, summarizing, calculating, disseminating and storing data.
Data Processing
____ is the result of processing raw facts to reveal their meaning.
Information
______ is the result of revealing the meaning of raw facts.
information
_____ is the body of information and facts about a specific subject.
knowledge
Accurate, relevant, and timely information is the key to _____.
good decision making
A _____ serves as the intermediary between the user and the database.
Database Management System
The database structure in a DBMS is stored as a _____.
collection of files
A desktop database is a _____ database.
single-user
To reveal meaning, information requires _____.
context
Raw data must be properly _____ for storage, processing and presentation.
formatted
_____ describe the data characteristics and the set of relationships that links the data found within the database.
Metadata
Where end-user data are integrated and managed.
- The _____ itself is stored as a collection of files
- The only way to access the data in those files is through the DBMS.
database structure
The _____ receives all application requests and translates them into the complex operations required to fulfill those requests.
DBMS
One disadvantage of the DBMS is that it increases the risk of data _____.
security breaches
A _____ can store data derived from many sources.
data warehouse
- An operational database is designed for transactional processing, while
- A data warehouse is optimized for analytical processing of data from multiple sources.
_____ are raw facts of interest to the end user.
end-user data
______ provide a description of the data characteristics and the set of relationships that link the data found within the database.
Metadata
Where end-user data are integrated and managed.
- _____ exists when different versions of the same data appear in different places
- In essence, ________ is a consequence of poorly managed data redundancy.
data inconsistency
Say a customer’s address is stored in both:
- the customer relationship management (CRM) system
- the marketing database
Inconsistencies can arise if the address is updated in one system but not the other.
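The CRM and marketing scenario can be sketched in a few lines of Python. The two dictionaries, the customer ID, and the `find_inconsistencies` helper are hypothetical stand-ins for the two systems, not part of any real product.

```python
# Hypothetical stand-ins for two systems that each store the customer's address.
crm = {"C001": "12 Oak St"}
marketing = {"C001": "12 Oak St"}  # redundant copy of the same fact

# The address is updated in the CRM system only.
crm["C001"] = "98 Pine Ave"

def find_inconsistencies(a, b):
    """Return the keys whose values differ between the two stores."""
    return sorted(k for k in a.keys() & b.keys() if a[k] != b[k])

conflicts = find_inconsistencies(crm, marketing)
print(conflicts)  # ['C001']
```

Storing the address in one place and referencing it from the other would remove the redundancy that makes this conflict possible.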
The response of the DBMS to a query is the _____.
query result set
A(n) ______ database is used by an organization and supports many users across many departments.
enterprise
Workgroup databases are designed for smaller teams or departments, while enterprise databases support larger organizations with many users and departments.
A(n) _____ database supports a relatively small number of users (usually fewer than 50) or a specific department within an organization.
workgroup
- Workgroup databases are designed for smaller teams or departments, while
- Enterprise databases support larger organizations with many users and departments.
A workgroup database is a(n) _____ database.
multi-user
A data warehouse contains historical data obtained from the _____.
operational databases
_____ data exist in the format in which they were collected.
unstructured
as is
_____ data exist in a format that does not lend itself to processing that yields information.
unstructured
This type of data isn’t organized in a predefined manner (like rows and columns in a spreadsheet), making it harder to analyze and extract meaningful information.
____ data are the result of formatting disorganized data in order to facilitate storage, use and generation of information.
structured
Most data that can be encountered are best classified as _____.
semistructured
An XML database supports the storage and management of _____ XML data.
semistructured
_____ is data about data through which the end-user data are integrated and managed.
Metadata
Where end-user data are integrated and managed.
An _____ query is a spur-of-the-moment question.
ad-hoc
_____ is a special language used to represent and manipulate data elements in a textual format.
Extensible Markup Language (XML)
_____ databases focus primarily on storing data used to generate information required to make tactical or strategic decisions.
Analytical
A _____ is a collection of programs that manages the database structure and controls access to the data stored in the database.
database management system
A(n) _____ might be written by a programmer or it might be created through a DBMS utility program.
application
A _____ is a specific request issued to the DBMS for data manipulation.
query
______ refers to a collection of related records.
file
_____ enhance the user’s ability to understand the data.
spreadsheet applications
A _____ is a logically connected set of one or more fields that describes a person, place, or thing.
record
A field is a single piece of information within that record.
A _____ is a collection of related records.
file
A _____ is a character or group of characters that has a specific meaning.
field
The organization of data within folders in a manual file system is determined by _____.
expected use
A _____ is a specific deviation from the expected pattern in a data set. It can be a signal of errors, inconsistencies, or even fraudulent activity.
Data anomaly
_____ is defined as the condition in which all of the data in the database are consistent with the real-world events and conditions.
data integrity
_____ reflects a situation where the same data is stored unnecessarily at different places.
data redundancy
A(n) _____ develops when all required changes in the redundant data are not made successfully.
data anomaly
_____ exists when the same data are stored unnecessarily at different places.
Data redundancy
_____ exists when different and conflicting versions of the same data appear in different places.
Data inconsistency
- Inconsistency: Different versions of the same data exist.
- Redundancy: Same data stored in multiple places.
- Isolated data pockets within an organization.
- They lack connections to other data sources, hindering comprehensive analysis and decision-making.
islands of information
Data is said to be _____ if the data always yields consistent results.
verifiable
_____ are databases that store most of their data in RAM rather than on hard disks.
in-memory databases
The term _____ refers to an organization of components that define and regulate the collection, storage, management and use of data within a database environment.
database system
_____ relates to the activities that make the database execute transactions more efficiently in terms of storage and access speed.
performance tuning
_____ relates to activities that make a database operate more efficiently in terms of storage and access speed.
performance tuning
The DBMS uses the _____ to look up the required data component structures and relationships
- relieving programmers from having to code such complex relationships in each program.
data dictionary
A _____ contains at least all of the attribute names and characteristics for each table in the system.
- relieves programmers from having to code complex relationships in each program.
data dictionary
The practical significance of taking the _____ is that it serves as a reminder of the simple file concept of data storage.
logical view
You can think of a table as a _____ representation of a logical relation.
persistent
The order of the _____ is immaterial to the DBMS.
rows and columns
- The character data type, also known as the _____, can contain any character or symbol that is not intended for mathematical manipulation.
- Although character data are not intended for mathematical manipulation, they can still include digits, such as a numeric string.
string data type
The column’s range of permissible values is known as its _____.
domain
Each table _____ represents an attribute.
column
Date attributes contain calendar dates stored in a special format known as the _____ date format.
Julian
A table is also called a _____ because the relational model’s creator, E. F. Codd, used the two terms as synonyms.
relation
table:relation
In a relational table, each column has a specific range of values known as the _____ domain.
attribute
- Provides a framework in which an assertion (statement of fact) can be verified as either true or false.
- _____ logic, used extensively in mathematics.
predicate
_____ are required in relational databases to ensure data integrity.
primary keys
A _____ is created when the next entry is selected without making a prior entry of any kind.
- “Say you press the Enter key or the Tab key to move to the next entry without making a prior entry of any kind”
null
Depending on the sophistication of the application development software, _____ can create problems when functions such as COUNT, AVERAGE, and SUM are used.
null
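As a rough illustration of why nulls complicate aggregate functions, the sketch below uses Python's built-in `sqlite3` module. The `sale` table and its values are made up for the example, and other DBMSs may handle nulls differently.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sale (id INTEGER PRIMARY KEY, amount REAL)")
# The third row has a null amount (no entry was made).
conn.executemany("INSERT INTO sale (amount) VALUES (?)", [(10.0,), (20.0,), (None,)])

count_all, count_amount, total, average = conn.execute(
    "SELECT COUNT(*), COUNT(amount), SUM(amount), AVG(amount) FROM sale"
).fetchone()

print(count_all)     # 3: COUNT(*) counts every row
print(count_amount)  # 2: COUNT(amount) skips the null
print(total)         # 30.0
print(average)       # 15.0: the null row is left out of the average, not treated as 0
```

Whether an average of 15.0 or 10.0 is the "right" answer depends on what the null means, which is exactly the problem the card describes.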
In the relational model, _____ are important because they are used to ensure that each row in a table is uniquely identifiable.
keys
A _____ is a key that can uniquely identify any row in the table.
superkey
A _____ key can be described as a minimal superkey, a superkey without any unnecessary attributes.
candidate
A _____ is the primary key of one table that has been placed into another table to create a common attribute.
foreign key
A _____ key is defined as a key that is used strictly for data retrieval purposes.
secondary
When you define a table’s primary key, the DBMS automatically creates a(n) _____ index on the primary key column(s) you declared.
unique
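One way to observe this behavior is with SQLite through Python's `sqlite3` module. This is a sketch under SQLite-specific assumptions: the `customer` table is hypothetical, and SQLite creates the automatic index for a non-INTEGER primary key (an `INTEGER PRIMARY KEY` column is stored differently and gets no separate index).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# No CREATE INDEX statement is issued anywhere in this script.
conn.execute("CREATE TABLE customer (cus_code TEXT PRIMARY KEY, name TEXT)")

# Each row returned by PRAGMA index_list starts with (seq, name, unique, ...).
indexes = conn.execute("PRAGMA index_list('customer')").fetchall()
for row in indexes:
    print(row[1], "unique" if row[2] else "non-unique")
```

The single index listed was created by the DBMS itself when the primary key was declared.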
In a relational model, _____ :
- establish relationships among tables
- ensure the integrity of the data.
keys
A primary key is a _____ key selected to uniquely identify all other attribute values in any given row.
In essence, an attribute is a characteristic of something.
candidate
- The CUSTOMER table’s primary key: [ID].
- The CUSTOMER primary key column has no null entries, and all entries are unique.
This is an example of _____.
entity integrity
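A minimal sketch of a DBMS enforcing entity integrity, using SQLite via Python's `sqlite3` module. The table layout and the `try_insert` helper are assumptions made for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# NOT NULL is spelled out because SQLite, unlike the SQL standard,
# permits nulls in non-INTEGER primary key columns.
conn.execute("CREATE TABLE customer (id TEXT PRIMARY KEY NOT NULL, name TEXT)")
conn.execute("INSERT INTO customer VALUES ('C1', 'Ana')")

def try_insert(row):
    """Return True if the row is accepted, False if an integrity rule rejects it."""
    try:
        conn.execute("INSERT INTO customer VALUES (?, ?)", row)
        return True
    except sqlite3.IntegrityError:
        return False

duplicate_ok = try_insert(("C1", "Ben"))  # repeats an existing primary key value
null_ok = try_insert((None, "Cy"))        # primary key is null
print(duplicate_ok, null_ok)  # False False
```

Both rejected rows would have broken entity integrity: one by duplicating a primary key value, the other by leaving it null.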
_________ dictates:
- That the foreign key must contain values that match the primary key in the related table
- or must contain null.
Referential integrity
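The rule can be tried out with SQLite via Python's `sqlite3` module. This is a sketch, not a prescribed design: the `customer` and `invoice` tables are hypothetical, and SQLite only enforces foreign keys after `PRAGMA foreign_keys = ON`.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves enforcement off by default
conn.execute("CREATE TABLE customer (cus_id INTEGER PRIMARY KEY, name TEXT)")
conn.execute(
    "CREATE TABLE invoice ("
    " inv_id INTEGER PRIMARY KEY,"
    " cus_id INTEGER REFERENCES customer (cus_id))"
)
conn.execute("INSERT INTO customer VALUES (1, 'Ramas')")
conn.execute("INSERT INTO invoice VALUES (100, 1)")     # matches a primary key value
conn.execute("INSERT INTO invoice VALUES (101, NULL)")  # null foreign key is allowed

try:
    conn.execute("INSERT INTO invoice VALUES (102, 99)")  # there is no customer 99
    rejected = False
except sqlite3.IntegrityError:
    rejected = True

print(rejected)  # True
```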
The _____ constraint can be placed on a column to ensure that every row in the table has a value for that column.
NOT NULL
NOT NULL enforces data integrity by preventing null values in the specified column.
A null represents the absence of any data value.
To avoid nulls, some designers use special codes, known as _____, to indicate the absence of some value.
flags
- Sometimes described as “the database designer’s database”
- It records the design decisions about tables and their structures.
data dictionary
In summary: provides a detailed description of all tables found within the user/designer-created database
In a database context, a(n) _____ indicates the use of different names to describe the same attribute.
synonym
A _____ provides a detailed description of all tables found within the user/designer-created database
data dictionary
Sometimes described as “the database designer’s database”
The _____ can be described as a detailed system data dictionary that describes all objects within the database.
objects within the database:
“Table data, Indexing data, accessing data”
system catalog
objects within the database:
- Table names, the table’s creator, and creation date
- The number of columns in each table; the data type corresponding to each column
- index filenames, index creators
- authorized users, and access privileges.
A ____ is a central database repository storing metadata about the database’s structure, objects, and other vital information.
system catalog
Repository: a file, database, or cloud server address that contains one or more projects
_____ enforce integrity rules automatically.
Relational Database Management System
RDBMSs
The _____ relationship is the “relational model ideal”
Which relationship is rare? 1:M or 1:1
1:M
1:1 is a rare relationship
The _____ relationship should be rare in any relational database design.
1:1
Another name for a composite entity is a(n) _____ entity.
bridge
The _____ relationship is the relational database norm.
Which relationship is rare? 1:M or 1:1
1:M
1:1 is a rare relationship
The entities PROFESSOR and DEPARTMENT in the scenario below exhibit a _____ relationship:
- Each department can only have one professor serving as its chair.
- A professor who is a department chair can only chair one department at a time.
1:1
One characteristic of generalization hierarchies is that they are implemented as _____ relationships.
1:1
The proper use of foreign keys is crucial to controlling _____.
data redundancy
Proper _____ design requires carefully defined and controlled data redundancies to function properly.
data warehouse
A(n) _____ is an orderly arrangement used to logically access rows in a table.
Logical: the arrangement allows efficient retrieval and manipulation of data based on specified criteria or relationships.
index
A _____ index is an index in which the index key can have only one pointer value (row) associated with it.
unique
An index key can have multiple _____ .
attributes
An index whose key has multiple attributes is also known as a composite index.
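A small sketch of an index with a multi-attribute key that is also unique, using SQLite via Python's `sqlite3` module. The `enroll` table, its columns, and the index name `enroll_ndx` are made up for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE enroll (stu_num INTEGER, class_code TEXT, grade TEXT)")
# The index key is made up of two attributes.
conn.execute("CREATE UNIQUE INDEX enroll_ndx ON enroll (stu_num, class_code)")

# PRAGMA index_info returns one (seqno, cid, name) row per key attribute.
key_columns = [row[2] for row in conn.execute("PRAGMA index_info('enroll_ndx')")]
print(key_columns)  # ['stu_num', 'class_code']

conn.execute("INSERT INTO enroll VALUES (321, 'DB101', 'A')")
conn.execute("INSERT INTO enroll VALUES (321, 'NET200', 'B')")  # same student, other class
try:
    conn.execute("INSERT INTO enroll VALUES (321, 'DB101', 'C')")  # repeats the full key
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

print(duplicate_rejected)  # True
```

Uniqueness applies to the combination of attributes, so a student number may repeat as long as the class code differs.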
An entity in the entity relationship model corresponds to a table in the _____ environment.
relational
In the entity relationship model, a table _____ corresponds to an entity instance.
row
In the Chen and Crow’s Foot notations:
An entity is represented with a _____ containing the entity’s name.
rectangle
In the original Chen notation:
- Each attribute is represented by an _____
- With the attribute name connected to an entity rectangle with a line.
oval
A(n) _____ is a primary key composed of more than one attribute.
composite identifier
Connectivities and cardinalities are established by concise statements known as _____.
business rules
In a one-to-many (1-M) relationship:
- To avoid the possibility of referential integrity errors
- The data of the _____ side must be loaded first.
one
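The loading order can be tried out with SQLite via Python's `sqlite3` module. In this sketch the `painter` ("one" side) and `painting` ("many" side) tables, their rows, and the `load` helper are all hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite leaves enforcement off by default
conn.execute("CREATE TABLE painter (painter_num INTEGER PRIMARY KEY, name TEXT)")
conn.execute(
    "CREATE TABLE painting ("
    " painting_num INTEGER PRIMARY KEY,"
    " title TEXT,"
    " painter_num INTEGER NOT NULL REFERENCES painter (painter_num))"
)

def load(sql, row):
    """Try to load one row and report whether the DBMS accepted it."""
    try:
        conn.execute(sql, row)
        return "loaded"
    except sqlite3.IntegrityError:
        return "rejected"

painting_sql = "INSERT INTO painting VALUES (?, ?, ?)"
painting_row = (1338, "Dawn Thunder", 123)

many_first = load(painting_sql, painting_row)  # painter 123 does not exist yet
one_side = load("INSERT INTO painter VALUES (?, ?)", (123, "Ross"))
many_after = load(painting_sql, painting_row)  # now the foreign key has a match

print(many_first, one_side, many_after)  # rejected loaded loaded
```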
_____ relationships are common in manufacturing industries.
unary
Unary: consisting of or involving a single component or element.
A _____ has a primary key that is partially or totally derived from the parent entity in the relationship.
weak entity
The _____ notation of entity-relationship modelling can be used for both conceptual and implementation modelling.
UML (Unified Modeling Language)
The entity relationship diagram (ERD) represents the _____ database as viewed by the end user.
Number of entity occurrences is very helpful at the application software level.
conceptual
Conceptual: relating to or based on mental concepts
A(n) _____ is the set of possible values for a given attribute.
domain
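One way a DBMS can enforce a domain is with a CHECK constraint. The sketch below uses SQLite via Python's `sqlite3` module. The `student` table, the 0.0 to 4.0 GPA range, and the `in_domain` helper are assumptions chosen for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE student ("
    " stu_num INTEGER PRIMARY KEY,"
    " gpa REAL CHECK (gpa BETWEEN 0.0 AND 4.0))"  # assumed domain for gpa
)

def in_domain(gpa):
    """Return True if the DBMS accepts the value, False if the CHECK rejects it."""
    try:
        conn.execute("INSERT INTO student (gpa) VALUES (?)", (gpa,))
        return True
    except sqlite3.IntegrityError:
        return False

accepted = in_domain(3.2)
rejected = in_domain(4.5)
print(accepted, rejected)  # True False
```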