Chapter 7: Databases and Data Warehouses Flashcards

1
Q

Data can be maintained in one of two ways: the _________, which has no mechanism for tagging, retrieving, and manipulating data, and the _________, which does have this mechanism.

A

traditional file approach

database approach

2
Q

_________ wastes storage space (and consequently money) and is inefficient.

A

Data redundancy

3
Q

Inaccuracies affect _______, the characteristic that the data represents what it is supposed to represent and that it is complete and correct.

A

data integrity

4
Q

An ________ is any object about which an organization chooses to collect data.

A

entity

5
Q

The smallest piece of data is a _________.

A

character

6
Q

A ______ is one piece of information about an entity, such as the last name or first name of a student, or the student’s address.

A

field

7
Q

The fields related to the same entity make up a _______.

A

record

8
Q

A collection of related records, such as all the records of a college’s students, is called a _______.

A

file

9
Q

The program used to build databases, populate them with data, and manipulate the data is called a ________.

A

database management system (DBMS)

10
Q

The program used to build databases, populate them with data, and manipulate the data is called a _____.

A

database management system (DBMS)

11
Q

Data is accessed in a database by sending messages called _____, which request data from specific records and/or fields and direct the computer to display the results.

A

queries

12
Q

The _________ consists of tables. Its roots are in relational algebra, but you do not have to know relational algebra to build and use relational databases.

A

relational database model

13
Q

A ___ is a field whose values identify records either for display or for processing.

A

key

14
Q

A _____ combines data from two or more tables.

A

join table

15
Q

A _____ is the field by which records in a table are uniquely identified. If your query specified that you wanted the record whose CustomerID value is 36003, the system would retrieve the record of the person you wanted, even if there are more records of people with the same name.

A

primary key
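
The CustomerID scenario in this card can be tried out with Python's built-in sqlite3 module. This is only a minimal sketch: the Customer table, its columns, and the sample rows are assumptions made for illustration.

```python
import sqlite3

# In-memory database; table, columns, and rows are illustrative assumptions.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Customer ("
    " CustomerID INTEGER PRIMARY KEY,"
    " FirstName TEXT NOT NULL,"
    " LastName TEXT NOT NULL)"
)
conn.executemany(
    "INSERT INTO Customer VALUES (?, ?, ?)",
    [(36003, "John", "Smith"), (41877, "John", "Smith")],
)

# Two customers share a name, but the primary key identifies exactly one record.
rows = conn.execute(
    "SELECT * FROM Customer WHERE CustomerID = ?", (36003,)
).fetchall()
print(rows)

# Reusing an existing primary key value is rejected by the DBMS.
try:
    conn.execute("INSERT INTO Customer VALUES (36003, 'Jane', 'Doe')")
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True
print(duplicate_rejected)
```

Querying by name would return both John Smith records; querying by CustomerID returns only one.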

16
Q

A _____ is a combination of two or more fields that together serve as a primary key, used when it is impractical to use a single field as a primary key.

A

composite key
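
As a rough illustration of a composite key, the sketch below uses sqlite3 with a hypothetical Vehicle table in which neither State nor Plate is unique on its own, but the pair is. The table and its data are assumptions, not taken from the deck.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table: the primary key is the combination (State, Plate).
conn.execute(
    "CREATE TABLE Vehicle ("
    " State TEXT NOT NULL,"
    " Plate TEXT NOT NULL,"
    " Owner TEXT,"
    " PRIMARY KEY (State, Plate))"
)
conn.execute("INSERT INTO Vehicle VALUES ('OH', 'ABC123', 'Lee')")
# Same plate in a different state: allowed, because the pair differs.
conn.execute("INSERT INTO Vehicle VALUES ('PA', 'ABC123', 'Kim')")

# Repeating the full (State, Plate) pair violates the composite key.
try:
    conn.execute("INSERT INTO Vehicle VALUES ('OH', 'ABC123', 'Ray')")
    duplicate_rejected = False
except sqlite3.IntegrityError:
    duplicate_rejected = True

vehicle_count = conn.execute("SELECT COUNT(*) FROM Vehicle").fetchone()[0]
print(duplicate_rejected, vehicle_count)
```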

17
Q

A ____ exists when each employee in a group belongs to only one department. All of them have the same department number as a foreign key in their records, and none has more than one department number.

A

one-to-many relationship
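
The department/employee case can be sketched with sqlite3 as follows. Table names, column names, and rows are hypothetical. Note that SQLite enforces foreign keys only after `PRAGMA foreign_keys = ON`.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled
conn.execute("CREATE TABLE Department (DeptNo INTEGER PRIMARY KEY, Name TEXT)")
# Each employee row carries exactly one DeptNo, so one department has many employees.
conn.execute(
    "CREATE TABLE Employee ("
    " EmpID INTEGER PRIMARY KEY,"
    " Name TEXT,"
    " DeptNo INTEGER NOT NULL REFERENCES Department(DeptNo))"
)
conn.execute("INSERT INTO Department VALUES (10, 'Accounting')")
conn.executemany(
    "INSERT INTO Employee VALUES (?, ?, ?)",
    [(1, "Ana", 10), (2, "Ben", 10), (3, "Cy", 10)],
)

members = [
    name
    for (name,) in conn.execute(
        "SELECT Name FROM Employee WHERE DeptNo = 10 ORDER BY EmpID"
    )
]
print(members)

# An employee pointing at a department that does not exist is rejected.
try:
    conn.execute("INSERT INTO Employee VALUES (4, 'Di', 99)")
    orphan_rejected = False
except sqlite3.IntegrityError:
    orphan_rejected = True
print(orphan_rejected)
```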

18
Q

A ______ can be maintained, for instance, between professors and students in a college database, because a professor might have many students and a student might have many professors.

A

many-to-many relationship
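
In relational databases a many-to-many relationship is commonly stored in a third table that pairs the keys of the two entities. The sketch below assumes hypothetical Professor, Student, and Teaches tables and uses sqlite3.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Professor (ProfID INTEGER PRIMARY KEY, Name TEXT);
CREATE TABLE Student   (StudentID INTEGER PRIMARY KEY, Name TEXT);
-- One row per professor/student pairing.
CREATE TABLE Teaches (
    ProfID INTEGER REFERENCES Professor(ProfID),
    StudentID INTEGER REFERENCES Student(StudentID),
    PRIMARY KEY (ProfID, StudentID)
);
INSERT INTO Professor VALUES (1, 'Adams'), (2, 'Baker');
INSERT INTO Student VALUES (100, 'Cruz'), (101, 'Diaz');
INSERT INTO Teaches VALUES (1, 100), (1, 101), (2, 100);
""")

# A professor has many students ...
students_of_adams = [
    name
    for (name,) in conn.execute(
        "SELECT s.Name FROM Student s"
        " JOIN Teaches t ON t.StudentID = s.StudentID"
        " WHERE t.ProfID = 1 ORDER BY s.Name"
    )
]
# ... and a student has many professors.
professors_of_cruz = [
    name
    for (name,) in conn.execute(
        "SELECT p.Name FROM Professor p"
        " JOIN Teaches t ON t.ProfID = p.ProfID"
        " WHERE t.StudentID = 100 ORDER BY p.Name"
    )
]
print(students_of_adams, professors_of_cruz)
```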

19
Q

The ______ uses the object-oriented approach to maintaining records.

A

object-oriented database model

20
Q

The combined storage of both data and the procedures that manipulate them is referred to as ______.

A

encapsulation

21
Q

The ability in object-oriented structures to create a new object automatically by replicating all or some of the characteristics of a previously developed object (called the parent object) is called _______.

A

inheritance
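
Encapsulation and inheritance are easiest to see in an object-oriented programming language. The sketch below uses plain Python classes rather than an object-oriented DBMS, and the Employee and Manager classes are hypothetical.

```python
class Employee:
    """Encapsulation: the data (name, salary) and the procedures
    that manipulate it live together in one object."""

    def __init__(self, name, salary):
        self.name = name
        self.salary = salary

    def give_raise(self, percent):
        # Integer arithmetic keeps the example free of rounding surprises.
        self.salary += self.salary * percent // 100

    def describe(self):
        return f"{self.name} earns {self.salary}"


class Manager(Employee):
    """Inheritance: Manager reuses the parent's data and methods
    and adds characteristics of its own."""

    def __init__(self, name, salary, reports):
        super().__init__(name, salary)
        self.reports = reports

    def describe(self):
        return f"{super().describe()} and manages {len(self.reports)} people"


m = Manager("Rivera", 80000, ["Ana", "Ben"])
m.give_raise(5)  # method inherited from Employee
summary = m.describe()
print(summary)
```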

22
Q

A _______ creates a temporary table that is a subset of the original table or tables. It allows you to create a report containing records that satisfy a condition, create a list with only some fields about an entity, or produce a report from a join table, which combines relevant data from two or more tables. If so desired, the user can save the newly created table.

A

relational operation

23
Q

The three most important relational operations are ____, ____, and ____.

A

select
project
join

24
Q

____ is the selection of records that meet certain conditions. For example, a human resources manager might need a report showing the entire record of every employee whose salary exceeds $60,000.

A

Select

25
Q

______ is the selection of certain columns from a table, such as the salaries of all the employees.

A

Project

26
Q

In the relational model, the joining of data from multiple tables is called a ____.

A

join
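
The three relational operations from cards 24 to 26 map onto ordinary SQL queries. The following is a minimal sketch using sqlite3, with a hypothetical Employee and Department schema and made-up rows.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Department (DeptNo INTEGER PRIMARY KEY, DeptName TEXT);
CREATE TABLE Employee (
    EmpID INTEGER PRIMARY KEY, Name TEXT, Salary INTEGER,
    DeptNo INTEGER REFERENCES Department(DeptNo)
);
INSERT INTO Department VALUES (10, 'Accounting'), (20, 'Sales');
INSERT INTO Employee VALUES
    (1, 'Ana', 72000, 10), (2, 'Ben', 55000, 20), (3, 'Cy', 61000, 20);
""")

# Select: entire records (rows) that meet a condition.
selected = conn.execute(
    "SELECT * FROM Employee WHERE Salary > 60000 ORDER BY EmpID"
).fetchall()

# Project: only certain columns, for every record.
projected = conn.execute(
    "SELECT Name, Salary FROM Employee ORDER BY EmpID"
).fetchall()

# Join: combine related data from two tables through the shared DeptNo field.
joined = conn.execute(
    "SELECT e.Name, d.DeptName FROM Employee e"
    " JOIN Department d ON e.DeptNo = d.DeptNo ORDER BY e.EmpID"
).fetchall()

print(selected)
print(projected)
print(joined)
```

Each query returns a temporary result table that is a subset or combination of the stored tables, which is what card 22 describes.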

27
Q

________ has become the query language of choice for many developers of relational DBMSs. It is an international standard and is provided with most relational database management programs.

A

Structured Query Language (SQL)

28
Q

The _____ describes the structure of the database being designed: the names and types of fields in each record type and the general relationships among different sets of records or files.

A

schema

29
Q

The descriptions of each table's structure and field types become part of a _____, which is a repository of information about the data and their organization.

A

data dictionary
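
Most DBMSs let you query the stored schema itself. As a small-scale stand-in for a data dictionary, the sketch below reads SQLite's own catalog: the `sqlite_master` table holds each table's definition, and `PRAGMA table_info` lists field names and types. The Student table is a hypothetical example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE Student ("
    " StudentID INTEGER PRIMARY KEY,"
    " LastName TEXT NOT NULL,"
    " GPA REAL)"
)

# The schema: SQLite keeps the CREATE statement for every table.
schema_sql = conn.execute(
    "SELECT sql FROM sqlite_master WHERE type = 'table' AND name = 'Student'"
).fetchone()[0]

# Field-level descriptions: name, declared type, NOT NULL flag, primary-key flag.
fields = [
    (name, col_type, bool(notnull), bool(pk))
    for (_cid, name, col_type, notnull, _default, pk) in conn.execute(
        "PRAGMA table_info(Student)"
    )
]

print(schema_sql)
for field in fields:
    print(field)
```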

30
Q

The information describing each field is called _____. It includes the source of the data (including contact information) and the population rules: what is inserted or updated, how often, and so on.

A

metadata (data about the data)

31
Q

Analyzing an organization’s data and identifying the relationships among the data is called _____.

A

data modeling

32
Q

Effective data modeling and design of each database involves the creation of a conceptual blueprint of the database. Such a blueprint is called an _______.

A

entity relationship diagram (ERD)

An ERD is a graphical representation of all entity relationships. ERDs are often consulted to diagnose a problem with a query or to implement changes.

33
Q

Transaction data can be used for important management decisions, such as researching market trends or tracking down fraud. Organizing and storing data for such purposes is called _______.

A

data warehousing

34
Q

A _______ is a large, typically relational, database that supports management decision making.

A

data warehouse

35
Q

Organizations often set up their data warehouse as a collection of _____, smaller collections of data that focus on a particular subject or department.

A

data marts

36
Q

Three phases are involved in transferring data from a transactional database to a data warehouse: ______, ______, and ______.

A

extraction
transformation
loading

37
Q

In the ____ phase, the builders create the files from transactional databases and save them on the server that holds the data warehouse.

A

extraction

38
Q

In the _____ phase, specialists “cleanse” the data and modify it into a form that allows insertion into the data warehouse. For instance, they will check if the data contains any spelling errors and fix them, and also make sure that all data is consistent.

A

transformation

39
Q

In the _____ phase, the specialists transfer the transformed files to the data warehouse. They then compare the data in the data warehouse with the original data to confirm completeness.

A

loading
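
The three phases from cards 37 to 39 can be sketched end to end in a few lines. This is a simplified illustration, not a production pipeline: both databases are in-memory SQLite databases, extraction returns rows instead of writing files to a server, and the Orders and FactOrders tables, the sample rows, and the STATE_CODES lookup are all assumptions.

```python
import sqlite3

# Hypothetical transactional database with inconsistent, uncleansed data.
source = sqlite3.connect(":memory:")
source.execute(
    "CREATE TABLE Orders (OrderID INTEGER, Customer TEXT, State TEXT, Amount TEXT)"
)
source.executemany(
    "INSERT INTO Orders VALUES (?, ?, ?, ?)",
    [(1, " alice ", "oh", "19.99"), (2, "BOB", "Ohio", "5.00"), (3, "Carol", "PA", "12.50")],
)

STATE_CODES = {"oh": "OH", "ohio": "OH", "pa": "PA"}  # assumed lookup table


def extract(conn):
    """Extraction: pull the rows out of the transactional database."""
    return conn.execute(
        "SELECT OrderID, Customer, State, Amount FROM Orders ORDER BY OrderID"
    ).fetchall()


def transform(rows):
    """Transformation: cleanse the data and make it consistent."""
    return [
        (
            order_id,
            customer.strip().title(),           # normalize spacing and capitalization
            STATE_CODES[state.strip().lower()],  # one code per state
            round(float(amount) * 100),          # store money as integer cents
        )
        for order_id, customer, state, amount in rows
    ]


def load(conn, rows):
    """Loading: insert the transformed rows into the warehouse table."""
    conn.execute(
        "CREATE TABLE FactOrders ("
        " OrderID INTEGER PRIMARY KEY, Customer TEXT, State TEXT, AmountCents INTEGER)"
    )
    conn.executemany("INSERT INTO FactOrders VALUES (?, ?, ?, ?)", rows)
    conn.commit()


warehouse = sqlite3.connect(":memory:")
extracted = extract(source)
load(warehouse, transform(extracted))

# Compare the warehouse with the original data to confirm completeness.
loaded = warehouse.execute("SELECT * FROM FactOrders ORDER BY OrderID").fetchall()
complete = len(loaded) == len(extracted)
print(loaded, complete)
```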

40
Q

_____ is a magnification or expansion of the amount, types, and level of detail of data that is collected and stored. It is data specifically collected about and from individuals.

A

Big Data

The collection and storage of ever-more detailed quantities of data.

41
Q

______ is a data mining method that uses a combination of natural language processing, computational linguistics, and text analytics to identify and extract subjective information in source materials.

A

Sentiment analysis