Chapter 3 - Introduction to Databases and Data Analytics Flashcards

1
Q

database

A

an organized collection of locally related data, text, references, or pictures in a standardized format, typically stored in a computer system for multiple uses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

database administrator (DBA)

A

the individual responsible for the technical aspects of databases. The DBA is responsible for designing the database as well as managing the database after implementation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

stewardship

A

the job of supervising or taking care of something, such as an organization or property

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

certified health data analyst (CHDA)

A

an AHIMA credential that identifies practitioners who have the knowledge to acquire, manage, analyze, interpret, and transform data into accurate, consistent, and timely information while balancing the healthcare organization’s strategic vision with day-to-day details

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Triple Aim of Healthcare

A

improve the patient experience, reduce costs, and improve the health of the general population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

data set

A

a collection of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

database management system (DBMS)

A

a software package designed to define, manipulate, retrieve and manage data in a database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

data definition language (DDL)

A

the syntax for creating and modifying database objects such as tables, indices, and users. DDL statements are similar to a computer programming language for defining data structures, especially database schemas

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

data manipulation language (DML)

A

a computer programming language used for adding (inserting), deleting, and modifying (updating) data in a database. A DML is often a sublanguage of a broader database language such as SQL, with the DML comprising some of the operators in the language

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

data structures

A

a data organization, management, and storage format that enables efficient access and modification; more precisely, a data structure is a collection of data values, the relationships among them, and the functions or operations that can be applied to the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

schema

A

a representation of a plan or theory in the form of an outline or model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

data control language (DCL)

A

a syntax similar to a computer programming language used to control access to data stored in a database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

natural language queries

A

the use of common words to tell the database which data are needed. For example, the user may enter a query by typing “list all of the patients whose principal procedure is 0F140D3.”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

query by example (QBE)

A

a query method whereby the user only has to point and click to choose tables and fields contained in the database. The IS then allows the user to choose whether the entries that meet those criteria should be included or excluded from the query

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Boolean data type

A

a data type that has one of two possible values (usually denoted true and false, but in search queries can include things such as AND, BUT, OR, and NOT to retrieve information based on a specific search statement)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

wildcard character

A

a kind of placeholder represented by a single character, such as an asterisk (*), which can be interpreted as a number of literal characters or an empty string. It is often used in file searches so the full name need not be typed

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

structured query language (SQL)

A

a standard language for the relational database. SQL defines data elements, manipulates data, and controls access. The data definition components of SQL allow the user to create tables, delete tables, and show how something is viewed. Data manipulation allows the user to add and delete rows in a table and to sort, find, and compare. Another function of the data manipulation component is to update data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

data dictionary

A

a descriptive list of the names, definitions, and attributes of data elements to be collected in an IS or a database whose purpose is to standardize definitions and ensure consistent use

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

entity–relationship model (or ER model)

A

a model that describes interrelated things of interest in a specific domain of knowledge

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

data-flow diagram

A

a way of representing a flow of data through a process or a system. The DFD also provides information about the outputs and inputs of each entity and the process itself. A data-flow diagram has no control flow, there are no decision rules and no loops

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

audit trail

A

a data trail that identifies tasks such as who accessed the information system or when and where someone performed a certain data function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

vocabulary standards

A

an arbitrary and authoritative list or collection of clinical words or phrases with their meanings; this list addresses the problem of multiple ways to define, classify, and represent language

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

eponym

A

(1) a person, place, or thing after whom or which someone or something is, or is believed to be, named. (The adjectives derived from eponym include eponymous and eponymic)
(2) a name (as of a drug or a disease) based on or derived from an eponym (e.g. Disneyland as an eponym for Walt Disney)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

dyspnea

A

difficult or labored breathing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

data modeling

A

The process of determining the users’ information needs and identifying relationships among the data. There are three levels of data models—conceptual data model, physical data model, and logical data model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

conceptual data model

A

not tied to a particular database model, but rather defines the requirements for the database to be developed. This conceptual data model is the basis for the logical and physical data models.

27
Q

physical data model

A

a model that shows how the data are physically stored within the database.

28
Q

logical data model

A

a complete representation of data requirements and the structural business rules that govern data quality in support of project’s requirements; in other words, it ensures that the data are available and in a useful format for the intended purpose. For example, the nursing department would look at the data differently than the HIM department and the HIM department would look at it differently than the marketing department.

29
Q

entity-relationship diagram (ERD)

A

a common type of data modeling that focuses on relationships between entities. An entity is a person, location, thing, or concept that is to be tracked in the database. In healthcare, entities would include such items as patient, physician, and laboratory test. Each entity has attributes, which are facts or data about the entity.

30
Q

use case

A

a written description of how users will perform tasks on your website. It outlines, from a user’s point of view, a system’s behavior as it responds to a request. Each use case is represented as a sequence of simple steps, beginning with a user’s goal and ending when that goal is fulfilled.

31
Q

relational database

A

a type of database that stores and provides access to data points that are related to one another; the data is stored as database tables

32
Q

database table

A

all data related to a particular subject or concept, such as a patient, and stored in rows and columns in a spreadsheet

33
Q

file

A

a collection of digital data stored in a database

34
Q

data field

A

A predefined area within a healthcare database in which the same type of information is usually recorded. Examples of data fields collected in healthcare include patient last name, health record number, gender, diagnosis, procedure, and date of birth

35
Q

alphabetic field

A

a data field that accepts only alphabetic characters. Data elements of this field type include patient name and city.

36
Q

numeric field

A

a data field that accepts only numbers that can be calculated. This would include charges, but would not include zip codes and the health record number, as these numbers cannot be calculated (added, subtracted, and so on).

37
Q

alphanumeric field

A

a data field that accepts alphabetic characters, numbers, or a combination of the two. Examples of alphanumeric fields include street address, zip codes, and phone numbers.

38
Q

time and date fields

A

a data field that contains only a date or time or a timestamp

39
Q

autonumbering fields

A

a data field that creates a unique number that will never be assigned again. This could be the health record number or a unique number assigned to a patient visit.

40
Q

normalization

A

breaking the data elements into the level of detail desired by the healthcare organization. For example, last name and first name should be in separate fields, as should city, state, and zip code. This allows the user to search or otherwise manipulate any of the data elements.

41
Q

query

A

a request for data or information from a database table or combination of tables or from something else

42
Q

key field

A

a field in a table that holds a unique identifier to ensure that each data entry in the database table is different. This unique identifier is generally called the primary key. In the electronic health record (EHR) and other clinical ISs, this unique identifier is typically the health record number. Within the database, the tables organize the information from all records into rows and columns or cubes and use the primary key to locate the information. With the use of the primary key such as the health record number (in the patient table), then all of the patient’s health information can be linked together.

43
Q

hierarchical database model

A

a data model that structures the data in a hierarchy very similar to that used for an organizational chart. To use the analogy of a tree, the trunk of the tree would be the starting point or the initial query by the user, and as the search narrows or goes forward, each branch of the tree becomes smaller, progressing outward toward the leaves of the tree, which represent the end point of the search. Each piece of data in the database is called a parent or child node. Pointers indicate where in this tree structure the data are stored.

44
Q

network database model

A

uses pointers to connect data. The nodes are called owners and members rather than parent and child nodes, as in the hierarchical database model. A member node in the network database model can have more than one owner, unlike in the hierarchical database model. In other words, each piece of data can be connected to multiple pieces of data, kind of like how a room in MajorMUD can be connected to multiple other rooms in different directions (e.g. north, south, east)

45
Q

node

A

a basic unit of a data structure; nodes contain data and may link to other nodes

46
Q

object-oriented database model

A

a database management system in which information is represented in the form of objects as used in object-oriented programming

47
Q

multidimensional database model

A

used in data warehouses. The data are collected from multiple sources, such as other databases, and then summarized. This enables data analysis using a wide range of data. The multidimensional database is designed to quickly access this summarized data

48
Q

data warehouse

A

a large store of data accumulated from a wide range of sources within a company and used to guide management decisions

49
Q

data repository

A

an open-structure database (not dedicated to the software of any particular vendor or data supplier) in which data from multiple information systems are stored so that an integrated, multidisciplinary (includes a variety of healthcare providers) view of the data can be achieved in a single source. The data repository is updated by the various ISs in real time, thus providing users with access to the most current information available. This real-time access to data is called online or real-time transaction processing.

50
Q

nosocomial infection

A

also referred to as healthcare-associated infection (HAI); they are an infection acquired during the process of receiving health care that was not present during the time of admission

51
Q

data mart

A

a subset of the data warehouse designed for a single purpose or specialized use. The data mart performs the same type of analysis as a data warehouse; however, the scope of the data is narrower. The healthcare organization may choose to develop the data warehouse before the data mart, or the data mart can be developed first, or both can be developed at the same time. This order of development depends on the needs of the organization. Examples of how a data mart may be used include patient satisfaction and medical research. Patient satisfaction would not require the patient-specific information that would be stored in the data warehouse, but it would include the types of services, the nursing unit, and other basic information in addition to the patient survey or other patient satisfaction information collected. The data mart can be used in research because it can be used to provide deidentified information and the limited information required to conduct the research and thus can protect the confidentiality of the patient by providing only the minimum information necessary.

52
Q

data mining

A

the process of extracting and analyzing large volumes of data from a database for the purpose of identifying hidden and sometimes subtle relationships that would be unnoticed without the analysis. Data mining may also be called database exploration or information discovery.

53
Q

anomaly detection (data mining)

A

In anomaly detection, the goal is to identify data that does not follow expectations. This can be used to identify fraud and or other issues that need investigation.

54
Q

association rule learning (data mining)

A

This type of data mining identifies interesting relationships between two concepts in the database. For example, it may identify that patients who are treated with drug A have a better outcome than patients who are treated with drug B.

55
Q

cluster analysis (data mining)

A

Cluster analysis is identifying concepts that have traits in common. For example, monitoring the treatment practices of physicians for a specific diagnosis or procedure.

56
Q

classification analysis (data mining)

A

Classification analysis is a method of identifying important information about the data in the database by grouping data much like grouping diagnoses and procedures. This grouped data can then be used in the cluster analysis.

57
Q

regression analysis (data mining)

A

This method identifies the dependency between two (or more) variables, one being the outcome or dependent variable, the other being the influencing or independent variable. A regression analysis could determine how much influence hypertension or diabetes has on the outcome of a COVD-19 patient’s survival.

58
Q

dependent variable

A

the variable being tested and measured in an experiment, and that is ‘dependent’ on the independent variable

59
Q

independent variable

A

the variable in an experiment that is specifically manipulated or is observed to occur before the dependent variable, in order to assess its effect or influence on the dependent variable

60
Q

positive correlation

A

a relationship between two variables in which both variables move in tandem—that is, in the same direction

61
Q

negative correlation

A

a relationship between two variables in which one variable increases as the other decreases, and vice versa (i.e. they move in opposing directions)

62
Q

online analytical processing (OLAP)

A

a data access architecture that allows the user to retrieve specific information from a large volume of data. The use of OLAP turns the data warehouse into a decision support tool because it can analyze large amounts of data quickly. It does this in one of three ways: drilling down into the data, rolling up, or “slicing and dicing.” Drilling down is going deeper and deeper into the data. Rolling up refers to the aggregation of data, which is just the opposite of drilling down. “Slicing and dicing” is looking at the data from multiple viewpoints.

63
Q

Certificate of Need (CON) laws

A

state regulatory mechanisms for establishing or expanding health care facilities and services in a given area. In a state with a CON program, a state health planning agency must approve major capital expenditures for certain health care facilities, such as extra space for beds.