Quiz 4 Flashcards
Data
Data are the raw facts, and may be devoid of context or intent.
Data can be quantitative or qualitative. Quantitative data is numeric, the result of a measurement, count, or some other mathematical calculation. Qualitative data is descriptive.
Information
Information is processed data that possess context, relevance, and purpose.
Knowledge
Knowledge in a certain area is human beliefs or perceptions about relationships among facts or concepts relevant to that area.
Knowledge can be viewed as information that facilitates action.
Explicit vs tacit knowledge
Explicit knowledge typically refers to knowledge that can be expressed into words or numbers. In contrast, tacit knowledge includes insights and intuitions, and is difficult to transfer to another person by means of simple communications.
Wisdom
We can say that someone has wisdom when they can combine their knowledge and experience to produce a deeper understanding of a topic.
Information ladder
Data -> information -> knowledge -> wisdom
Big data
The term refers to such massively large data sets that conventional data processing technologies do not have sufficient power to analyze them.
Goal of information systems
The goal of many information systems is to transform data into information in order to generate knowledge that can be used for decision making. In order to do this, the system must be able to take data, allow the user to put the data into context, and provide tools for aggregation and analysis. A database is designed for just such a purpose.
Common defects in data resources
(1) No control of redundant data
(2) Violation of data integrity
(3) Relying on human memory to store and to search needed data.
Organized
A database is an organized collection of related data. It is an organized collection, because in a database, all data is described and associated with other data.
Relational database
In a relational database, data is organized into tables (or relations). Each table has a set of fields which define the structure of the data stored in the table. A record is one instance of a set of fields in a table. To visualize this, think of the records as the rows (or tuple) of the table and the fields as the columns of the table.
Primary key
A special filed or a combination of fields that determines the unique record is called primary key (or key).
Foreign key
A relationship between two tables is implemented by using a foreign key. A foreign key is a field in one table that connects to the primary key data in the original table.
Normalization
When designing a database, one important concept to understand is normalization. In simple terms, to normalize a database means to design it in a way that: 1) reduces data redundancy; and 2) ensure data integrity.
Personal database management systems examples
Microsoft Access and Open Office Base are examples of personal database-management systems. These systems are primarily used to develop and analyze single-user databases. These databases are not meant to be shared across a network or the Internet, but are instead installed on a particular device and work with a single user at a time.
Structured Query Language (SQL)
Almost all applications that work with databases (such as database management systems, discussed below) make use of SQL as a way to analyze and manipulate relational data. As its name implies, SQL is a language that can be used to work with a relational database.
NoSQL is looser, allowing for a more unstructured environment, communicating changes to the data over time to all the servers that are part of the database.
Query by example (QBE)
a graphical query tool, to retrieve data though visualized commands. QBE generates SQL for you, and is easy to use. In comparison with SQL, QBE has limited functionalities and is unable to work without the DBMS environment.
Relational databases do not scale well
As stated earlier, the relational database model does not scale well. The term scale here refers to a database getting larger and larger, being distributed on a larger number of computers connected via a network.
Metadata
The term metadata can be understood as “data about data.” Examples of metadata of database are: • number of records • data type of field • size of field • description of field • default value of field • rules of use. When a database is being designed, a “data dictionary” is created to hold the metadata, defining the fields and structure of the database.
Business intelligence
The term business intelligence is used to describe the process that organizations use to take data they are collecting and analyze it in the hopes of obtaining a competitive advantage.
Data visualization, data warehouses, data mining and machine learning
Data Warehouse
A data warehouse is a set of databases designed to support decision making in an organization. It is structured for fast online queries and exploration. Data warehouses may aggregate enormous amounts of data from many different operational systems.
There are two primary schools of thought when designing a data warehouse: bottom-up and top-down. The bottom-up approach starts by creating small data warehouses, called data marts, to solve specific business problems. As these data marts are created, they can be combined into a larger data warehouse. The top- down approach suggests that we should start by creating an enterprise- wide data warehouse and then, as specific business needs are identified, create smaller data marts from the data warehouse.
Data mining (and conditions)
Data mining is the process of analyzing data to find previously unknown and interesting trends, patterns, and associations in order to make decisions. Generally, data mining is accomplished through automated means against extremely large data sets, such as a data warehouse
For data mining to work, two critical conditions need to be present: (1) the organization must have clean, consistent data, and (2) the events in that data should reflect current and future trends.