Exam 2- Chapters 5 and 12 Flashcards
T or F: The process of acquiring, keeping, and managing data is becoming increasingly difficult.
True
All information technology applications require:
data
Data should be of high quality, meaning it should be what 7 things?
- Accurate
- Complete
- Timely
- Consistent
- Accessible
- Relevant
- Concise
What are the 8 difficulties of managing data?
- The amount of data is increasing with time.
- Data is frequently stored in numerous servers and locations
- Data are generated from multiple resources
- New data is constantly being developed, and this data must be managed; data also becomes less current over time
- Data are subject to data rot
- Data security, quality, and integrity are easily jeopardized.
- Federal regulations complicate data management and companies are drowning in data
- Big Data
Inconsistent data prevents a company from what?
Developing a unified view of core business information
a collection of data held by one group that is not easily accessible by other groups
data silo
One major method to remove data silos is through ________ ________ _________.
cloud data management
What are three issues with data silo’s?
- Hinder the process of gaining actionable insights from organizational data
- Create barriers to an overall view of the enterprise and its data
- Delay digital transformation efforts
Data that are continuously generated by point-of-sale systems, clickstream data, social media, and sensors
Data streams
Point of Sale systems collect data in _____ _____.
real time
the information that is collected during the customer sales transaction (like name, product identification number, unit price of each item, etc.)
point-of-sale data
What are the four types of data streams?
- Point-of-Sale Data
- Clickstream Data
- Social Media Data
- Sensor Data
the data that visitors and customers produce when they visit a website and click on hyperlinks
clickstream data
What type of data stream would this be: data that includes shares, likes and dislikes, ratings, reviews, recommendations, and comments.
social media data
a system in which any object, natural or manmade, contains internal or external wireless sensor(s) that communicate with each other without human interaction. What type of data stream does this relate to?
The Internet of Things (IoT); sensor data
monitors and reports data on physical and environmental conditions around it, such as temperature, sound, pressure, vibration, and movement
sensor data
What can sensors do?
Control physical systems, such as opening and closing a valve and adjusting the fuel mixture in your car
T or F: Being able to collect massive amounts of data from many different sources is a double-edged sword.
True
problems with the media on which the data are stored
data rot
What are the two aspects of data rot?
- Over time, temp, humidity, and exposure to light can cause physical problems with storage media and make it difficult to access data
- Finding the machines needed to access the data can be difficult
What three things are critical for data, yet easily jeopardized?
Security, quality, and integrity
Today capturing data is (hard/easy) and storing that data is relatively (expensive/inexpensive).
easy; inexpensive
An increasing number of companies are turning to ________ to help them make sense of their data.
Enigma
a data management and business intelligence company that specializes in data integration and analytics.
Enigma
Enigma notes that it has always been difficult for banks to share data, principally due to ________ ______.
privacy concerns
a set of programs that provide users with tools to create and manage a database
database management system (DMS)
the process of accepting liability under an insurance policy, thus guaranteeing payment in case loss or damage occurs
underwriting
T or F: Databases and database management systems are essential to all areas of business, so they must be carefully managed.
True
Why do we focus on the RELATIONAL database model?
Because it is popular and easy to use
Database models, like the hierarchical and network models, are the responsibility of the _____ _____ and are NOT used by ___________ ________.
MIS function; organizational employees
What are two examples of relational databases?
- Microsoft Access
- Oracle
The relational database model is based on the concept of _____-________ tables.
two-dimensional
A relational database is usually designed with which of the following:
1. a big table that contains all of the records and attributes
2. a number of related tables
- a number of related tables
Relational database rows contain ____ and columns contain _____.
records; attributes
A key to designing an effective database is the _____ ______.
data model
a diagram that represents entities in the database and their relationships
data model
a person, place, a thing, or an event– such as a customer, an employee, or a product– about which an organization maintains information.
entity
Entities can typically be identified in the user’s _____ _____. A ______ generally describes an entity.
work environment; record
each row in a relational table, which is a specific, unique representation of the entity
instance of an entity
A student in a universities student database would be an ______. A particular student would be an ______ of the student entity.
entity; instance
each characteristic or quality of a particular entity
attribute
T or F: an example of an entity attribute of a customer would be customer name
True
T or F: entity attribute of a product could be product color
True
one field that uniquely identifies a record so that it can be retrieved, updated and sorted
primary key
What’s an example of a primary key?
a unique student id number
another field that has some identifying information but typically does not identify the record with complete accuracy
secondary key
What’s an example of a secondary key?
a students major
a field (or group of fields) in one table that uniquely identifies a row of another table; used to establish and enforce a link between two tables
foreign key
highly organized data in fixed fields in a data repository such as a relational database
structured data
data that do not reside in a traditional relational database
unstructured data
What are three examples of unstructured data?
- E-mail messages
- Videos/images
- Powerpoint presentations
80-90% of data in an organization are (structured/unstructured)
unstructured
T or F: Databases typically process data in real time
True
Do users have access to databases?
If not, where can users access data for decision making?
NO; data warehouses
The world generates over ___ ______ of data each year.
one zetabyte
The amount of data produced worldwide is increasing by ___% each year.
50%
Most stored information in the world today (98%) is (digital/non-digital)
digital
a collection of data that is so large and complex that it is difficult to manage using traditional database management systems
Big Data
large amounts of traditional data
big data
Why does Big Data systems perform well?
Bc they contain huge amounts of data on which to base their predictions
diverse, high-volume, high-velocity information assets that require new forms of processing in order to enhance decision-making, lead to insights, and optimize business processes
Big Data
Big Data generally consists of what four things?
- Traditional enterprise data
- Machine-generated/sensor data
- Social data
- Images
Big Data has three distinct characteristics:
- Volume
- Velocity
- Variety
Velocity in big data refers to the fact that the rate of which data flows into an organization is rapidly (increasing/decreasing).
increasing
Why is velocity critical?
It increases the speed of the feedback loop between a company, its customers, its suppliers, and its business partners
Big Data formats change (slowly/rapidly).
rapidly
What are the three issues with Big Data?
- Big data can come from untrusted sources
- Big Data is dirty
- Big Data changes, especially in data streams
refers to inaccurate, incomplete, incorrect, duplicate, or erroneous data
dirty data
What three things have made processing Big Data today affordable for most organizations?
- Hardware
- Cloud computing
- Open-source software
What is the first step for many organizations toward managing data?
integrate information silos into a database environment and then to develop data warehouses for decision making
an information system that does not communicate with other related information systems in an organization
information silos
What type of databases are particularly useful when working with Big Data?
NoSQL
the coordinated processing of an application by multiple processors that work on different parts of the application, with each processor utilizing its own operating system and memory
massively parallel processing
a database management system that primarily relies on main memory for data storage, in contrast to DBMSs that use hard -drive storage
in-memory database
refers to the storage or relatively inactive data that does not have to be accessed frequently or rapidly
cold data
refers to data that must be accessed frequently and rapidly
hot data
What are the 5 strategies that can be used by modern organizations to manage Big Data and gain value from it?
- Making Big Data Available
- Enabling Organizations to Conduct Experiments
- Microsegmentation of Customers
- Creating New Business Models
- Organizations Can Analyze Big Data
A/B experiments have only _____ possible outcomes.
two
dividing customers up into very small groups, or even down to the individual customer
microsegmentation
Big Data is valuable to various functional areas in the firm. What five functional areas does this include?
- Human Resources
- Product Development
- Operations
- Marketing
- Government Operations
applications, technologies, and processes for gathering, storing, accessing, and analyzing data to help business users make better decisions
business analytics
a repository of historical data that are organized by subject to support decision makers within the organization
data warehouses
Data warehouses are (inexpensive/expensive), so they are used primarily by (small/large) companies.
expensive; large
a low-cost, scaled down version of a data warehouse that is designed for the end-user needs in a strategic business unit (SBU) or an individual department
data mart
T or F: Data warehouses can be implemented more quickly than data marts, often in less than 90 days
false; data marts can be implemented more quickly
T or F: data marts support local rather than central control by conferring power on the user group. typically, groups that need a single or a few business analytics applications require only a data mart rather than a data warehouse.
true
What are the 6 basic characteristics of data warehouses and data marts?
- Organized by business dimension or subject
- Use online analytical processing
- Integrated
- Time variant
- Nonvolatile
- Uses a multidimensional data structure
A data warehouse/mart may store _____ of data, while transactional systems maintain _____ data.
years; recent
What does it mean to say data warehouses and data marts are “nonvolatile”?
Users cannot change or update the data
Who updates warehouses and marts?
Through IT-controlled load processes rather than by users
A common representation for this multidimensional structure (which is the structure data is stored in for data warehouses/marts) is the ____ _____.
data cube
The data in data warehouses and marts are organized by ______ ______, which are subjects such as product, geographic area, and time period.
business dimensions
What does the Business Analytics Process start with?
A business problem, often called pain points by practicing managers.
transmission speed
bandwidth
Transmission speed (bandwidth) in computer networks, particularly the Internet, is rapidly (increasing/decreasing)
increasing.
Advances in digital storage capacity and access speed are driving the cost of storage (up/down), meaning that what?
down; organizations are able to store and analyze huge amounts of data
What is the most important issue that you will face in the Business Analytics Process?
Defining the business problem
an ongoing collection of related activities that create a product or service of value to the organization, its business partners, and its customers
a business process
What is the next step in the Business Analytics Process after defining the business problem?
consider the data that you have for the analysis
What is the heart of the analytics process?
Big Data
What are the 3 types of analytics applications?
- Descriptive Analytics
- Predictive Analytics
- Prescriptive Analytics
In general, who usually performs descriptive, predictive, and prescriptive analyses?
Data scientists
What is the most common and popular business analytics tool?
Excel
the conversion of raw data into a smaller amount of more useful information
data reduction
Descriptive, predictive, and prescriptive analytics are essentially steps in _____ _______.
data reduction
Summarizes what has happened in the past and enables decision makers to learn from past behaviors
descriptive analytics
What are the business analytics tools in descriptive analytics?
- Online Analytical Processing (OLAP)
- Data Mining
- Decision Support Systems
the process of searching for valuable business information in a large database, warehouse, or mart
data mining
a data mining application that discovers co-occurrence relationships among activities performed by specific individuals or groups
affinity analysis
What is this an example of: young men often buy diapers and beer at the same time
affinity analysis
analysis in which retailers seek to understand the purchase behavior of customers (ex: customers who bought book A might also like to buy book B)
market basket analysis
simplified representations, or abstractions, of reality
models
combine models and data to analyze semi structured problems and some unstructured problems that involve extensive user involvement; enable business managers and analysts to access data interactively, to manipulate these data, and to conduct appropriate analyses
decision support systems
examines how sensitive an output is to any change in an input while keeping other inputs constant
sensitivity analysis
analysis that attempts to predict the impact of changes in the assumptions– that is, the input data– on the proposed solution
what if analysis
analysis that attempts to calculate the value of the inputs necessary to achieve a desired level of output
goal seeking analysis
examines recent and historical data to detect patterns and predict future outcomes and trends. _________ analytics provides estimates about the likelihood of a future outcome
predictive analytics
What is the difference between preventive maintenance and predictive maintenance?
Preventive: scheduled at regular intervals
Predictive: occurs as needed based on conditions of the asset (reduces labor and material costs)
a type of predictive analysis; is the process of analyzing opinions expressed in a piece of text (ex: a Tweet) or in a speech to determine whether the writer’s or listener’s attitude toward a particular topic, product, or service is positive, negative, or neutral
sentiment analysis
T or F: Problems with predictive analytics are most likely to arise when algorithms make things happen automatically, without human intervention or oversight
True (think about Uber armed man example)
T or F: The best analytics approach depends on the kind of data you’re working with, therefore, there cannot be one best approach for all problems.
True
recommends one or more courses of action and identifies the likely outcome of each decision
prescriptive analytics