Exam 1 Flashcards
Currently, most data analysis is performed by ______
A. Data analysis
B. Data scientist
C. Business users
D. All of the above
C. Business users
Which of the following is NOT part of the convergence of Data Analytics?
A. Domain Knowledge
B. Mathematics/Statics
C. Engineering
D. Computer Science
C. Engineering
Analytics takes us from Data to Decision - What is the order for the middle steps?
Wisdom
Knowledge
Data
Information
Data
Information
Knowledge
Wisdom
Which of the following is NOT one of the benefits of data analytics?
A. Performance
B. Longevity
C. Value
D. Training
D. Training
Data analytics and data science are different words for the same thing.
A. True
B. False
B. False
Place the data analytics step in the correct order.
A. Making decisions based on the information
B. Gathering data that are sometimes not in a usable form
C. Loading the data into storage models
D. Identifying the problem
D. Identifying the problem
B. Gathering data that are sometimes not in a usable form
C. Loading the data into storage models
A. Making decisions based on the information
Which of the following is an enabler of data analytics?
A. People
B. Performance
C. Infrastructure
D. Training
C. Infrastructure
Which one of the following is NOT one for the enablers of data analytics?
A. Tools
B. People
C. Infrastructure
D. Technology
B. People
Digital transformation is part of which industrial revolution?
A. 1
B. 2
C. 3
D. 4
D. 4
The 4th industrial revolution…
A. Uses water and Steam to mechanize production
B. Uses disruptive technologies and trends such as AI, IoT, robotics
C. Uses electronics and information technology to automate production
D. Uses electric energy to create mass production
B. Uses disruptive technologies and trends such as AI, IoT, robotics
Is the data described below structure, semi-structured, or unstructured or a mix of each?
A university tracks all of the classes that students sign up for each semester. The university records the course number, class decription, and course credit hours for each student.
A. Structured
B. Semi-structured
C. Unstructured
D. Mix of each
A. Structured
What is a flat file?
A. A single file linked to other single files
B. Multiple tables with no hierarchy
C. Multiple tables with hierarchy
D. Single file with no hierarchy
D. Single file with no hierarchy
Why is a primary key needed?
A. To uniquely identify a record
B. To uniquely identify a table
C. To uniquely identify an attribute
D. To uniquely identify an entity
A. To uniquely identify a record
Why is a foreign key needed?
A. To uniquely identify a record
B. To link two tables
C. To uniquely identify an entity
D. It is just an extra piece of information
B. To link two tables
Natural language processing (NLP) is the ability of a computer program to understand human language.
A. True
B. False
A. True
What is metadata?
A. A metro system
B. Provides information about other data
C. Graphically shows data
D. Show’s stored information
B. Provides information about other data
Is the data described below structured, semi-structured, or unstructured or a mix of each?
A company owns a football stadium and takes high definition photos of all fans. The company stores these images and plans eventually to use advanced technologies to see which fans are most likely to wear the team’s colors so they can market clothing to them.
A. Structured
B. Semi-structured
C. Unstructured
D. Mix of each
C. Unstructured
In online transactional processing (OLTP) data is stored one transaction at a time?
A. True
B. False
A. True
Three-tier architecture includes which of the following?
A. User interface level
B. Data level
C. Application level
D. Analysis level
A. User interface level
B. Data level
C. Application level
What is data concurrency?
A. Users are allowed access to the same data simultaneously
B. Provides access to all authorized users
C. No unnecessary replication of data
D. Separation of data from the programs that use the data
A. Users are allowed access to the same data simultaneously
A typical Enterprise Resource Planning (ERP) system will NOT support?
A. Customer Relationship Management
B. Human Resource Management
C. Supply Chain Management
D. Unique requirement of a specific business sector
D. Unique requirement of a specific business sector
What does OLAP stand for?
A. Online Analytical Processing
B. Old Angry Person
C. Online Literate Apes
D. Old Learning Algorithms Program
A. Online Analytical Processing
Online Analytical Processing (OLAP) is best defined as ______.
A. Technology for the very rapid analysis and processing of large datasets
B. Activities for detecting and correcting data in a database
C. Capability for manipulating and analyzing large datasets from many sources
D. Open-source software framework that enables distributed parallel process
C. Capability for manipulating and analyzing large datasets from many sources
A web crawler ….
A. Lists pages on the internet
B. Is used by search engines
C. indexes pages to make searching easier
D. Uses key information to return results
C. indexes pages to make searching easier
Clickstream is….
A. The fingerprint that web visitors leave
B. Sequence of hyperlinks to follow web visitor action in order
C. The links of a web page
D. The first and last page viewed by visitors
B. Sequence of hyperlinks to follow web visitor action in order
How do organizations gather data through sentiment mining?
A. Evaluate customer comments from social media (Facebook and Twitter)
B. Examine purchases through video camera
C. Uncover unknown patterns of databases and variables
D. Obtain data from UPC Scanner codes.
A. Evaluate customer comments from social media (Facebook and Twitter)
Data warehouses are informational systems.
A. True
B. False
A. True
Which of the following are true about a data warehouse (DW) structure?
A. Makes reporting and accessing data difficult
B. “Read only” and therefore modification anomalies are irrelevant
C. Relational database that has been denormalized
D. Can only hold numerical data
B. “Read only” and therefore modification anomalies are irrelevant
C. Relational database that has been denormalized
What does “denormalized” mean?
A. Breaking large database tables into many smaller tables to aid performance
B. Using sophisticated techniques to discover new relationships in a data set
C. Using techniques to investigate hypothesized relationships in data set
D. Some redundant data is added back to the database to reduce the # of tables
D. Some redundant data is added back to the database to reduce the # of tables
A multidimensional model is also referred to as a data cube or data mart.
A. True
B. False
A. True
What is data staging?
A. Area where data analytics and visuals are produced
B. Front end user interface (UI)
C. Area where data is stored indefinitely
D. Area where data are cleaned up and prepared (transformation)
D. Area where data are cleaned up and prepared (transformation)
A star schema typically has what type of relationship between a dimension and fact table?
A. Many to many
B. One to one
C. One to many
D. All of the above
C. One to many