Chapter 14 Flashcards
______ is the Big Data “3 V” that relates to the speed at which data is entering the system.
Velocity
Scaling out is also referred to as _______.
Clustering
_______ refers to the analysis of the data to produce actionable results.
Feedback Loop Processing
A method of text analysis that attempts to determine if a statement conveys a positive, negative, or neutral attitude is referred to as ______ analysis.
Sentiment
______ is the coexistence of a variety of data storage and data management technologies within an organization’s infrastructure.
Polyglot Persistence
Within MapReduce, a _____ runs map and reduce functions.
Task Tracker
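As a rough illustration of the map and reduce functions mentioned in this card, here is a minimal single-process sketch of a word count. It does not use Hadoop and involves no real task tracker; the names `map_words`, `reduce_counts`, and `run_job` are hypothetical and chosen only for this example.

```python
from collections import defaultdict

def map_words(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]

def reduce_counts(word, counts):
    """Reduce step: combine all values that were emitted for one key."""
    return word, sum(counts)

def run_job(lines):
    """Hypothetical driver: map each line, group by key, then reduce each group."""
    grouped = defaultdict(list)
    for line in lines:
        for word, count in map_words(line):
            grouped[word].append(count)  # shuffle: collect values per key
    return dict(reduce_counts(word, counts) for word, counts in grouped.items())

word_counts = run_job(["big data", "Big cluster"])
```

In a real cluster the map and reduce calls would be spread across nodes; here they simply run in one loop.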
Most organizations that use Hadoop also use a set of other related products that interact and complement each other to produce an entire _____ of applications and tools.
Ecosystem
_______ languages allow the user to specify what they want, not how to get it, which is very useful for query processing.
Declarative
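To make the "what, not how" distinction concrete, the sketch below computes the same total twice: once with a declarative SQL query and once with an explicit loop. The `sale` table, its columns, and the sample rows are assumptions invented for this example; SQLite (via Python's standard `sqlite3` module) stands in for whatever query engine is actually used.

```python
import sqlite3

# Hypothetical sample data: (region, amount) pairs.
rows = [("east", 10), ("west", 5), ("east", 7)]

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sale (region TEXT, amount INTEGER)")
conn.executemany("INSERT INTO sale VALUES (?, ?)", rows)

# Declarative: state the desired result; the engine decides how to compute it.
declarative_total = conn.execute(
    "SELECT SUM(amount) FROM sale WHERE region = 'east'"
).fetchone()[0]
conn.close()

# Imperative: spell out each step of the computation by hand.
imperative_total = 0
for region, amount in rows:
    if region == "east":
        imperative_total += amount
```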
Within Hadoop, _____ is used for producing data pipeline tasks that transform data in a series of steps.
Pig
Within Hadoop, _______ can transfer data in both directions - into and out of HDFS.
Sqoop
_____ databases simply store data with no attempt to understand the contents of the value component or its meaning.
Key-value
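The idea that the value is opaque to the database can be sketched with a tiny in-memory store. This is an illustration only, not the API of any real key-value product; the class `KeyValueStore` and the sample keys are hypothetical.

```python
class KeyValueStore:
    """Hypothetical in-memory store: values are opaque bytes, never parsed."""

    def __init__(self):
        self._data = {}

    def put(self, key, value: bytes):
        # The store does not inspect or validate the value.
        self._data[key] = value

    def get(self, key):
        # Lookup is by key only; returns None when the key is absent.
        return self._data.get(key)

store = KeyValueStore()
store.put("user:1", b'{"name": "Ana"}')  # happens to be JSON text
store.put("img:7", b"\x89PNG")           # happens to be binary data
raw = store.get("user:1")
```

Because the store never looks inside a value, interpreting `raw` (for example, parsing it as JSON) is left to the application.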
_______ is a human-readable text format for data interchange that defines attributes and values in a document.
JavaScript Object Notation
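A short sketch of what such a document looks like and how it can be read, using Python's standard `json` module. The attribute names and values in the sample document are made up for illustration.

```python
import json

# Hypothetical JSON document: attribute names paired with values.
text = '{"customer": "Ana", "orders": [101, 102], "active": true}'

# Parse the text into native Python objects (dict, list, bool, ...).
doc = json.loads(text)

# Serialize back to JSON text, with keys sorted for a stable result.
round_trip = json.dumps(doc, sort_keys=True)
```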
______ do not store relationships as perceived in the relational model and generally have no support for join operations.
Document Databases
______ refers to traditional, relational database technologies that use column-centric, not row-centric, storage.
Column-Oriented Database
In a column family database, a column that is composed of a group of other related columns is called a(n) _____.
Super Column