Chapter 14 Flashcards

1
Q

______ is the Big Data “3 V” that relates to the speed at which data is entering the system.

A

Velocity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Scaling out is also referred to as _______.

A

Clustering

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

_______ refers to the analysis of the data to produce actionable results.

A

Feedback Loop Processing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

A method of text analysis that attempts to determine if a statement conveys a positive, negative, or neutral
attitude is referred to as ______ analysis.

A

Sentimental

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

______ is the coexistence of a variety of data storage and data management technologies within an
organization’s infrastructure

A

Polyglot Persistence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Within MapReduce, a _____ runs maps and reduces functions.

A

Task Tracker

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Most organizations that use Hadoop also use a set of other related products that interact and complement
each other to produce an entire _____ of applications and tools.

A

Ecosystem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

_______ languages allow the user to specify what they want, not how to get it which is very useful for query processing.

A

Declarative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Within Hadoop, _____ is used for producing data pipeline tasks that transform data in a series of steps.

A

Pig

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Within Hadoop, _______ can transfer data in both directions - into and out of HDFS

A

Sqoop

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

_____ databases simply store data with no attempt to understand the contents of the value component or its
meaning.

A

Key-value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

_______ is a human-readable text format for data interchange that defines attributes and values in a
document.

A

JavaScript Object Notation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

______ do not store relationships as perceived in the relational model and generally have no support for join
operations.

A

Document Databases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

______ refers to traditional, relational database technologies that use column-centric, not row-centric
storage.

A

Column-Oriented Database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

In a column family database, a column that is composed of a group of other related columns is called a(n) _____.

A

Super Column

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

In a graph database, the representation of a relationship between nodes is called a(n) ______.

A

Edge

17
Q

A query in a graph database is called a(n) _____.

A

Traversal

18
Q

A database model that attempts to provide ACID-compliant transactions across a highly distributed
infrastructure is _____.

A

NewSQL

19
Q

______ is a continuous spectrum of knowledge acquisition that goes from discovery to explanation to
prediction.

A

Data Analytics

20
Q

In the ______ phase of data mining, findings are used to predict future behavior and forecast business
outcomes.

A

Prognosis