11: Big Data Flashcards

1
Q

Characteristics of Big Data (3)

A
  • Volume: the data set is too large to fit on a single server
  • Velocity: the data arrives as a stream and must be processed within milliseconds to seconds
  • Variety: the data comes in many forms, such as structured, unstructured, text and multimedia
2
Q

Fact

A

In a fact-based model, each fact captures a single piece of information and is stored with a timestamp.
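A minimal sketch of what a timestamped fact could look like in code. The `Fact` class, its field names and the `latest` helper are hypothetical, and the sketch assumes that new information is added as a new fact rather than by overwriting an old one.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class Fact:
    """One timestamped piece of information (hypothetical layout)."""
    entity_id: str
    attribute: str
    value: str
    timestamp: datetime


def latest(facts, entity_id, attribute):
    """Return the most recently recorded value for an entity's attribute."""
    matching = [
        f for f in facts
        if f.entity_id == entity_id and f.attribute == attribute
    ]
    return max(matching, key=lambda f: f.timestamp).value


# Two facts about the same attribute; both are kept, neither is edited.
facts = (
    Fact("user1", "city", "Leeds", datetime(2020, 1, 5, tzinfo=timezone.utc)),
    Fact("user1", "city", "York", datetime(2021, 3, 9, tzinfo=timezone.utc)),
)

current_city = latest(facts, "user1", "city")
```

Here the timestamp is what lets the current value be worked out from the full history of facts.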

3
Q

Graph Schema (4)

A
  • A graph schema is a graph that depicts the structure of a data set stored using a fact-based model
  • Nodes represent the core entities in the data set. They are drawn as ovals
  • Edges represent the relationships between nodes. They are drawn as solid lines, either directed or undirected
  • Properties represent information about nodes. They are drawn as rectangles
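The three parts of a graph schema can be sketched as plain data. The customer and product data set below is invented for illustration, and the `describe` helper is hypothetical.

```python
# Hypothetical data set: customers who buy products.
nodes = {"Customer", "Product"}             # core entities (ovals)
edges = {("Customer", "buys", "Product")}   # relationships (solid lines)
properties = {                              # information about nodes (rectangles)
    "Customer": ["name", "email"],
    "Product": ["title", "price"],
}


def describe(node):
    """List a node's properties and its outgoing relationships."""
    outgoing = [
        (label, target)
        for source, label, target in edges
        if source == node
    ]
    return {"properties": properties[node], "relationships": outgoing}


customer = describe("Customer")
```

Storing each edge as a (source, label, target) tuple models a directed edge; an undirected edge would need to be looked up from either end.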
4
Q

When Data is Too Big to Fit on a Single Server (2)

A
  • The processing must be distributed across more than one machine
  • Functional programming helps here, because it makes it easier to write correct and efficient distributed code
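A rough sketch of the idea, assuming a thread pool on one machine stands in for several servers. The data is split into chunks, each chunk is processed by a function that depends only on its input, and the partial results are combined. The helper names `split` and `chunk_total` are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor
from functools import reduce


def split(data, parts):
    """Split a sequence into roughly equal chunks, one per worker."""
    size = -(-len(data) // parts)  # ceiling division
    return [data[i:i + size] for i in range(0, len(data), size)]


def chunk_total(chunk):
    """Sum of squares for one chunk; uses nothing but its argument."""
    return sum(x * x for x in chunk)


data = tuple(range(1, 11))   # immutable input
chunks = split(data, 3)

# Each worker handles one chunk independently of the others.
with ThreadPoolExecutor(max_workers=3) as pool:
    partial_totals = list(pool.map(chunk_total, chunks))

# Combine the partial results into the final answer.
total = reduce(lambda a, b: a + b, partial_totals, 0)
```

Because `chunk_total` shares no state with other calls, the chunks can be processed in any order, or on different machines, and still give the same total.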
5
Q

Features of Functional Programming (3)

A
  • Immutable data structures
  • Statelessness
  • Higher-order functions
6
Q

Immutable Data Structures

A

Immutable data structures cannot be changed once they have been created. This eliminates errors caused by data being overwritten by another server or process while it is in use.
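A small illustration using a frozen dataclass from Python's standard library. The `Reading` class and its fields are made up for the example.

```python
from dataclasses import dataclass, replace, FrozenInstanceError


@dataclass(frozen=True)
class Reading:
    """An immutable sensor reading (hypothetical example type)."""
    sensor: str
    value: float


original = Reading("s1", 20.5)

try:
    original.value = 99.0          # attempted in-place overwrite
    overwrite_blocked = False
except FrozenInstanceError:
    overwrite_blocked = True       # the assignment was rejected

# A change produces a new object; the original is left untouched.
updated = replace(original, value=21.0)
```

Any code still holding `original` keeps seeing the same values, whatever other parts of the program do.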

7
Q

Statelessness

A

Data structures are immutable and mutable variables are not used, so the program state does not change during execution. This means that a function called with the same inputs will always produce the same output.
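A short contrast between a function that relies on changing state and one that does not. Both function names are hypothetical.

```python
counter = 0


def stateful_next_id():
    """Reads and changes external state, so repeated calls differ."""
    global counter
    counter += 1
    return counter


def stateless_next_id(previous_id):
    """Depends only on its argument, so the same input gives the same output."""
    return previous_id + 1


first_call = stateful_next_id()
second_call = stateful_next_id()   # same (empty) input, different result

pure_a = stateless_next_id(41)
pure_b = stateless_next_id(41)     # same input, same result
```

The stateless version can be run on any machine at any time without needing to know what happened earlier.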

8
Q

Higher-Order Functions

A

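A higher-order function takes one or more functions as parameters, returns a function as its result, or both. Functions such as map and filter apply the supplied function to each item independently, so they can run on parallel systems without disturbing other parts of the data set.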
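A brief sketch using Python's built-in `map` and `filter` and `functools.reduce`, which all take a function as a parameter, plus a hypothetical `make_scaler` that returns a function.

```python
from functools import reduce


def make_scaler(factor):
    """Returns a new function, so make_scaler is itself higher-order."""
    def scale(x):
        return x * factor
    return scale


double = make_scaler(2)
values = (1, 2, 3, 4, 5)

# Each of these takes a function as its first argument.
doubled = tuple(map(double, values))
evens = tuple(filter(lambda x: x % 2 == 0, values))
total = reduce(lambda acc, x: acc + x, values, 0)
```

In `map` and `filter` each item is handled without reference to the others, which is what makes the work easy to divide between machines.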
