4.11 Big Data Flashcards

1
Q

What is Big Data?

A

Big Data is a catch-all term for data that won’t fit the usual containers. The biggest difficulty is usually that the data lacks structure, rather than its sheer size.

2
Q

What are the three main features of Big Data?

A

1) Volume: The amount of data is too large to fit on a single server
2) Variety: Data comes in many different forms, such as structured, unstructured, text, multimedia etc.
3) Velocity: Data streams in continuously and must be responded to within milliseconds to seconds

3
Q

What is Structured Data?

A

Structured Data is data that can be defined using traditional database techniques, i.e. as records made up of fields

4
Q

What is Unstructured Data?

A

Unstructured Data is data that cannot be defined in columns and rows, e.g. multimedia files, web pages and the contents of emails

5
Q

Why is Machine Learning Used?

A

Machine learning is used with Big Data because it is good at finding patterns in qualitative (non-numerical) data in an automated way

6
Q

How is Big Data modelled?

A

Big Data can be modelled using a graph schema, which can be implemented using the graph data type

7
Q

What are the Properties of a graph schema?

A

A graph schema is made up of nodes, properties and edges

1) Nodes: Entities such as a customer, product or picker
2) Properties: Relevant data relating to a node
3) Edges: Show the link between two nodes and describe their relationship
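
The three parts of a graph schema can be sketched in a few lines of Python. This is only an illustration: the class names `Node` and `Edge`, the helper `related`, and all of the sample data (names, city, price, relationship labels) are assumptions made up for the example, not part of any particular graph database.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    """An entity in the graph, e.g. a customer, product or picker."""
    name: str
    # Properties: relevant data relating to this node (hypothetical values).
    properties: dict = field(default_factory=dict)


@dataclass
class Edge:
    """A labelled link describing how `source` relates to `target`."""
    source: Node
    label: str
    target: Node


# Hypothetical sample data.
customer = Node("customer_1", {"name": "Asha", "city": "Leeds"})
product = Node("product_7", {"title": "Kettle", "price": 19.99})
picker = Node("picker_3", {"shift": "night"})

edges = [
    Edge(customer, "ordered", product),
    Edge(picker, "picked", product),
]


def related(node, edges):
    """Return (label, target name) for every edge that starts at `node`."""
    return [(e.label, e.target.name) for e in edges if e.source is node]


print(related(customer, edges))
```

Here the edge label ("ordered", "picked") carries the description of the relationship, while the dictionaries on each node hold its properties.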

8
Q

What is distributed processing?

A

Distributed processing means splitting work over several computers; more capacity is gained by adding more servers or workstations
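
A rough sketch of the idea, assuming a word-count task: the work is split into chunks, each chunk is handed to a separate worker, and the partial results are combined. In this example threads on one machine stand in for separate servers, and the function names and sample lines are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def count_words(chunk):
    """Work done by one worker: count the words in its own chunk of lines."""
    return sum(len(line.split()) for line in chunk)


def split_into_chunks(items, n_chunks):
    """Deal the items out round-robin into n_chunks lists."""
    return [items[i::n_chunks] for i in range(n_chunks)]


def distributed_word_count(lines, n_workers=2):
    """Split the lines, count each chunk on its own worker, then combine."""
    chunks = split_into_chunks(lines, n_workers)
    # Each thread here is a stand-in for a separate server or workstation.
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        partial_counts = list(pool.map(count_words, chunks))
    return sum(partial_counts)


# Hypothetical sample data.
lines = [
    "big data flashcards",
    "volume variety velocity",
    "nodes properties edges",
    "facts are immutable",
]

print(distributed_word_count(lines, n_workers=2))
```

Raising `n_workers` mirrors adding more machines: the total is the same, but each worker handles a smaller share of the data.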

9
Q

What is Functional Programming?

A

Functional programming builds programs out of functions (subroutines that return a value)

10
Q

What are the benefits of Functional programming?

A

1) Functional programming doesn’t rely on variables whose values change, so functions have no side effects that alter how other code runs
2) The result of a function depends only on its arguments, not on other variables, which avoids concurrency problems
3) Functional code can be distributed across multiple servers and used by multiple users at once
4) It is easier to write ‘correct code’
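
A minimal sketch of these benefits, assuming a made-up order-totalling task: `order_total` is a pure function, so each call depends only on its argument and the calls could run in any order or on different machines. The function name and the sample orders are hypothetical.

```python
from functools import reduce


def order_total(order):
    """Pure function: the result depends only on the argument passed in."""
    return sum(price * qty for price, qty in order)


# Hypothetical sample data: each order is a tuple of (price, quantity) pairs.
# Tuples are immutable, so no function can change the data behind our back.
orders = (
    ((2.50, 4), (1.00, 3)),
    ((10.00, 1),),
    ((0.75, 8),),
)

# map applies the function to every order independently of the others.
totals = tuple(map(order_total, orders))

# reduce combines the independent results into a single value.
grand_total = reduce(lambda a, b: a + b, totals, 0.0)

print(totals, grand_total)
```

Because no call to `order_total` reads or writes shared state, the `map` step is the part that could be handed out to several servers without any locking.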

11
Q

Why can’t relational databases be used with Big Data?

A

Relational databases can’t be used because they require the data to fit into a row and column format, which much of Big Data does not

12
Q

What is a fact-based model?

A

1) A fact-based model is used to represent, model and query data sets at the scale of Big Data
2) It is structured around ‘facts’ instead of entities with attributes
3) Data in a fact-based model can’t be deleted or changed (it is immutable)
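
One way to picture this is an append-only store of facts. The sketch below is an assumption-laden illustration: the `Fact` fields, the use of a timestamp to decide which fact is the latest, and the helper names are all hypothetical choices for the example, not a definition of the model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)  # frozen: a fact cannot be changed once created
class Fact:
    entity: str
    attribute: str
    value: str
    timestamp: int  # assumption: each fact records when it was captured


def add_fact(store, fact):
    """Return a new store with the fact appended; nothing is overwritten."""
    return store + (fact,)


def current_value(store, entity, attribute):
    """Query: the value from the most recent matching fact, or None."""
    matching = [
        f for f in store
        if f.entity == entity and f.attribute == attribute
    ]
    if not matching:
        return None
    return max(matching, key=lambda f: f.timestamp).value


# Hypothetical sample data: the customer moves, so a new fact is added
# instead of editing or deleting the old one.
store = ()
store = add_fact(store, Fact("customer_1", "city", "Leeds", 1))
store = add_fact(store, Fact("customer_1", "city", "York", 2))

print(current_value(store, "customer_1", "city"))
```

Both facts remain in the store after the update, so the full history is still available to query.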
