Big Data Flashcards

1
Q

What is Big Data

A

Extremely large data sets that may be analyzed computationally to reveal patterns, trends, and associations, especially relating to human behavior and interactions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the three Vs of Big Data?

A

Volume, Variety, Velocity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is RAID?

A

Redundant Array of Inexpensive Disks is a way of storing data in different places on multiple hard disks to protect data in the case of a drive failure.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is Bit Torrent Storage Architecture

A

A protocol for distributed file sharing that segments files into smaller pieces, distributed across network nodes to enable efficient data distribution and access without a central server.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

DDA-RM

A

designed around the structured schema of relational databases, which support ACID properties ensuring reliable and consistent transactional processing across distributed networks.
Struggles with the high demands of Big Data’s volume and velocity due to its rigid schema

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

DDA-NRM

A

utilizes NoSQL databases, which are inherently more flexible and scalable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

MapReduce

A

A programming model for processing large data sets with a distributed algorithm on a cluster. It processes data in two phases (Map and Reduce), making it suitable for processing large volumes of data in parallel.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

NoSQL

A

A category of database management systems that does not adhere to the traditional relational database management system (RDBMS) features and is useful for storing unstructured data. It supports a variety of data models, including document, graph, key-value, and columnar.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Hadoop

A

An open-source software framework used for distributed storage and processing of Big Data using the MapReduce programming model. It includes HDFS (Hadoop Distributed File System) and is designed to scale up from single servers to thousands of machines.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly