The Hadoop Distributed File System Flashcards

1
Q

Hadoop provides ___

A

a distributed filesystem and a framework for the analysis and transformation of very large data sets using the MapReduce paradigm

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

A Hadoop cluster ___ by simply adding servers.

A

scales computation capacity, storage capacity, and I/O bandwidth

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

HDFS stores ___ and ___ separately.

A

filesystem metadata

application data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

HDFS’ architectural pieces (NIJDHcCBFs)

Never Intentionally Just Dropin Here ‘cause Charlie Brings Friends sometimes

A
NameNode
Image
Journal
DataNodes
HDFS Client
CheckpointNode
BackupNode
Filesystem Snapshots
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Files and directories are represented on the NameNode by ___

A

inodes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

inodes record attributes like ___

A

permissions
modification and access times
namespace and disk space quotas

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

The NameNode maintains ___

A

the namespace tree and the mapping of blocks to DataNodes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

image

A

the inodes and the list of blocks that define the metadata of the name system

How well did you know this?
1
Not at all
2
3
4
5
Perfectly