Module 10: Data Protection (Data Deduplication + Data Archiving) Flashcards

1
Q

What are the cons of duplicate data?

A

impacts backup windows
increases network bandwidth
difficult to protect data within budget

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is data deduplication?

A

process of detecting and identifying the unique data segments within a given set of data to eliminate redundancy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the deduplication ratio?

A

ratio of data before deduplication to the amount of data after deduplication

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the key benefits of data deduplication?

A

reduces infrastructure costs
enable longer retention periods
reduces backup windows
reduces network bandwidth

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is source based deduplication?

A

data is deduplicated at the source (backup client)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

When is source based deduplication recommended?

A

ROBO environments

also commonly used by cloud service providers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the advantages of source based deduplication?

A

reduces storage capacity and network bandwidth requirements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is target based deduplication?

A

data is deduplicated at the target (inline vs postprocess)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the advantages and disadvantages of target based deduplication?

A

offloads backup client from deduplication process

requires sufficient network bandwidth

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a disadvantage of source base deduplication?

A

puts more burden on the host since its responsible for generating safe set and deduping

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does inline deduplication mean?

A

dedupes in cache and than send to disk

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is file based dedupe?

A

takes full backups of a file and can dedupe it to reduce copies - but if any part of file changes need to do another backup

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is sub-file based dedupe?

A

when you generate a file the first day it breaks it down into sub-file/objects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is data archiving?

A

moves fixed content that is no longer actively accessed to a separate low cost archive storage system

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the advantages of data archiving?

A

saves primary storage capacity

reduces backup window and backup storage costs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is a data archive?

A

primary copy of data
available for data retrieval without recovery
typically long term retention

17
Q

What is the difference between data archive and data backup?

A

secondary copy of data
used for data recovery operations
typically short term retention

18
Q

What are the components of data archiving?

A

archiving agent
archiving server (policy engine)
archive storage

19
Q

What does the archiving agent do?

A

scans primary storage to find files that meet archiving policy

20
Q

What is the archive server?

A

indexes the files

21
Q

What is a small stub file?

A

contains the addess of the archived file and stays on the primary storage - small in size/capacity