Chapter 6 - Data Deduplication Flashcards
Name three scenarios that would be ideal for data deduplication.
General-purpose file servers
VDI deployments
Backup targets
Which data deduplication policy specifies the files that should be considered for data deduplication?
Optimization policy
What two fields in Get-DedupStatus are relevant to the optimization rate?
OptimizedFilesSavingsRate and SavingsRate
What is a Chunk?
A section of a file that the Data Deduplication chunking algorithm has selected as likely to occur in other, similar files
What is a Chunk Store?
An organized series of container files in the System Volume Information folder that Data Deduplication uses to uniquely store chunks
What is Dedup?
An abbreviation for data deduplication that is commonly used in PowerShell, Windows Server APIs and components, and the Windows Server community
What is File Metadata?
Information that describes properties about the file that are not related to the main content of the file
What is File Stream?
The main content of the file
What is a File System?
The software and on-disk data structure that the operating system uses to store files on storage media
What is a File System Filter?
A plugin that modifies the default behavior of the file system
What is Optimization?
The process of chunking a file and storing its unique chunks in the chunk store
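As a rough illustration of optimization, the following Python sketch splits data into chunks and keeps only one copy of each unique chunk. It is a simplification under stated assumptions: it uses fixed-size chunks and SHA-256 digests as chunk identifiers, which is not a description of Data Deduplication's own chunking algorithm or chunk store format. The names `CHUNK_SIZE`, `optimize`, and `chunk_store` are hypothetical.

```python
import hashlib

# Hypothetical chunk size, kept tiny so the example is easy to follow.
CHUNK_SIZE = 4


def optimize(data: bytes, chunk_store: dict) -> list:
    """Chunk the data, store unique chunks, and return the file's chunk map."""
    chunk_map = []
    for offset in range(0, len(data), CHUNK_SIZE):
        chunk = data[offset:offset + CHUNK_SIZE]
        digest = hashlib.sha256(chunk).hexdigest()
        # A chunk is added only if an identical one is not already stored.
        chunk_store.setdefault(digest, chunk)
        chunk_map.append(digest)
    return chunk_map


# Two similar "files" share their first two chunks.
chunk_store = {}
map_a = optimize(b"AAAABBBBCCCC", chunk_store)
map_b = optimize(b"AAAABBBBDDDD", chunk_store)
```

In this sketch the two inputs contain six chunks in total, but the store holds only the four distinct ones, which is where the space savings come from.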
What is Optimization Policy?
A policy which specifies the files that should be considered for data deduplication
What is Reparse Point?
A special tag that notifies the file system to pass off I/O to a specified file system filter. In data deduplication, it is how optimized files are stored: the file stream is replaced by a reparse point that points to the file's chunk map.
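The idea of a reparse point handing I/O to a filter can be sketched conceptually in Python. This is only a toy model, not how the file system or the Data Deduplication filter is implemented: the record layout, the `"dedup"` tag value, and the functions `dedup_filter_read` and `read_file` are all hypothetical.

```python
import hashlib


def chunk_id(chunk: bytes) -> str:
    """Hypothetical chunk identifier: the SHA-256 digest of the chunk."""
    return hashlib.sha256(chunk).hexdigest()


# Hypothetical chunk store: chunk identifier -> chunk bytes.
chunk_store = {chunk_id(c): c for c in (b"Hello, ", b"dedup ", b"world")}

# Stand-in for an optimized file: no file stream of its own, only a tag
# naming the filter and an ordered chunk map.
optimized_file = {
    "reparse_tag": "dedup",
    "chunk_map": [chunk_id(b"Hello, "), chunk_id(b"dedup "), chunk_id(b"world")],
}


def dedup_filter_read(file_record: dict, store: dict) -> bytes:
    """Rebuild the file stream by following the chunk map into the store."""
    return b"".join(store[cid] for cid in file_record["chunk_map"])


def read_file(file_record: dict, store: dict) -> bytes:
    # The toy "file system" sees the tag and passes the read to the filter.
    if file_record.get("reparse_tag") == "dedup":
        return dedup_filter_read(file_record, store)
    return file_record["stream"]


content = read_file(optimized_file, chunk_store)
```

The caller of `read_file` never needs to know whether the file was optimized, which mirrors the point that access semantics stay the same for optimized files.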
What is Volume?
A Windows construct for a logical storage drive that may span multiple physical storage devices across one or more servers
What is Workload?
An application that runs on Windows Server
What are some usage scenarios for data deduplication, and what are the typical space savings for each?
User documents: 30% to 50%
Deployment shares: 70% to 80%
Virtualization libraries: 80% to 95%
General file shares: 50% to 60%
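To make the percentages concrete, here is a small Python sketch that turns them into an estimated on-disk size, treating the figures above as typical space-savings ranges. The function name `estimated_size_after_dedup` and the 1000 GB input are hypothetical, and real savings depend on the actual data.

```python
# Savings ranges from the list above, as (low, high) fractions.
TYPICAL_SAVINGS = {
    "User documents": (0.30, 0.50),
    "Deployment shares": (0.70, 0.80),
    "Virtualization libraries": (0.80, 0.95),
    "General file shares": (0.50, 0.60),
}


def estimated_size_after_dedup(logical_gb: float, scenario: str) -> tuple:
    """Return (best_case_gb, worst_case_gb) on disk after optimization."""
    low, high = TYPICAL_SAVINGS[scenario]
    # Higher savings means less space left on disk, so 'high' gives the best case.
    return (logical_gb * (1 - high), logical_gb * (1 - low))


best, worst = estimated_size_after_dedup(1000, "User documents")
```

For 1000 GB of user documents, a 30% to 50% savings range corresponds to roughly 500 GB to 700 GB on disk after optimization.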
How does data deduplication help in general file servers?
These servers often hold team shares, user home folders, work folders, and software development shares, where multiple users tend to keep many copies or versions of the same file, so data deduplication has plenty of duplicate data to remove.
How does data deduplication help in Virtualized Desktop Infrastructure (VDI) deployments?
Many of the virtual hard disks are practically identical, so most of their chunks need to be stored only once.
How does data deduplication help with backup targets?
Much of the data stored in each backup is identical to data already backed up, so successive backups share most of their chunks.
What is DDPEval?
The Data Deduplication Savings Evaluation tool, which evaluates the potential for optimization against directly connected volumes and mapped or unmapped network shares.