Module 10: Data Protection (Data Backup) Flashcards
What is a backup?
an additional copy of production data, created solely to recover from data loss or corruption
What are the components of a backup?
backup client
backup server
storage node
backup target
What is the role of a backup client?
gathers the data that must be backed up and sends it to the storage node
also sends tracking information to the backup server
What is the role of the backup server?
manages the backup operations and maintains the backup catalog
What is the backup catalog?
contains information about the backup configuration and backup metadata
What is the backup configuration?
contains information about when to run backups, which clients need to be backed up, and so on
What does the backup metadata contain?
contains information about the backed up data
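As a rough illustration of how the catalog ties configuration and metadata together, here is a minimal sketch. All class and field names (BackupConfig, BackupRecord, BackupCatalog, schedule, target, and so on) are hypothetical and do not come from any particular backup product.

```python
from dataclasses import dataclass, field
from datetime import datetime


@dataclass
class BackupConfig:
    """When to run backups and which clients are covered (hypothetical fields)."""
    schedule: str                 # e.g. "daily 01:00"
    clients: list[str]


@dataclass
class BackupRecord:
    """Metadata about one completed backup (hypothetical fields)."""
    client: str
    started: datetime
    size_bytes: int
    target: str                   # backup device that holds the copy


@dataclass
class BackupCatalog:
    """Holds the backup configuration plus the metadata of every backup."""
    config: BackupConfig
    metadata: list[BackupRecord] = field(default_factory=list)

    def backups_for(self, client: str) -> list[BackupRecord]:
        """Return the recorded backups for one client, oldest first."""
        return sorted(
            (r for r in self.metadata if r.client == client),
            key=lambda r: r.started,
        )
```

In this sketch the backup server would own a single BackupCatalog and append a BackupRecord each time a client finishes a backup.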
What is the storage node for?
organizes the client's data and writes it to a backup device - controls one or more backup devices
What is true in most backup implementations?
the storage node and backup server run on the same system
What is a full backup?
copies all data on the production volume to a backup device
What are the pros and cons of full backups?
pro: provide faster data recovery
con: require more storage space and take longer to back up
What does the RPO (recovery point objective) determine when it comes to backups?
determines which backup copy is used to restore production data
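One way to picture this: given the times at which backup copies were taken and the point in time you need to get back to, the copy to restore is the most recent one at or before that point. The sketch below assumes that simple rule; the function name pick_restore_copy is hypothetical.

```python
from datetime import datetime


def pick_restore_copy(backup_times, recovery_point):
    """Return the most recent backup time at or before recovery_point.

    Returns None when no backup copy is old enough to satisfy the request.
    """
    candidates = [t for t in backup_times if t <= recovery_point]
    return max(candidates) if candidates else None


backups = [
    datetime(2024, 3, 1, 1, 0),
    datetime(2024, 3, 2, 1, 0),
    datetime(2024, 3, 3, 1, 0),
]
# Restoring to the evening of March 2 uses the copy taken that morning.
chosen = pick_restore_copy(backups, datetime(2024, 3, 2, 18, 0))
```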
What is an incremental backup?
copies the data that has changed since the last full or incremental backup, whichever is more recent
What is the main advantage of incremental backups?
fewer files are backed up daily allowing for shorter backup windows
What is a backup window?
the period during which a production volume is available for a backup to be performed
What is a cumulative backup?
copies the data that has changed since the last full backup
What is the advantage/disadvantage of cumulative backups?
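advantage: shorter restore times than incremental backups, because only the last full backup and the latest cumulative backup are needed
disadvantage: each cumulative backup grows until the next full backup, so it can contain more data than an incremental backup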
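The difference between full, incremental, and cumulative backups can be sketched as a choice of cutoff time, and the restore-time trade-off as the list of backup copies a restore has to read. This is a simplified model, not how any specific product works: the function names are hypothetical, times are plain numbers, and restore_chain assumes the history contains at least one full backup.

```python
def files_to_copy(mtimes, kind, last_full, last_backup):
    """Select the files a backup run copies.

    mtimes:      dict mapping path -> modification time
    kind:        "full", "incremental", or "cumulative"
    last_full:   time of the last full backup
    last_backup: time of the most recent backup of any kind
    """
    if kind == "full":
        return sorted(mtimes)
    if kind == "incremental":
        cutoff = last_backup      # changes since the last backup of any kind
    elif kind == "cumulative":
        cutoff = last_full        # changes since the last full backup
    else:
        raise ValueError(f"unknown backup kind: {kind}")
    return sorted(p for p, t in mtimes.items() if t > cutoff)


def restore_chain(history):
    """Return indices of the backups needed to restore the latest state.

    history is a chronological list of backup kinds and must contain
    at least one "full" entry.
    """
    last_full = max(i for i, k in enumerate(history) if k == "full")
    chain = [last_full]
    start = last_full + 1
    cumulative = [i for i in range(start, len(history)) if history[i] == "cumulative"]
    if cumulative:
        # The latest cumulative copy replaces everything between it and the full.
        chain.append(cumulative[-1])
        start = cumulative[-1] + 1
    chain.extend(i for i in range(start, len(history)) if history[i] == "incremental")
    return chain
```

With a full backup followed by three incrementals, restore_chain returns all four copies; with a full backup followed by three cumulative backups, it returns only the full backup and the last cumulative one.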
What is agent based backup?
an agent or client is installed on a VM or physical compute system - the agent streams the backup data to the backup device
What is the effect of agent based backup?
impacts performance of applications running on compute systems
agent running on compute system consumes CPU cycles and memory resources
What is image based backup?
makes a copy of the VM disk and configuration that is associated with a VM - backup saved as a single entity called a VM image
What can the backup software do in image based backups?
send a request to the VM management server to create a snapshot of the VMs to be backed up and mount it on a proxy server
How is an image based backup performed?
the proxy server performs the backup using the snapshot
When is an image based backup used?
to restore an entire VM after a hardware failure or human error
What is the advantage of image based backup over agent based?
no agent runs on the VM, so the backup does not consume the VM's CPU cycles or memory - the proxy server does the work
What are the advantages of cloud backups?
reduces management overhead
provides backup services on demand
allows data to be recovered from anywhere
What is key for cloud backups to work?
bandwidth
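To see why bandwidth matters, a back-of-the-envelope estimate of how long a backup takes to reach the cloud is useful. The sketch below is only an approximation: the function name transfer_hours is hypothetical, gigabytes are treated as decimal (10^9 bytes), and the 0.8 efficiency factor is an assumed allowance for protocol overhead, not a measured value.

```python
def transfer_hours(data_gb, link_mbps, efficiency=0.8):
    """Estimate the hours needed to send data_gb gigabytes over a link.

    link_mbps is the nominal link speed in megabits per second;
    efficiency is the assumed usable fraction of that speed.
    """
    if link_mbps <= 0 or not 0 < efficiency <= 1:
        raise ValueError("link_mbps must be > 0 and efficiency in (0, 1]")
    bits = data_gb * 8 * 1000**3                  # decimal gigabytes to bits
    seconds = bits / (link_mbps * 1_000_000 * efficiency)
    return seconds / 3600


# 500 GB over a 100 Mbps link at 80% efficiency: roughly 13.9 hours.
hours = transfer_hours(500, 100)
```

An estimate like this can be compared against the backup window to judge whether a given link is fast enough.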
What is a backup client?
piece of software installed on a host that is going to be backed up
What is the main job of the backup client?
generates safe sets (the data you want backed up) and transports them over the LAN to the storage node
What is the storage node?
a layer of software that resides on a host with physical connectivity to the media
What is the main goal of the storage node?
to control the media, including all read and write access
What is the storage node's relationship to safe sets?
packs them together and sends them to the media
What is multiplexing?
instead of writing one safe set at a time, the storage node packs several together and sends them to the media at the same time
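Multiplexing can be pictured as interleaving chunks from several safe sets into one stream headed for the media, with each chunk tagged so it can be separated again on restore. This is a minimal round-robin sketch; the function names and the (name, chunk) tagging scheme are assumptions for illustration, not the on-media format of any real product.

```python
from itertools import zip_longest

_DONE = object()  # sentinel marking a safe set that has run out of chunks


def multiplex(safe_sets):
    """Interleave chunks from several safe sets into one stream.

    safe_sets maps a safe set name to its list of chunks.
    Yields (name, chunk) pairs in round-robin order.
    """
    names = list(safe_sets)
    rows = zip_longest(*(safe_sets[n] for n in names), fillvalue=_DONE)
    for row in rows:
        for name, chunk in zip(names, row):
            if chunk is not _DONE:
                yield name, chunk


def demultiplex(stream):
    """Rebuild each safe set from a stream of (name, chunk) pairs."""
    out = {}
    for name, chunk in stream:
        out.setdefault(name, []).append(chunk)
    return out


sets = {"db": [b"d1", b"d2", b"d3"], "web": [b"w1"]}
stream = list(multiplex(sets))
# stream is [("db", b"d1"), ("web", b"w1"), ("db", b"d2"), ("db", b"d3")]
```

Interleaving keeps the media busy even when an individual client sends data slowly, at the cost of having to read past other safe sets' chunks during a restore.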
Where does the client get installed?
wherever a storage node is placed
What is the server layer?
software installed on a host that manages the whole backup process
What does the server layer manage?
tells clients when and where to back up data
tells the storage node which media to use
keeps track of all the backup catalogs
What is metadata?
data about the data