Evaluation guide for the dedupable storage backend #2062
pstorz
started this conversation in
Show and tell
@pstorz I have seen some references saying that the "Maximum Network Buffer Size" option is important. Is this true?
One more question: are there any benefits to activating compression in filesets if compression is already on for the ZFS volumes? Or are there drawbacks?
Evaluation Guide for the Bareos Dedupable Backend
Prerequisites
This guide assumes that you know how to set up Bareos, know how to configure and use multiple
storage devices and have successfully backed up to and restored from different storages.
This example works on the default configuration that is automatically installed when setting up Bareos.
To take advantage of the dedupable backend, you need deduplicating storage such as ZFS or VDO.
This guide will use ZFS.
Create a deduping ZFS mountpoint and allow the user "bareos" to write to it
zfs create tank/deduping_zfs
zfs set dedup=on tank/deduping_zfs
chown bareos:bareos /tank/deduping_zfs
Installing the dedupable backend
Install the package bareos-storage-dedupable
Configure the dedupable backend
Configure the Device in the Storage Daemon
# copy the example device configuration
$ cp -a /etc/bareos/bareos-sd.d/device/Dedupable.conf.example /etc/bareos/bareos-sd.d/device/Dedupable.conf
Adapt the configuration to your environment and alter the settings for the block size and the Archive Device.
In case of ZFS, the default block size is 128k.
Change the Archive Device to match your mountpoint of the deduplicating storage.
Caution
It is very important that the block size of the dedup device matches the block size of the underlying storage to achieve optimal deduplication rates.
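Why the match matters can be sketched with a toy model. A deduplicating store compares whole blocks at fixed boundaries, so identical data only deduplicates when it lands on the same boundaries each time. The Python sketch below is an illustration only: `count_records`, its zero-padding layout and the 300 KiB payload are hypothetical and do not describe how Bareos or ZFS actually lay out data.

```python
import hashlib

RECORD = 128 * 1024  # assumed storage block size (the ZFS default named above)

def align_up(length: int, block: int) -> int:
    """Round length up to the next multiple of block."""
    return -(-length // block) * block

def count_records(payload: bytes, copies: int, device_block: int):
    """Hypothetical layout: each copy is zero-padded to a multiple of
    device_block. Returns (total, unique) RECORD-sized records."""
    padded = payload.ljust(align_up(len(payload), device_block), b"\0")
    image = padded * copies
    records = [image[i:i + RECORD] for i in range(0, len(image), RECORD)]
    unique = {hashlib.sha256(r).digest() for r in records}
    return len(records), len(unique)

# 300 KiB of non-repeating bytes standing in for backup data
payload = b"".join(hashlib.sha256(i.to_bytes(4, "big")).digest() for i in range(9600))

print(count_records(payload, 2, 128 * 1024))  # (6, 3): the second copy deduplicates fully
print(count_records(payload, 2, 4 * 1024))    # (5, 5): the second copy starts mid-record
```

In this model, writing in 128 KiB units keeps every copy on record boundaries, so the second copy adds no new records. Writing in 4 KiB units lets the second copy start in the middle of a record, and none of its records match the first copy.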
Configure the Storage in the Director
# copy the example storage definition
$ cp -a /etc/bareos/bareos-dir.d/storage/Dedupable.conf.example /etc/bareos/bareos-dir.d/storage/Dedupable.conf
Adapt the configuration to your environment. Set Address and Password as in File.conf.
Restart director and storage daemons
Verify that the dedupable device is available
Make a first full test backup
*run job=backup-bareos-fd storage=Dedupable level=Full
messages
.....
FD termination status: OK
SD termination status: OK
Bareos binary info: Bareos pre-release (UNSUPPORTED): Get professional support from https://www.bareos.com
Job triggered by: User
Termination: Backup OK
Check the contents of the dedupable storage
Check the dedup status in ZFS
Make a second full test backup
*run job=backup-bareos-fd storage=Dedupable level=Full
messages
.....
FD termination status: OK
SD termination status: OK
Bareos binary info: Bareos pre-release (UNSUPPORTED): Get professional support from https://www.bareos.com
Job triggered by: User
Termination: Backup OK
Check the dedup status in ZFS
Results of the test backups
The dedup factor is 1.61x for the fileset being backed up twice.
The second backup only occupies 24.7M instead of 47.6M, which is roughly 52% of the original size.
Real-world backups probably have a higher deduplication potential than the small files of the SelfTest fileset.
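The percentage quoted for the second backup can be checked with a few lines of Python. The two sizes are taken from the results above; the variable names are only illustrative.

```python
full_backup_mib = 47.6    # size of one full backup, from the results above
second_backup_mib = 24.7  # space the second full backup occupies

share = second_backup_mib / full_backup_mib
print(f"second backup occupies {share:.0%} of a full copy")  # 52%
print(f"space saved on the second backup: {1 - share:.0%}")  # 48%
```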
Estimate the deduplication potential of your existing backups.
bdedupestimate is a tool which reads your existing volumes and estimates their deduplication potential for different block sizes.
To show how bdedupestimate works, we back up the test fileset twice to the conventional FileStorage device and then analyze it:
Run bdedupestimate
As you can see, the estimated Dedup Factor matches the ZFS reality quite well.
Smaller block sizes usually result in a better deduplication ratio. VDO, for example, can work with 4K blocks:
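The effect of the block size can also be illustrated with a small Python sketch. It is not how bdedupestimate is implemented and it does not read Bareos volumes. The `dedup_factor` helper and the synthetic data are assumptions made for this example: two copies of 1 MiB of data, where the second copy has a single byte changed in four places.

```python
import hashlib

def dedup_factor(data: bytes, block_size: int) -> float:
    """Total fixed-size blocks divided by distinct blocks (compared by SHA-256)."""
    blocks = [data[i:i + block_size] for i in range(0, len(data), block_size)]
    unique = {hashlib.sha256(b).digest() for b in blocks}
    return len(blocks) / len(unique)

# 1 MiB of non-repeating bytes standing in for a first backup
first = b"".join(hashlib.sha256(i.to_bytes(4, "big")).digest() for i in range(32768))

# The "second backup": same data with one byte changed in four places
second = bytearray(first)
for offset in (0, 256 * 1024, 512 * 1024, 768 * 1024):
    second[offset] ^= 0xFF

volume = first + bytes(second)

for size in (4 * 1024, 128 * 1024):
    print(f"{size // 1024:>3}K blocks: {dedup_factor(volume, size):.2f}x")
# prints:
#   4K blocks: 1.97x
# 128K blocks: 1.33x
```

Each changed byte invalidates the whole block that contains it. With 128K blocks, four changes cost four full 128K blocks. With 4K blocks, the same changes cost only four 4K blocks, so the factor stays close to 2x.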
Summary
With the bdedupestimate tool, existing backups can be analyzed to estimate the achievable deduplication rate even before using the dedupable backend.