SnapRAID is a backup program for JBOD disk arrays. SnapRAID stores data parity information which enables the recovery of disk failures. SnapRAID is targeted toward home media centers, with a lot of large files that rarely change. However, SnapRAID works well in similar environments were data stores are largely static.
Beside the ability to recover from disk failures, other features of SnapRAID are:
While this OMV plugin makes the SnapRAID package easy to integrate into openmediavault, the SnapRAID package itself was created by a third party. See the project / author's → web page for more detailed information and the SnapRAID forum for technical support.
To explain how SnapRAID works, a comparison to RAID5 may be helpfull.
SnapRAID is in between RAID and a Backup program, trying to get the best benefits of both. During normal operation SnapRAID does not affect data in any way.
Features:
Traditional RAID5 stripes data and interleaves parity information across multiple drives.
Features:
In OMV7's GUI:
Under System, Plugins, find and highlight openmediavault-snapraid 7.X, and click the install button.
Under Services, SnapRAID, Arrays, click the Create button :.
In the Name field, specify a name for the new array and Save.
Note | |
Naming a SnapRAID array is a new feature in OMV7 which allows for the creation of independent arrays. However, this feature is not intended for the typical home use case. Multiple SnapRAID arrays are not recommended for users who are unfamiliar with SnapRAID. |
Under Services, SnapRAID, Drives, click the Create button :
In the Array field: Select the name of the previously created array.
In the Drive field: Select the drop down menu button on the right. Select the first drive to be protected by SnapRAID.
In the drive Name field: Provide a name for the drive.
Check the boxes for Content and Data
Info Note | |
Since at least one good copy of the Content File is required for a full drive restoration, it's best to have a copy of the Content File on all protected drives. |
Again, the parity disk must be the same size, or larger, than the largest drive in the collection of disks to be protected. This is necessary for storing parity data.
In the Array field: Select the name of the previously created array.
In the Drive field: Select the drop down menu button on the right. Select the drive that will be the SnapRAID parity disk.
In the Name field: Provide a name for the parity drive.
Check the box for Parity.
Note | |
Split Parity is a new SnapRAID feature that is supported in OMV7: - While the rule, "the parity drive must equal or exceed the size of the largest protected data drive" still applies; split parity allows combing smaller drives to create what appears to be a single large parity drive. - New users should note that split parity is an advanced feature that increases admin complexity. Split parity configurations are not recommended for users who are not familiar with SnapRAID. |
Warning | |
Parity drive(s) are reserved exclusively for SnapRAID's use. Attempting to use a Parity drive for any other purpose will compromise SnapRAID's protection and result in bizarre behavior. |
The functions of SnapRAID are supported after the first running of the sync
command. Depending on the amount of data present on protected disks, the initial sync operation may take considerable time. Potentially, it may take hours. If little or no data is present on a protected disk, or set of disks, the sync
operation will be near instant.
Under Services, SnapRAID, Arrays, select the Tools Icon . Click on Sync.
A window will pop up that will show the progress of the Sync operation.
Note, during the very first Sync, a warning may be in the window “content file not found”. This is normal, at the end of the Sync, content files will be created.
Depending on the amount of data, the Sync operation may take considerable time. When the Sync is done, all files on protected disks will be logged into content files along with their checksums, timestamps, parity data and other information. When the command completes, END OF LINE will be displayed. At this point data on protected disks is SAFE.
The remainder of the tools, under the tools icon, can be used for manual operations within the GUI. Some of these tools are discussed in manual operations.
There are two methods of SnapRAID Administration and maintenance; Manual and Automated. This guide will focus, primarily, on using the automation provided by this plugin for managing SnapRAID. However, brief explanations of the manual tools available will be helpful in understanding processes that are automated.
The basic order of SnapRAID maintenance operations is; Diff, Sync and Scrub.
The Diff plugin tool is found under Services, SnapRAID, Arrays, under the Tools Icon.
(The CLI command is snapraid diff
)
Diff (short for “differences”) is the first tool used to check protected disks. The Diff tool tallies and displays files added, deleted, modified, etc., since the last sync operation. This information requires administrators to decide, whether or not, the next sync operation should take place. Why? After a sync operation, parity and content information for added, changed or deleted files is reset. Once this information has been reset, changed or deleted files before the last sync operation can not be recovered.
In accordance with the above, it's important to determine the following:
- Were there excessive deletes?
In normal data operations a hand full of user deletes are expected. In some cases, where media servers or downloaders are involve, the number of automated deletes may be higher. Excessive deletes may indicate a data disaster.
- Were there an excessive number of updated or modified files?
In most cases, administrators will have a rough idea of what is normal for updated or modified files. As an example, a huge number of file modifications may indicate a ransomware attack.
In either case, if there are excessive deletes or an excessive number of updated / modified files, Diff settings within this plugin can be used to stop an automated sync operation, allowing for the recovery of deleted or modified files. (For recovery information see the → Fix tool.)
The Sync plugin tool is found under Services, SnapRAID, Arrays, under the Tools Icon .
(The CLI command is snapraid sync
)
After the initial sync, subsequent sync operations log only new or changed file information into content file(s). Sync also creates new checksums and updates parity information for the same files. Sync does not change checksums or parity information for unaltered files from previous sync operations. After the initial operation, with a small number of added or changed files, sync tends to run quickly.
Sync considerations:
The Scrub plugin tool is found under Services, SnapRAID, Arrays, under the Tools Icon .
(The CLI command is snapraid scrub
)
Scrub uses file information and their checksums, to check for the presence and health of files and to detect bad blocks.
If bad blocks are found, during the Scrub, SnapRAID status will list them. (In the GUI, SnapRAID status is found under Services, SnapRAID, Arrays, under the Information icon.
The Fix command, executed on the command line, will repair bad blocks.
Use snapraid -e fix
to recover them.
Then use snapraid -p bad scrub
The Fix plugin tool is found under Services, SnapRAID, Arrays, under the Tools Icon .
(The CLI command is snapraid fix
)
If files are missing, that were not intentionally deleted, use the Fix tool to recover them.
If using the plugin's Fix tool for undeleting files or for returning files to their previous state, realize that using the tool is a “shotgun” approach. The Fix tool will restore ALL deleted files and restore ALL changed files to their previous state, as of the last Sync operation. (However, when in doubt, this may to be the safest approach.)
For guidance on using the Fix command more selectively on the command line, on a “per file” basis, see the → SnapRAID Faq for details.
Automation of SnapRAID housekeeping is done with what is known as a “Diff Script”.
In a Diff script, the first command Diff, checks primarily for deleted, changed or added files. Inputs to this script set thresholds on what is (or is not) allowed before running the next command.
The second command, Sync, catalogues new files and assigns checksums and creates parity information for them. Sync also updates checksums and parity information for changed files.
Finally the third command, Scrub, is run to check the health of a specified percentage of existing files.
This plugin provides Diff script functionality, with user provided inputs, that can be varied according to the user's use case. After commands are run, the plugin will provide the user with an e-mail of the outputs from each command.
Under, Services, SnapRAID, Settings:
If either of the above thresholds are exceeded, a Sync and Scrub will not be performed. The Diff script will halt these operations and an E-mail will be sent to the user admin advising of the result.
(As noted in the GUI, if these thresholds are set to 0, Sync and Scrub will be performed regardless.)
In the screen shown above, click on Schedule Diff. The Scheduled Diff window will appear as follows.
This example is configured as follows:
Under Time of execution:
A consideration, when running the Diff Script, is to prevent file changes, additions or deletions, between a Sync operation and the subsequent Scrub operation. If file changes occur during these operations, SnapRAID errors will result. To prevent these errors from occurring, it's best to automate the Diff Script and run it after-hours when users are unlikely to add or change files.
When it comes to speed of operations, in most use cases, Diff and Sync will be fast. However, depending on the scrub percentage chosen, the total amount of data on the collection of protected disks, the speed of protected disks and other factors, a Scrub may take several hours. If large amounts of data exist within a SnapRAID array, to keep the Scrub operation within a reasonable period of time, it may be necessary to reduce the percentage scrubbed. As an anecdotal example, if it takes 10 hours to scrub 50% of an array, it will take 5 hours to scrub 25% of the array. Adjusting the percentage scrubbed and the interval in Scrub Frequency may be necessary to keep a Scrub within an after hours window.
snapraid touch
If files are detected with “zero sub-second timestamps”, the command snapraid touch
will correct this.
sync –force-full
If parity issues with the parity drive are persistent and the user admin is reasonably sure there are no data issues, the command sync –force-full
may correct the problem. Since this command is the rough equivalent of the a “first sync”, depending on the amount of data on protected disks, it may take several hours to complete.
Recovery operation examples for single files, missing files, etc., are provided in the → SnapRAID FAQ page.
One of the more desirable features of SnapRAID is it's ability to restore data to a replacement drive. There are, however, many factors to consider when dealing with a failing hard drive.
Contrary to popular belief, spinning drives rarely fail “all at once” as if turning off a light switch off. In many cases, drives will begin to reallocate sectors among other symptoms that indicate that they are beginning to fail. If these symptoms are ignored, the drive may begin to corrupt data that is stored on it or written to it.
Configuring → Server Notifications and configuring → SMART, as recommend, will go a long way toward providing as much notice as possible that a hard drive is begining to fail. If user / admins receive SMART advisory E-mails regarding a particular drive, take note of the → SMART attributes that are associated with drive failure. In addition, the SnapRAID plugin has a “drive failure probability” indicator. It can be found under; Services, SnapRAID, Drives. Under the Info button select SMART from the drop down menu.
When it has been determined that a drive is beginning to fail, it is crucial that user / admins DO NOT run the Diff Script OR a manual Sync operation.
Replacing a failing or failed drive, requires a number of preliminary steps:
In the following scenario, a SnapRAID protected drive has failed completely.
When server notifications are → configured as recommended , user / admins are notified by e-mail of file system failures and other issues.
Under Storage, Filesystems:
Click the “Create and Mount a File System” button :.
In this example case, the file system selected from the pop-down will be EXT4.
In the Device * pop-down, /dev/sde is selected.
Click the Save button.
The following Mount window will be immediately presented.
In the File system * field, click the pop-down arrow and select the previously formatted drive (/dev/sde) to be mounted.
In the same window (Services, SnapRAID, Drives) select the Tools icon :, and select Fix from the pop-down menu list.
Depending on the size and speed of the drive and the amount of data, the Fix command may run for several hours.
When END OF LINE is displayed, the Fix operation is complete.
Under, Services, SnapRAID, Drives, select the Tools icon . In the pop-down menu select Sync. This will update the content file with the new drive's UUID. (This is noted in the Sync output window.)
This concludes data restoration to a SnapRAID protected drive.
MergerFS users skip see the section below titled; Replacing a Failed Drive in MergerFS.
—-
The remainder of tasks, within OMV, would be to repointing Shared Folders and other services to the replacement drive. To repoint shared folders, see the guidance provided → here.
When all references to the failed drive have been redirected to the new drive, or removed, under Storage, File Systems, the “Referenced” check mark should be removed. Highlight the missing drive, unmount it and Apply the change.
Finally, re-enable all automated tasks (Diff script) enable SMB network sharing (if previously disabled) and inform users that shares are available.
Note | |
Device names, dev/sda1, dev/sdb1, etc., may be reordered when a device goes "missing". Device name designations are dependent on BIOS and are subject to change after a reboot. |
The result now shows mount points by UUID (Universally Unique IDentifier). Where device names may be reordered by BIOS or UEFI on bootup, UUID's do not change.
Note the “Copy and Paste” Icon at the end of each drive UUID. This tool copies exact drive paths and UUID's to the clipboard.
Under Storage, mergerfs, Highlight the MergerFS array click the Edit icon .
Under Storage, File Systems, the missing drive should no longer be “Referenced”. Highlight the missing drive, unmount it and Apply the change.
Finally, re-enable all automated tasks and enable network shares (if previously disabled.)
This concludes adding a SnapRAID restored drive to a MergerFS array.
We, who support the openmediavault project, hope you’ve found this guide to be useful and that you’ll find your openmediavault server to be efficient, easy to use, and enjoyable.
If you found this plugin guide to be helpful, please consider a modest donation to support the hosting costs of this server.
OMV-Extras.org