MergerFS is a union filesystem geared towards simplifying storage and management of files across numerous commodity storage devices. It is similar to mhddfs, unionfs, and aufs. MergerFS is a good choice for combining a collection of hard drives when other pooling techniques (such as RAID5) should not be used. MergerFS works well with SBCs (Single Board Computers) and other platforms that use USB-connected hard drives. MergerFS can be added or removed at any time, without a loss of data. If a member of the merged drive pool fails, data saved to the pool's remaining member drives will be unaffected.
MergerFS, installed on the root of member drives (Option 1 below), should not be used for relational database storage, where latency can impact performance, or for Docker, where a custom version of overlayfs is used. In such cases, look at Option 2, where SQL databases and Docker can be accommodated outside of the Shared Folders that are dedicated to MergerFS.
While this OMV plugin makes the MergerFS package easy to integrate into openmediavault, the MergerFS package itself was created by a third party. See the MergerFS project / author's → web page for more detailed information and support.
In OMV6's GUI:
Under System, Plugins, find and highlight openmediavault-mergerfs 6.X, and click the install button.
Before proceeding with the installation, a decision on the storage policy to be used should be made. While there are many storage policies to choose from, for the purposes of this document, the default “Existing path - Most Free Space” and “Most Free Space” will be explained. Explanations for additional storage policy options are available → here.
Existing Path - Most Free Space contains two directives. The first directive, Existing Path, will direct files to a preexisting folder. For example, if the folder Videos exists on one drive in the merged array, all new files that are saved to Videos will be saved to the same drive, i.e. to the Existing Path. If files are saved to a new folder, Documents for example, the second directive takes control. The new folder Documents will be created on the drive with the Most Free Space. Files saved to Documents, thereafter, will be saved only to the drive with the Documents folder. This behavior is known as “path preserving”, where paths, and the files that are saved to them, exist on one drive.
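The two directives can be sketched in a few lines of Python. This is a simplified model of the behavior described above, not MergerFS code. The function name, the drive names, and the free space figures are all hypothetical and are used for illustration only.

```python
from typing import Dict, Set


def choose_drive_epmfs(folder: str,
                       free_space: Dict[str, int],
                       folders_on: Dict[str, Set[str]]) -> str:
    """Pick a drive for a new file under 'Existing Path - Most Free Space'.

    Simplified model: free_space maps a (hypothetical) drive name to its
    free space in GB, folders_on maps a drive name to the folders it holds.
    """
    # Directive 1 (Existing Path): only drives that already hold the folder qualify.
    candidates = [d for d in free_space if folder in folders_on[d]]
    # Directive 2 (Most Free Space): if the folder is new, every drive qualifies.
    if not candidates:
        candidates = list(free_space)
    # Among the qualifying drives, take the one with the most free space.
    chosen = max(candidates, key=lambda d: free_space[d])
    # Record the folder so later files follow the now existing path.
    folders_on[chosen].add(folder)
    return chosen


# Hypothetical pool: Videos already exists on disk1 only.
free_gb = {"disk1": 100, "disk2": 400, "disk3": 250}
folders = {"disk1": {"Videos"}, "disk2": set(), "disk3": set()}

videos_target = choose_drive_epmfs("Videos", free_gb, folders)
documents_target = choose_drive_epmfs("Documents", free_gb, folders)
documents_again = choose_drive_epmfs("Documents", free_gb, folders)
print(videos_target, documents_target, documents_again)
```

In this model, Videos stays on disk1 even though disk1 has the least free space, while the new Documents folder lands on disk2 and remains there for later files.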
Note, in the following, the distribution of paths among member drives. Once a new folder is created, on a drive with the Most Free Space, a path is established. According to the first directive all subsequently saved files will go to the Existing Path.
The issue that the Existing Path directive can create is due to the difference in file sizes. When compared to documents or even picture files, music and video files can be massive, ranging from hundreds of megabytes to several gigabytes. If users increasingly add large video files to a MergerFS array, the Existing Path directive can easily fill one drive of the array completely. At that point, the array will stop accepting files destined for the full drive.
This issue can be corrected using the Balance Pool Tool which will redistribute data among the array's member drives.
Note for users who may test MergerFS using small virtual drives:
The MergerFS balance tool balances drives to within 2% fill of all member drives. Populating tiny 8 to 10GB virtual drives with video files that may be as large as 1 or 2GB each (which is roughly 10 to 20% of a member drive) may trigger "ping pong" file copying between drives. Filling tiny drives with huge files makes the "within 2%" balance goal of the algorithm unachievable. When testing MergerFS using small virtual drives, use small files such as documents, pictures, or small audio files.
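The "ping pong" effect can be illustrated with a small simulation. The following is a rough sketch only. It assumes a balancer that repeatedly moves one file from the fullest drive to the emptiest drive until the fill difference is within the tolerance. The function name, the move limit, and the file sizes are hypothetical, and the real balance tool may choose files differently.

```python
from typing import Dict, List, Tuple


def balance(drives: Dict[str, List[int]], capacity_mb: int,
            tolerance_pct: float = 2.0, max_moves: int = 100) -> Tuple[int, bool]:
    """Move files from the fullest to the emptiest drive.

    Returns (moves made, True if the pool settled within the tolerance).
    Simplified model: all drives share one capacity, sizes are in MB.
    """
    moves = 0
    while moves < max_moves:
        fill = {name: 100.0 * sum(files) / capacity_mb
                for name, files in drives.items()}
        fullest = max(fill, key=fill.get)
        emptiest = min(fill, key=fill.get)
        if fill[fullest] - fill[emptiest] <= tolerance_pct:
            return moves, True
        # Move one file across and measure again.
        drives[emptiest].append(drives[fullest].pop())
        moves += 1
    return moves, False  # gave up: files kept bouncing between drives


# Two 10GB virtual drives holding three 2GB video files (20% of a drive each).
big_moves, big_settled = balance({"a": [2000, 2000, 2000], "b": []}, 10_000)

# The same drives holding sixty 50MB files (0.5% of a drive each).
small_moves, small_settled = balance({"a": [50] * 60, "b": []}, 10_000)

print(big_settled, small_settled)  # False True
```

With 2GB files, every move shifts the fill gap by far more than 2%, so the gap never closes and the sketch stops at its move limit. With 50MB files, the gap shrinks in small steps and the pool settles.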
Using Entire Drives:
The typical setup of MergerFS is “at the root” of each member drive. Setting up in this manner dedicates each member drive to MergerFS. All folders and files on each drive are under MergerFS control and are subject to file distribution between drives to balance the MergerFS array. In the event that the balance tool is used, any number of files and folders may be moved between member drives to achieve the “within 2%” of drive space balance objective.
Using Root Folders:
This option creates shared folders at the root of member drives. Drives with these folders will aggregate files and folders, within MergerFS dedicated Shared Folders, in the exact same manner as the Option 1 - “Full drive” method. The difference, and the advantage, is that files and folders outside of MergerFS dedicated folders are not under MergerFS' influence.
Root Folder Advantages:
First, it should be noted that using a folder, versus the root of the drive, is not a limitation. All of the physical space available on a hard drive can be used whether data is stored in a designated folder or at the root of the drive. However, with MergerFS storage activity contained within a dedicated root folder, on each member drive, drive locations outside of MergerFS folders can be used for SQL databases or for Docker Storage.
Simply create or save Docker data or SQL databases at the root of a hard drive, or in a Shared Folder that is outside of the MergerFS shared folder. While MergerFS cannot move data outside of its own folders, continuing file and folder growth will be saved “around” SQL databases and Docker data, thereby balancing storage across drives in the array. Further, the MergerFS balance tool will have no effect on data that is outside of MergerFS' folders.
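To picture how Option 2 separates pooled data from everything else, the following sketch builds three throwaway "drives" in a temporary directory and computes the union of their dedicated folders. It is an illustration only and does not involve MergerFS itself. The folder names (Merg_SF1 to Merg_SF3, docker) and the file names are hypothetical.

```python
import tempfile
from pathlib import Path
from typing import List, Set


def merged_view(branches: List[Path]) -> Set[str]:
    """Return every relative path visible through the pool (union of all branches)."""
    seen = set()
    for branch in branches:
        for path in branch.rglob("*"):
            seen.add(path.relative_to(branch).as_posix())
    return seen


with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    branches = []
    # One dedicated folder per simulated drive.
    for n in (1, 2, 3):
        branch = root / f"drive{n}" / f"Merg_SF{n}"
        branch.mkdir(parents=True)
        branches.append(branch)

    # Pooled data, spread over two of the dedicated folders.
    (branches[0] / "Videos").mkdir()
    (branches[0] / "Videos" / "movie.mkv").write_text("video")
    (branches[1] / "Documents").mkdir()
    (branches[1] / "Documents" / "notes.txt").write_text("doc")

    # Data stored on drive3, but outside of its dedicated folder.
    (root / "drive3" / "docker").mkdir()
    (root / "drive3" / "docker" / "data.db").write_text("db")

    pool = merged_view(branches)

print(sorted(pool))
```

The pooled view lists Videos and Documents as if they were on one drive, while the docker folder on drive3 does not appear in it at all.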
When considering which of the two set up options to use, read over the information in Notes and File Systems.
For users interested in Option 1, see Option 1 following.
For those who are interested in Option 2, follow this link → Setting up Option 2.
For demonstration purposes, setting up Option 1 will focus on pooling three hard drives.
- The Label is Merger1
- Note that the Device Name and the Mount Point are the same.
- Under Type, which ordinarily displays the file format type of a Block Device, the Type is FUSE.MERGERFS.
- Note the “copy icon” next to the Mount Point. The copy icon copies the exact Mount Point path to the client workstation's clip board. This may be useful for command line operations.
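As one example of a command line operation that uses a copied Mount Point path, the short Python sketch below reports how full each given path is, using the standard library's shutil.disk_usage. The function name is hypothetical, and "." is only a placeholder that should be replaced with the path copied from the GUI.

```python
import shutil
from typing import Dict, List


def usage_report(paths: List[str]) -> Dict[str, float]:
    """Return the percentage of used space for the filesystem behind each path."""
    report = {}
    for path in paths:
        usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
        report[path] = round(100.0 * usage.used / usage.total, 1)
    return report


# Placeholder: replace "." with the Mount Point path pasted from the clipboard.
report = usage_report(["."])
for path, percent_used in report.items():
    print(f"{path}: {percent_used}% used")
```

Passing the mount points of the individual member drives as well would show how evenly data is spread across the pool.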
The “Merged Drive”, Merger1, contains the data that was on /dev/sdc1 along with the free space of the two additional drives. This is the main feature of MergerFS. It can be used to add new drives, as needed, to drives that have existing data, easily creating more storage space.
As noted prior, in the merged drive (Merger1) existing data is contained on /dev/sdc1 and the drive is nearly full. When Merger1 was set up, the Most Free Space storage policy was used, which distributes new files and folders evenly among member drives. However, this policy will not retroactively balance existing data among the member drives. Most Free Space will simply send new data to the drives that have less data on them. This may work well enough if an existing drive, or drives, are not at the edge of their capacity (where they may be triggering alarms).
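The following sketch models that behavior. It is a simplified illustration, not MergerFS code. The function name, the names of the two added drives, and all sizes are hypothetical.

```python
from typing import Dict, List


def write_files_mfs(free_gb: Dict[str, int], file_sizes_gb: List[int]) -> Dict[str, int]:
    """Place each new file on the drive with the most free space.

    Updates free_gb in place and returns the GB newly written to each drive.
    """
    written = {drive: 0 for drive in free_gb}
    for size in file_sizes_gb:
        target = max(free_gb, key=free_gb.get)
        free_gb[target] -= size
        written[target] += size
    return written


# sdc1 is nearly full with existing data; the two added drives are empty.
free = {"sdc1": 20, "new_drive_a": 1000, "new_drive_b": 1000}
written = write_files_mfs(free, [50] * 10)  # ten new 50GB files
print(written)
print(free)
```

All of the new data goes to the two added drives, and they fill evenly. The nearly full drive receives nothing new, but nothing is moved off of it either, so it stays nearly full until data is redistributed.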
If needed or desired, to even out storage among member drives, the Balance Pool tool can be used.
With the MergerFS array set up, continue on to → additional tools.
For demonstration purposes, setting up Option 2 will focus on pooling three separate hard drives by creating three Shared Folders for MergerFS use (one on each drive).
The Mount Point column is not a default. Optionally, this column can be added by clicking on the grid icon and selecting Mount Point. The Mount Point column displays drives by UUID and their “mounted” absolute path.
Creating a Shared Folder for each member drive (/dev/sdb1, /dev/sdc1, and /dev/sdd1) is necessary.
Note: These Shared Folders will be used, exclusively, by MergerFS.
Under Storage, Shared Folders, click the Create button.
In the Name * field:
Provide a name that will easily identify the Shared Folder.
In the File System * field:
Select the first drive to be used. (In this example, it's /dev/sdb1)
The Relative path * field:
Accept the default.
The Permissions field:
Everyone read/write is recommended. Note: This won't have an effect on permissions assigned to Shared Folders created within the MergerFS drive.
Repeat this process, creating a designated Shared Folder for each drive that will be in the MergerFS array. In this example, the following Shared Folders were created, one on each of three individual drives:
Merg_SF1 - /dev/sdb1
Merg_SF2 - /dev/sdc1
Merg_SF3 - /dev/sdd1
Note the above naming convention. While the actual name is not important, it's helpful if the Shared Folder name identifies its intended use.
Name *: Merger1
File Systems: *NOT USED*
Shared Folders: Check the boxes for the three Shared Folders, created on each of the three drives, for the MergerFS array.
Create Policy: The storage policy was changed from the default to Most Free Space.
The remaining settings were left at the defaults.
- The Label is Merger1
Note that the Label column is not a default. Optionally, this column can be added by clicking on the grid icon and selecting Label.
- Under Type, which ordinarily displays the file format type of a Block Device, the Type is FUSE.MERGERFS.
- Note the “copy icon” next to the Mount Point. The copy icon copies the exact Mount Point path to the client workstation's clip board. This may be useful, later, for command line operations.
Alternately, if data does not exist:
The remaining tools are:
Since MergerFS is frequently used with SnapRAID, replacing a failed MergerFS Drive is part of the final step in recovering a MergerFS / SnapRAID array, as detailed → here.
We, who support the openmediavault project, hope you’ve found this guide to be useful and that you’ll find your openmediavault server to be efficient, easy to use, and enjoyable.
OMV-Extras.org