Open-E JovianDSS ver.1.0 up21 Release Notes


Release date: 2017-07-12

Build: 24097


New

Support for Samba vfs_fruit (Enhanced OS X and Netatalk interoperability)

Updated

Samba 4.6.4

Driver for Microsemi RAID controllers (aacraid, v. 1.2.1-52033)

Fixed

Streaming video in parallel with writing to a share causes the stream to hang

Problem with displaying a large number of disks in the GUI

Default value of ZFS_ARC_MAX does not adjust to a changed amount of RAM in the server

Date on charts does not agree with the system date

S.M.A.R.T. in logs does not work with disks in MPIO (dm devices)

"Clear errors counters" functionality does not clear counters from disks in read cache (L2ARC)

Using RAID controllers with Open-E JovianDSS Advanced Metro HA Cluster causes write errors on the RAID array in the Pool

A large number (thousands) of disks, snapshots, ZFS Volumes or Pools causes unstable system behavior, e.g. problems with Pool import

Important notes for JovianDSS HA configuration

It is necessary to use the sync=always option for zvols and datasets in a cluster

It is necessary to run the Scrub scanner after a failover triggered by a power failure (dirty system shutdown)

The HA Cluster requires the "Immediate" reboot policy to be set on both cluster nodes if I/O errors should trigger the failover. To do this, please go to System Settings -> Critical I/O errors -> Reboot procedure in the WebGUI

It is necessary to use different Server names for cluster nodes

Known issues

Performance tuning

     Sync=always for a zvol is the safest option and is set by default. However, it can decrease write performance, since all operations are written and flushed directly to persistent storage. When using sync=always, it is strongly recommended to use mirrored write log devices (devices with very fast random writes).
     The sync=standard and sync=disabled zvol options provide a huge performance improvement, but the most recent (up to 5 seconds of) cached data can be lost in case of a sudden power failure. Use these options only in environments equipped with a UPS.
     For NFS shares, the Synchronous data record option is enabled by default. This option reduces performance, but data is written safely. To improve NFS performance you can use the Asynchronous data record option, but in that case it is strongly recommended to use a UPS.
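
     For reference, the underlying ZFS property can also be inspected and set from a script. Below is a minimal sketch using the standard zfs command line tool; the pool name "Pool-0" and zvol name "vol00" are hypothetical, and on JovianDSS these options are normally managed through the WebGUI or the CLI API:

     import subprocess

     def set_sync(dataset, mode):
         # mode: "always" (safest), "standard" or "disabled" (UPS required)
         if mode not in ("always", "standard", "disabled"):
             raise ValueError("unsupported sync mode: %s" % mode)
         # Standard ZFS administration command; requires root privileges.
         subprocess.run(["zfs", "set", "sync=%s" % mode, dataset], check=True)

     def get_sync(dataset):
         # Returns the current value of the sync property.
         out = subprocess.run(["zfs", "get", "-H", "-o", "value", "sync", dataset],
                              check=True, capture_output=True, text=True)
         return out.stdout.strip()

     # Example with hypothetical names: enforce the cluster-safe setting.
     set_sync("Pool-0/vol00", "always")
     print(get_sync("Pool-0/vol00"))  # -> "always"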


Browser recommendations

     It is strongly recommended to use the Mozilla Firefox browser to navigate the system’s GUI. When using other browsers, some slight problems with displaying content may occur.


Web browser’s cache

     After updating from previous versions, some problems with WebGUI content and navigation may occur. To resolve these problems, please clear the Web browser's cache.


System as a guest in virtual environments

     Hyper-V:
     When installing the system as a Hyper-V guest, please use the following settings:
       - Number of virtual processors: 4
       - Memory: Minimum 8GB
       - Boot Disk: 20GB IDE Disk
       - Add at least 6 virtual disks

 
     VMware ESXi:
     When installing the system as a VMware ESXi guest, please use the following settings:
       - Guest OS: Other 2.6.x Linux (64-bit)
       - Number of Cores: 4
       - Memory: Minimum 8GB
       - Network Adapter: VMXNET 3
       - SCSI Controller Type: Paravirtual or LSI Logic SAS
       - Boot Disk: 20GB Thick Provision Eager Zeroed
       - Add at least 6 virtual disks
       - Edit Settings -> Options -> Advanced - General -> Configuration -> Add row: disk.EnableUUID: TRUE


Reclaim deleted blocks on thin-provisioned LUNs in various systems

     When deleting large amounts of data, reclaiming the deleted blocks on thin-provisioned LUNs in Windows 2012 can significantly slow down system performance. If you expect frequent deletions of large amounts of data, we recommend turning off the automatic reclaim function in Windows 2012. This can be done by disabling the "file-delete notification" feature in the system registry. To do so, follow the steps below:
       - Start Registry Editor.
       - Locate the following registry subkey: HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\FileSystem
       - Double-click DisableDeleteNotification.
       - In the Value data box, enter a value of 1, and then click OK.
 
     In order to reclaim the free space in Windows 2012, change the HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\FileSystem\DisableDeleteNotification key value back to 0 and use the "Optimize" tool located in Disk Management -> [disk] -> Properties -> Tools. As the operation can generate a very high load on the system, it is recommended to perform it after-hours.
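
     For convenience, the registry change above can also be scripted. The following is a minimal sketch using Python's standard winreg module (run as Administrator; the key path and value name are exactly those from the steps above):

     import winreg

     KEY_PATH = r"SYSTEM\CurrentControlSet\Control\FileSystem"
     VALUE_NAME = "DisableDeleteNotification"

     def set_delete_notification(disabled):
         # 1 = automatic reclaim off, 0 = automatic reclaim on (default).
         with winreg.OpenKey(winreg.HKEY_LOCAL_MACHINE, KEY_PATH, 0,
                             winreg.KEY_SET_VALUE) as key:
             winreg.SetValueEx(key, VALUE_NAME, 0, winreg.REG_DWORD,
                               1 if disabled else 0)

     # Turn automatic reclaim off before large deletions...
     set_delete_notification(True)
     # ...and back to 0 before running the "Optimize" tool.
     set_delete_notification(False)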
In case of VMware ESXi, the automatic reclaim feature is disabled by default. To reclaim the space of deleted blocks on thin-provisioned LUNs, please use vmkfstools. For details, please refer to the VMware Knowledge Base:

For VMware ESXi 5.0: http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=2014849
For VMware ESXi 5.5 and newer: https://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=2057513

When using Windows 2008 or Citrix Server, there is no possibility to reclaim the space released by deleted data on thin-provisioned LUNs.


Deduplication issues and recommendations

     Please be aware that deleting a zvol with deduplication enabled can generate a very high load on the system and lead to unstable behavior. It is strongly recommended to perform such an operation only after-hours. To avoid this issue, please use (if possible) a single zvol on a zpool dedicated for deduplication, and delete the whole zpool that includes that single zvol.
   
     To determine the amount of System RAM required for deduplication, use this formula:
 
          (Size of Zvol / Volume block size) * 320B / 0.75 / 0.25

     Where:
     320B - the size of one entry in the DDT table
     0.75 - the fraction of RAM reserved for the ARC (75%)
     0.25 - the fraction of the ARC that can be used by the DDT (25%)

     Example for 1TB data and 64KB Volume block size:

          (1099511627776B / 65536B) * 320B / 0.75 / 0.25 = 28633115306.67B
          28633115306.67B / 1024 / 1024 / 1024 = 26.67GB

     So for every extra 1TB of storage, the system needs an extra 26.67GB of RAM.

     Example for 1TB data and 128KB Volume block size:

          (1099511627776B / 131072B) * 320B / 0.75 / 0.25 = 14316557653.33B
          14316557653.33B / 1024 / 1024 / 1024 = 13.33GB

     So for every extra 1TB of storage, the system needs an extra 13.33GB of RAM.
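
     The calculations above can be reproduced with a short script. This is only a sketch of the formula; the 320B DDT entry size and the 0.75/0.25 reservations are taken from the note above:

     def dedup_ram_bytes(zvol_size, block_size, entry=320, arc=0.75, ddt=0.25):
         # Worst case: every block is unique and needs its own DDT entry.
         return (zvol_size / block_size) * entry / arc / ddt

     TB = 1024 ** 4
     GB = 1024 ** 3

     print(dedup_ram_bytes(1 * TB, 64 * 1024) / GB)   # -> ~26.67 (GB)
     print(dedup_ram_bytes(1 * TB, 128 * 1024) / GB)  # -> ~13.33 (GB)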

IMPORTANT: The above calculations only apply to the worst-case scenario, where the data is completely unique and will not be deduplicated. For deduplicable data, the RAM requirement decreases drastically. If an SSD-based Read Cache is present, part of the deduplication table will be moved to the SSD, and deduplication will work with good performance while using less RAM.


IMPORTANT: With SAN (iSCSI) it is CRITICAL to match the user file system's format block size with the zvol volume block size. A simple example is the Windows NTFS file system with its default format block size of 4k and the zvol default volume block size of 128k. With defaults like this, deduplication will mostly NOT match, because files can be aligned at 32 (128/4) different positions on the pool. If the NTFS format block size is increased to 64k while the zvol volume block size stays at 128k, a deduplication match can fail only once, because a file can be aligned at 2 (128/64) different positions on the pool; every subsequent write will match, as both alignment options already exist on the pool. To achieve matching for all files together with efficient memory usage, NTFS must use a 64k format block size and the zvol volume block size must equal 64k. Another option is NTFS=32k and zvol=32k, but in this case the deduplication table will be twice as large. That is why NTFS=64k and zvol=64k is the most efficient setting for deduplication.
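
     The alignment argument above reduces to a one-line calculation. A small sketch:

     def alignment_positions(zvol_block, fs_block):
         # Number of distinct offsets at which a file system block can start
         # inside one zvol block; dedup only matches identical alignments.
         return zvol_block // fs_block

     print(alignment_positions(128 * 1024, 4 * 1024))   # NTFS 4k on 128k zvol -> 32
     print(alignment_positions(128 * 1024, 64 * 1024))  # NTFS 64k on 128k zvol -> 2
     print(alignment_positions(64 * 1024, 64 * 1024))   # matched sizes -> 1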


IMPORTANT: With NAS (NFS, SMB/CIFS), deduplication matching always works, because the data blocks are aligned natively by ZFS.


IMPORTANT: Deduplication works at the pool level, across the whole pool. This is why the zvol physical size cannot show the deduplication benefit. To verify that deduplication has saved space, run a scrub and note the current physical data space on the pool reported by the scrub. Then copy new data and run the scrub again; it will show the new physical data space. Comparing the data size seen from the storage client side with the growth of the data space reported by the scrub gives the deduplication advantage. The exact deduplication ratio of the pool can be found in the logs, in zfs.log.
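
     The pool-level deduplication ratio is also exposed by standard ZFS tooling as the dedupratio pool property. A minimal sketch (the pool name "Pool-0" is hypothetical):

     import subprocess

     # Print the deduplication ratio of a pool, e.g. "1.53x".
     subprocess.run(["zpool", "get", "dedupratio", "Pool-0"], check=True)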


Zvols configuration issues and recommendations

     It is strongly recommended to set the client file system block size to the same value as the zvol volume block size. For example, when using a 64k zvol volume block size, the Windows NTFS Allocation unit size should be set to 64k.


Target number limit

     With more than 60 targets, the GUI will not be displayed correctly. This issue will be fixed in a future release.


Targets with the same name are not assigned correctly

     If two or more targets have the same name but belong to different Zpools, all targets with that name will be assigned to one Zpool during the import process.


Installation on disks containing LVM metadata

     It is not possible to install the system on disks containing LVM metadata. You will need to clear those disks before installation. To do so, use the “Remove ZFS data structures and disks partitions” function located in the Extended tools. To access this function, boot the system from a temporary medium such as a USB drive or DVD.


Import Zpool with broken write log

     It is not possible to import a Zpool with a broken write log disk using the system’s functions. This is why it is STRONGLY recommended to use mirrored disks for write logs. If it is necessary to import a Zpool with a broken write log, please contact technical support.


Replacing disks in data groups with larger ones can cause your storage license capacity to be exceeded

     When replacing damaged disks with larger ones, the size of the entire Zpool will increase. Make sure that the new size will not exceed your purchased storage license capacity.


Periodically after some operations, the GUI needs to be manually refreshed

     After performing some operations, e.g. resilvering, the GUI may show outdated information. In this case, refresh the web page manually by pressing F5 on your keyboard. This issue will be fixed in a future release.


Replacing disks in data groups with smaller ones can cause an error and make the disk disappear from the list of available disks

     Replacing a disk in a data group with a smaller one will cause the error "zpool unknown error, exit code 255", and the disk will become unavailable. To reuse this disk, please use the "Remove ZFS data structures and disks partitions" function located in the Extended tools on the Console screen.


Block sizes smaller than 64KB with deduplication or Read Cache

     Block sizes smaller than 64KB used with deduplication or Read Cache will cause very high memory consumption.


RAM recommendations for Read Cache

     To determine how much System RAM is required for Read Cache, use the following formula:

          RAM needed = (Size of Read Cache - reserved size and labels) * bytes reserved by the l2hdr structure / Volume block size

     For 8KB Volume block size and 1TB Read Cache:

          RAM needed = (1099511627776B - 4718592B) * 432B / 8192B = 57981809664B
          57981809664B / 1024 / 1024 / 1024 = 54GB

     Where:
     1099511627776B - 1TB Read Cache
     4718592B - reserved size and labels
     432B - bytes reserved by the l2hdr structure
     8192B - Volume block size

     For 64KB Volume block size and 1TB Read Cache:

          RAM needed = (1099511627776B - 4718592B) * 432B / 65536B = 7247726208B
          7247726208B / 1024 / 1024 / 1024 = 6.75GB

     For 128KB Volume block size and 1TB Read Cache:

          RAM needed = (1099511627776B - 4718592B) * 432B / 131072B = 3623863104B
          3623863104B / 1024 / 1024 / 1024 = 3.37GB
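
     The same arithmetic as a short script (a sketch only; the 4718592B reserved size and the 432B l2hdr entry size are taken from the examples above):

     def l2arc_ram_bytes(cache_size, block_size, reserved=4718592, l2hdr=432):
         # One l2hdr structure is kept in RAM for every cached block.
         return (cache_size - reserved) * l2hdr / block_size

     TB = 1024 ** 4
     GB = 1024 ** 3

     for bs in (8 * 1024, 64 * 1024, 128 * 1024):
         print("%dKB -> %.2fGB" % (bs // 1024, l2arc_ram_bytes(1 * TB, bs) / GB))
     # -> 8KB: 54.00GB, 64KB: 6.75GB, 128KB: 3.37GB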


Multiple GUI disk operations may result in an inaccurate available disks list

     Multiple operations of adding disks to and detaching disks from groups can cause a subsequent detach operation to fail while the disk is still shown on the list of available disks. Trying to add this disk to a group will then fail with the error "[zfslib-wrap-zpool-ZpoolCmdError-1] invalid vdev specification". In this case, detach the disk once again.


After removing disks from groups they may not be displayed on a list of available disks

     Sometimes after removing disks from groups, Spare/Read Cache/Write Log disks are displayed on the list of unassigned disks, but not on the list of available disks. In this case, click the rescan button located in the add group form.


Reusing disks from an exported and deleted Zpool

     After deleting an exported Zpool, not all disks which were part of the Zpool become immediately available. Before you can reuse disks which were previously used as a Spare or a Read Cache, you must first clean them. Use the “Remove ZFS data structures and disks partitions” function located in the “Extended tools”.


Negotiated speed of network interfaces may not display correctly

     For some network interfaces, the negotiated speed field may display an incorrect value in the GUI and Console. This issue will be fixed in a future release.


Limited ability to display a large number of elements in the GUI

     After creating multiple snapshots, clones or zvols, some forms in the GUI work very slowly. If you need to create many snapshots, clones or zvols, it is strongly recommended to use the CLI to perform operations on them.


Open-E VSS Hardware Provider system recommendations

     It is strongly recommended to use Windows Server 2012. On other Windows systems, the Open-E VSS Hardware Provider works unstably.


Exceeded quota on a dataset does not allow files to be removed

     Files located on datasets with an exceeded quota cannot be removed. In this case, please increase the quota and then remove the unnecessary files.


Slow WebGUI with multiple data groups

     A Zpool with more than 20 data groups causes some forms in the WebGUI to work very slowly. If you need to create many data groups, it is strongly recommended to use the CLI API.


Slow WebGUI with multiple datasets

     More than 25 datasets cause the WebGUI to work slowly.


ZFS Upgrade

     For Open-E JovianDSS users, it is recommended to upgrade Zpools to the latest ZFS file system version. Although the file system upgrade is absolutely safe for your data and takes only a few minutes, please be aware that this operation cannot be undone. To upgrade a single Zpool, please use "WebGUI -> Zpool options -> Upgrade file system" from the Zpool's options menu.
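
     For reference, the underlying ZFS operation is the standard zpool upgrade command. A minimal sketch (the pool name "Pool-0" is hypothetical; on JovianDSS, please use the WebGUI function described above):

     import subprocess

     # List pools that are not running the latest on-disk format...
     subprocess.run(["zpool", "upgrade"], check=True)
     # ...and irreversibly upgrade a single pool to the latest version.
     subprocess.run(["zpool", "upgrade", "Pool-0"], check=True)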


Intel® Ethernet Controller XL710 Family

     When using Open-E JovianDSS with the Intel® Ethernet Controller XL710 Family, it is necessary to update the network controller’s firmware to the version: f4.33.31377 a1.2 n4.42 e1932.


Motherboards with x2APIC technology

     When using a motherboard with x2APIC technology enabled, it is necessary to disable x2APIC in the BIOS. Otherwise, problems with CPU cores will occur.


NFS FSIDs and Zpool name

     One of the factors taken into account when NFS FSIDs are generated is the Zpool name. This means that when the Zpool name changes, e.g. during export and import under a different name, the FSIDs of NFS shares located on this Zpool will also change.


High Availability shared storage cluster does not work with Infiniband controllers

     Due to technical reasons, the High Availability shared storage cluster does not work properly when Infiniband controllers are used for the VIP interface configuration. This limitation will be removed in future releases.


Static routing functionality was removed

     Starting from up10, it is not possible to configure static routing in the TUI. If static routing was configured in a previous version, this configuration will be removed from the system.


Disks with LVM data cannot be used to create a Zpool

     An attempt to create a Zpool with drives that contain LVM data will fail with the following error:

"cannot open 'lvm-pv-uuid-R25lTS-kcDc-eiAN-eAlf-ppgi-rAqu-Oxy1Si': no such device in /dev must be a full path or shorthand device name"

In this case, if you want to use those disks, please use the “Remove ZFS data structures and disks partitions” function located in the “Extended tools”.


Unexpected long failover time

     In some environments, under heavy load, switching cluster resources may take too long. If the switching time exceeds the iSCSI initiator timeout, the timeout may be increased up to 255 seconds.

     When using Windows, perform the following steps to increase the iSCSI initiator timeout:

1. Run the regedit tool and find the HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\Class\...\Parameters\MaxRequestHoldTime registry key

2. Change the value of the key from the default 60 sec to 255 sec (decimal)
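
     Because the exact key path contains an instance-specific subkey (the "..." above), a script has to locate it first. Below is a hedged sketch in Python that searches the Class tree for Parameters subkeys that already contain MaxRequestHoldTime and sets them to 255; run it as Administrator and verify the keys it reports before relying on it:

     import winreg

     CLASS_ROOT = r"SYSTEM\CurrentControlSet\Control\Class"

     def set_max_request_hold_time(seconds=255):
         hklm = winreg.HKEY_LOCAL_MACHINE
         with winreg.OpenKey(hklm, CLASS_ROOT) as root:
             for i in range(winreg.QueryInfoKey(root)[0]):         # class GUIDs
                 guid = winreg.EnumKey(root, i)
                 try:
                     cls = winreg.OpenKey(root, guid)
                 except OSError:
                     continue
                 with cls:
                     for j in range(winreg.QueryInfoKey(cls)[0]):  # instances
                         inst = winreg.EnumKey(cls, j)
                         path = r"%s\%s\%s\Parameters" % (CLASS_ROOT, guid, inst)
                         try:
                             key = winreg.OpenKey(hklm, path, 0,
                                                  winreg.KEY_READ | winreg.KEY_SET_VALUE)
                         except OSError:
                             continue
                         with key:
                             try:
                                 winreg.QueryValueEx(key, "MaxRequestHoldTime")
                             except OSError:
                                 continue  # value not present under this key
                             winreg.SetValueEx(key, "MaxRequestHoldTime", 0,
                                               winreg.REG_DWORD, seconds)
                             print("updated:", path)

     set_max_request_hold_time(255)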

     When using VMware, perform the following steps to increase the iSCSI initiator timeout:

1. Select the host in the vSphere Web Client navigator

2. Go to Settings in the Manage tab

3. Under System, select Advanced System Settings

4. Choose the Misc.APDTimeout attribute and click the Edit icon

5. Change the value from the default 140 to 255 sec.


Activation may be lost after update

     In some environments, the system may require re-activation after the update to up11. This issue will be removed in future releases.


Bonding ALB does not work in Hyper-V environment

     When using JovianDSS as a Hyper-V guest, ALB bonding is not supported. Please use another bonding mode.


Continuous writing in a VMware guest can cause deleting a VMware snapshot to take a long time

     Using ODPS on a zvol/dataset with a VMware guest where many I/O operations are performed can cause the process of deleting a VMware snapshot to take a long time. Please take this into consideration when setting up the scheduler for an Off-site Data Protection task.


Enabling quota on a dataset can cause file transfer interruption

     Enabling the quota functionality on a dataset can interrupt file transfers. Before using it in a production environment, please enable quota on the dataset, or make sure that no file transfers are active.


Nodes connected to the same AD server must have unique Server names

     If JovianDSS nodes are connected to the same AD server, they cannot have the same Server names.


Share cannot be named the same as a Zpool

     If a share has the same name as a Pool, connection problems will occur. Please use different names.


No persistent rules for network cards in virtual environment

     Changing the settings of virtual network cards (deleting them, changing the MAC address, etc.) can cause unstable system behaviour. Please do not change these settings on a production system. This issue will be fixed in a future release.


Downgrade to up17 or earlier is not possible

     Starting from up18, the bootable medium always has a SW RAID structure. Reverting to an earlier version is impossible. If you need to go back to an earlier version, you must reinstall it.


System cannot be installed on NVMe disks or on cciss-based controllers

     This issue will be fixed in a future release.


Interrupting the process of adding a second disk to the SW RAID (bootable medium) can cause the system to run from a disk with incomplete data

     Performing an operation like a reboot, shutdown or power-off while data is being mirrored to a newly added disk can cause the system to boot from the new disk, which has incomplete data. In this case, the SW RAID function shows an empty status and a wrong number of RAID members. To resolve this issue, please unplug the disk with incomplete data, boot the system, plug the disk back in and add it to the SW RAID once again.


MPIO cannot be used with Cluster over Ethernet

     It is strongly recommended not to use Cluster over Ethernet with the MPIO functionality. Such a configuration can lead to very unstable cluster behavior.