background image

NIC Failure test

We performed the hard NIC failure test by removing one NIC cable from the
active node that was involved in active recording. After the NIC failure, writing to
the same node failed. When the network fails, the server must recognize the
failure, then it must establish a new connection. Also, when the network fails TCP
socket connections are left open and remain open on the cluster until Isilon's
OneFS forces them closed, which allows the server to continue writing.

We can force the open TCP sockets to close for a duration of less than 2 minutes
by reducing the 

TCP keep idle

 and 

TCP keep interval

 timeout to the

optimum values recommended by Isilon Engineering.

To reduce the video loss duration due to the 

TCP Socket Open

 condition, set

the persistent values in the 

sysctl.config

 file as follows to reduce the impact

duration time significantly:

 isi_sysctl_cluster

net.inet.tcp.keepidle=61000

 isi_sysctl_cluster

net.inet.tcp.keepintvl=5000

Refer to the KB article: 000089232 for further information about how to
configure these parameters.

Note

NIC failure impact can be overcome by using NIC aggregation in Active/Passive
Failure aggregation mode, which is explained in the next test case. Connectivity
to the nodes that are not affected by the network outage continues to be
available throughout the test scenario and no impact was observed.

NIC Failure test with NIC aggregation in Active/Passive

We did a hard NIC failure test with Active/Passive aggregation by removing the
active NIC port cable. After the network failure, writing to the same node
continued and the NIC that was passive was immediately changed to the active
NIC. The NIC failure caused no apparent loss.

Note

NIC aggregation in Active/Passive mode remedies only a network
disconnection/NIC failure that happens on the Isilon node or the corresponding
switch port where it is connected.

Node Poweroff Test

To simulate an unexpected single node hard failure, we held down the power
button until the node powered off. This causes the servers that were writing to
that node to reconnect to a new node. In our tests, the servers on the failed node
reconnected to a new node, but did not start writing again for an aggregate
(reconnect and start writing) duration of up to 52 seconds while waiting for
writing to the SMB share to be re-started.

The second issue is that the removal or addition of a node causes an interrupt to
the cluster. Therefore, video servers writing to the other nodes might experience
a short interruption. The duration of the interruption can be reduced by modifying
the OneFS environment variables.

Testing and validation

Tests conducted

     

23

Summary of Contents for EMC Series

Page 1: ...Surveillance Dell EMC Storage with ISS SecurOS Sizing Guide H14192 REV 1 1 ...

Page 2: ...RMATION IN THIS PUBLICATION AND SPECIFICALLY DISCLAIMS IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE USE COPYING AND DISTRIBUTION OF ANY DELL SOFTWARE DESCRIBED IN THIS PUBLICATION REQUIRES AN APPLICABLE SOFTWARE LICENSE Dell EMC and other trademarks are trademarks of Dell Inc or its subsidiaries Other trademarks may be the property of their respective owners Published ...

Page 3: ...ocols 14 ISS SecurOS Enterprise 14 Sizing the solution 17 Bandwidth sizing guidelines 18 EMC VNX 18 Dell EMC Isilon node and cluster SMB2 19 Testing and validation 21 Test objectives 22 Test parameters 22 Tests conducted 22 Conclusion 25 Summary 26 Dell EMC Isilon scale out storage 26 EMC VNX arrays 26 EMC VNX VSS arrays 26 Chapter 1 Chapter 2 Chapter 3 Chapter 4 Chapter 5 Chapter 6 CONTENTS Dell ...

Page 4: ...CONTENTS 4 Dell EMC Storage with ISS SecurOS Sizing Guide ...

Page 5: ...CHAPTER 1 Introduction This chapter provides information on the purpose and scope of this solution l Solution overview 6 l Scope 6 l Key objectives 7 Introduction 5 ...

Page 6: ...out by Dell EMC engineers in a VMware ESXi virtualized infrastructure Use this guide to determine the best configuration for the following l Number of ISS Video Servers l Mix of nodes and ISS Video Servers based on the expected bandwidth in an Isilon implementation l Storage using Fibre Channel FC and Internet SCSI iSCSI on VNX systems l Storage using Server Message Block SMB on Isilon systems l L...

Page 7: ...ds for specific system requirements so that an implementation can be correctly sized and the appropriate Dell EMC products can be matched to a customer s requirements l Determine the VNX and VSS LUN bandwidth within the storage pool l Recommend an Isilon SMB configuration l Calculate array or node maximum bandwidths l Recommend disk drive types l Determine Isilon bandwidth for each ISS Video Serve...

Page 8: ...Introduction 8 Dell EMC Storage with ISS SecurOS Sizing Guide ...

Page 9: ...R 2 Configured components This chapter provides information about the components configured in this solution l Isilon clustered storage system 10 l Data protection 10 l Cluster size 11 Configured components 9 ...

Page 10: ...bout Isilon data protection see Dell EMC Isilon OneFS A Technical Overview Although cluster sizes as small as three nodes are possible for surveillance applications we recommend a minimum of five nodes Sizing calculations need to include a minimum free space calculation for proper cluster sizing We recommend a cluster size that enables a node to be removed while retaining a minimum of 10 percent f...

Page 11: ...practices for data protection To estimate the ideal number of nodes in a cluster you need to consider cluster bandwidth and capacity Sizing by bandwidth We recommend a cluster size with one or more additional nodes than calculated in bandwidth sizing This ensures that failover of a node allows for redistribution of NAS connections and avoids any frame loss Sizing by aggregate capacity We recommend...

Page 12: ...Configured components 12 Dell EMC Storage with ISS SecurOS Sizing Guide ...

Page 13: ...APTER 3 Solution components This chapter provides information about storage options for video and audio data l Dell EMC storage 14 l Storage protocols 14 l ISS SecurOS Enterprise 14 Solution components 13 ...

Page 14: ...r The Video Server is a computer that is used to connect cameras IP and or CCTV analog cameras from which to receive video In addition to cameras this installation type enables the installation or connection of various security devices fire alarm system controllers access control subsystem controllers among others and ISS analytics modules for example SecurOS Auto Module The Video Server installat...

Page 15: ...primary services continued Service Description l Create and configure system objects l Delete existing objects l Change the parameters of the created and operating objects Solution components ISS SecurOS Enterprise 15 ...

Page 16: ...Solution components 16 Dell EMC Storage with ISS SecurOS Sizing Guide ...

Page 17: ...r provides information to enable you to quickly determine the correct storage array based on your customer s bandwidth requirements l Bandwidth sizing guidelines 18 l EMC VNX 18 l Dell EMC Isilon node and cluster SMB2 19 Sizing the solution 17 ...

Page 18: ...sk rebuild or similar performance intensive events The VNX 5300 and VNX VSS100 can exceed 500 MB s in an environment without induced errors Table 2 EMC VNX VNXe storage array results Storage Protocol Array Bandwidth MB s Storage Pools RAID Array Pool LUN Pools LUNs Pool Disks Pool iSCSI VNX5300 60 60 1 1 72 5 VNX VSS100 60 60 1 1 72 5 VNX5200 60 60 1 1 81 5 300 30 1 5 81 5 This bandwidth is the ma...

Page 19: ...mple 200 low resolution cameras the numbers in the following table are valid with a high confidence level for bandwidth performance for up to 2 000 cameras We performed all tests with node or drive failures in place in the cluster for example with Isilon FlexProtect running to ensure a worst case scenario for all sizing parameters The following table provides bandwidth sizing guidelines based on o...

Page 20: ...Sizing the solution 20 Dell EMC Storage with ISS SecurOS Sizing Guide ...

Page 21: ...CHAPTER 5 Testing and validation This chapter describes the testing used to validate this solution l Test objectives 22 l Test parameters 22 l Tests conducted 22 Testing and validation 21 ...

Page 22: ...ollowing parameters to perform the tests l Tests were conducted by loading each recorder at the maximum sustainable bandwidth of 64 MB s with 7Mbps camera bandwidths were used l The IP network Layer 2 is a flat high availability network with plenty of capacity which enabled us to focus on the products we were testing l All tests assumed uniform distribution of bandwidth from the ISS Video Server T...

Page 23: ...to the nodes that are not affected by the network outage continues to be available throughout the test scenario and no impact was observed NIC Failure test with NIC aggregation in Active Passive We did a hard NIC failure test with Active Passive aggregation by removing the active NIC port cable After the network failure writing to the same node continued and the NIC that was passive was immediatel...

Page 24: ...ASE 0 75 isi_sysctl_cluster kern maxvnodes BASE isi_sysctl_cluster kern minvnodes BASE isi_sysctl_cluster efs lin lock initiator lazy_queue_goal COUNT isi_sysctl_cluster efs ref initiator lazy_queue_goal COUNT isi_sysctl_cluster efs mds block_lock initiator lazy_queue_goal MDS isi_sysctl_cluster efs bam datalock initiator lazy_queue_goal MDS WARNING If running a mixed workload these changes can ad...

Page 25: ...CHAPTER 6 Conclusion This chapter summarizes the testing for this solution l Summary 26 Conclusion 25 ...

Page 26: ...ch Video Server view of the storage is based on the assigned quota and not the entire file system We recommend using SmartQuotas with ISS SecurOS as a best practice EMC VNX arrays The use of storage pools to create LUNs within the EMC VNX arrays greatly simplifies the configuration and increases the performance when compared to traditional block level storage Either iSCSI or FC can be implemented ...

Reviews: