Understanding Network Redundancy

Author
Terry Slattery
Principal Architect

I am regularly surprised by network redundancy designs that are either overbuilt or underbuilt.  There seem to be two major themes that drive these designs.  I’ll address the first theme in this blog entry and the second theme in a subsequent entry.

The first theme results in a network with excessive redundancy.  Instead of a primary path and one backup path, it will have more than one backup path.  This creates additional design, implementation, and operational complexity.  With multiple backup paths, it is difficult to determine which path will be used when the primary path fails.  Troubleshooting becomes substantially more challenging when packet flows shift across multiple paths.  Add firewalls to different paths and it becomes just as difficult to make sure that all the paths have the same security implementation.

Another problem with multiple backup paths is that these paths are often also carrying operational data.  When a failure occurs, the rerouted traffic is added to a path that is already in use, and the traffic that normally uses that path may be negatively affected.

Let’s clarify what happens with this type of design with an example.  In the HubSpokeWheel diagram below, each site has three paths connecting it to its neighbors.

[Figure: HubSpokeWheel diagram]

The main communications path in this network was from each spoke site to the hub site.  The backup paths were via one of the two neighbors.  The problem was that each neighbor’s primary path was not sized to handle the load of its own spoke site as well as the load of a neighboring site (this design was based on low-speed links).
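The sizing problem can be checked with simple arithmetic.  The sketch below is illustrative only: the site names, link capacities, and traffic loads are hypothetical values, not figures from the network described here, and it assumes that a failed site's entire load shifts onto a single neighbor's link to the hub.

```python
def failover_overloads(link_capacity_kbps, site_load_kbps, neighbors):
    """List every failover case that would overload a neighbor's hub link.

    Each result is (failed_site, backup_neighbor, combined_load, capacity).
    Assumes the failed site's whole load moves to one neighbor.
    """
    overloads = []
    for failed_site, candidates in neighbors.items():
        for neighbor in candidates:
            combined = site_load_kbps[failed_site] + site_load_kbps[neighbor]
            capacity = link_capacity_kbps[neighbor]
            if combined > capacity:
                overloads.append((failed_site, neighbor, combined, capacity))
    return overloads


# Hypothetical four-spoke example: every hub link is a 1544 kbps circuit.
link_capacity_kbps = {"A": 1544, "B": 1544, "C": 1544, "D": 1544}
site_load_kbps = {"A": 900, "B": 800, "C": 400, "D": 300}
neighbors = {"A": ["B", "D"], "B": ["A", "C"], "C": ["B", "D"], "D": ["C", "A"]}

for failed, backup, load, cap in failover_overloads(
    link_capacity_kbps, site_load_kbps, neighbors
):
    print(f"{failed} failing over via {backup}: {load} kbps on a {cap} kbps link")
```

With these made-up numbers, sites A and B cannot back each other up, even though each site's link is adequate for its own traffic.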

I found that the primary path at one site was operating in unidirectional mode, where packets were not going over the primary path to the hub site.  Instead, packets were going to the neighbor at the bottom and then into the hub, impacting the users at the neighboring site as well as those at the site that had the original problem.

The secondary problem here was that the network engineers didn’t know where their packets would go when a failure occurred or how to troubleshoot it.  If two sites have problems, they could have a serious impact on several other sites, resulting in a network that seems to have many more problems than actually exist.
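One way to see why the failure path is hard to predict is to enumerate the candidate paths after a link failure.  The following is a minimal sketch, assuming a hub with six spokes, each also linked to its two ring neighbors, and assuming that every link has equal cost so that hop count decides the path.  The site names and spoke count are hypothetical, and real routing protocols use configurable metrics rather than plain hop count.

```python
from collections import deque


def build_hub_spoke_wheel(num_spokes):
    """Build a hub with spokes S1..Sn, each also linked to its ring neighbors."""
    graph = {"hub": set()}
    spokes = [f"S{i}" for i in range(1, num_spokes + 1)]
    for i, spoke in enumerate(spokes):
        right = spokes[(i + 1) % num_spokes]
        graph.setdefault(spoke, set()).update({"hub", right})
        graph.setdefault(right, set()).add(spoke)
        graph["hub"].add(spoke)
    return graph


def shortest_paths(graph, src, dst, failed_links=()):
    """Return every minimum-hop path from src to dst, skipping failed links."""
    failed = {frozenset(link) for link in failed_links}
    dist = {src: 0}
    parents = {src: []}
    queue = deque([src])
    while queue:
        node = queue.popleft()
        for nxt in sorted(graph[node]):
            if frozenset((node, nxt)) in failed:
                continue
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                parents[nxt] = [node]
                queue.append(nxt)
            elif dist[nxt] == dist[node] + 1:
                parents[nxt].append(node)  # another equally short way in

    if dst not in dist:
        return []

    def unwind(node):
        if node == src:
            return [[src]]
        return [path + [node] for p in parents[node] for path in unwind(p)]

    return sorted(unwind(dst))


graph = build_hub_spoke_wheel(6)
print(shortest_paths(graph, "S1", "hub"))
print(shortest_paths(graph, "S1", "hub", failed_links=[("S1", "hub")]))
```

Under these assumptions, the healthy network has a single path from S1 to the hub, but once the S1 primary link fails there are two equally good backup paths, one through each neighbor.  Which one carries the traffic depends on tie-breaking details that are easy to overlook.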

The key to good network design is to design specific redundancy, know where the failure paths will be, and make sure that both paths have the same performance and security implementation.  Ease of troubleshooting and ease of monitoring are critical to a successful design.  In the example above, the customer didn’t know that the alternate paths were being used.  You need to know when a failure has occurred so that it can be addressed before a second failure causes a network outage.  Ease of troubleshooting will reduce the time to diagnose the failure so that it can be repaired.
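Knowing when an alternate path is in use can be as simple as comparing the designed next hop for each site with the one actually observed.  The sketch below assumes the observed next hops have already been collected by some means, such as polling routing tables.  The data and the function name are hypothetical.

```python
def sites_off_primary(expected_next_hop, observed_next_hop):
    """Return the sites whose observed next hop is not the designed primary.

    A site with no observation is also reported, since a missing data
    point deserves a look as much as a wrong one.
    """
    return sorted(
        site
        for site, expected in expected_next_hop.items()
        if observed_next_hop.get(site) != expected
    )


# Hypothetical data: every spoke should forward directly to the hub.
expected = {"S1": "hub", "S2": "hub", "S3": "hub"}
observed = {"S1": "S2", "S2": "hub"}

print(sites_off_primary(expected, observed))
```

Here S1 is reported because it is forwarding through its neighbor, and S3 is reported because nothing was collected for it.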

-Terry

_____________________________________________________________________________________________

 

Re-posted with Permission

NetCraftsmen would like to acknowledge Infoblox for their permission to re-post this article which originally appeared in the Applied Infrastructure blog under http://www.infoblox.com/en/communities/blogs.html

