Data Center L2 Interconnect and Failover
Posted by: Pete Welcher
on Feb 12, 2011
The topic of Data Center Interconnect at L2, and failover between data centers, seems to be a hot topic! I'd like to briefly note some of the great interaction that's been occurring with some of my prior blogs, and also note an interesting article by Ivan Pepelnjak, whose technical skills I highly respect.
Concerning Data Center Interconnect (DCI), I really like the new Cisco OTV technology. (OK, you probably guessed that from all the prior articles about it.) Whatever technique you use for DCI, interconnecting data centers at L2, you have the problem of managing and optimizing inbound and outbound traffic to use the shortest path to or from the active virtual machine (VM) or cluster member(s).
Comments and discussion about that can be find in various of my blogs. I appreciate the feedback and the chance to learn and discuss! See in particular the article on Understanding Layer 2 over Layer 3 (Part 2) at http://www.netcraftsmen.net/component/content/article/69-data-center/796-understanding-layer-2-over-layer-3-part-2.html. I've noted below recent prior blogs and how many comments, for those inclined to go spelunking around this topic.
Other Takes on DCI and Optimal Routing
I was interested and amused to see a blog by Ivan Pepelnjak about Long Distance vMotion, at http://searchnetworking.techtarget.com/feature/Long-distance-vMotion-traffic-trombone-so-why-go-there?asrc=EM_NLN_13283529&track=NL-79&ad=813626.
Ivan makes a good case for multiple servers and Server Load Balancers in front of them, especially as a way of avoiding Data Center bridging. He also mentions "traffic tromboning", which is the sub-optimal traffic flows I've blogged about.
I'm not as allergic to data center bridging as he seems to be, despite having seen my share of data center meltdowns due to spanning tree loops. I hear good things about traffic storm control, and with L2 over L3 it seems like the L3 encapsulation will fail or reach capacity before the problem spreads as widely -- now that would be an interesting lab experiment. On the other hand, I like the idea of not using a risky technology unless you've got a darn good reason. Ivan raises one: some sort of server or application running on a single platform. And also of keeping management complexity down.
Overdoing It?
Frankly, with the DCI techniques, and especially with OTV, I worry about the "beer principle". (One beer might be a good thing. Too many beers leads to a headache.) Applied to OTV, you start doing it on a small scale, then it grows, and then one day you discover you've pushed it too far, it is unstable or having some problem ... and you have a headache.
I particularly worry about this in medical settings, and perhaps federal government datacenters. Hospitals tend to use real estate for medical needs, in good part because frankly that produces revenue. The management usually views IT as a necessary evil, a cost center. Funding for new hospitals and clinics takes priority over a new datacenter. Consequently, you end up with little datacenters scattered all over. A variant happens with the federal government. The current data center gets outgrown, then there isn't funding to build a new one that can hold all the servers (and after all, the old one is still working, albeit perhaps maxed on power and cooling or space), so another smallish one gets added one. Then maybe some space shows up in some other agency's datacenter (after the recent consolidation push), so that gets tacked on. When you view the overhead costs of operating small datacenters compared to one big one, this may be vastly suboptimal -- but there's apparently no good political and financial way out of it.
Over time, this scattered data center approach leads to design problems. One generally brings WAN links into the datacenter, but because the "main and backup" datacenters changed over time, the WAN links come in all over the place. Ditto Internet connections. With L2 between them, one might use stateful firewall pairs split across data centers, also L2 server clusters.
Split Firewall Pairs or Server Clusters
I do have a different concern about that situation, which is robustness of the cluster or stateful firewall pair. Links between datacenters might be L2 (or L2 over L3), but are generally not as reliable and error-free as links within a single datacenter. What happens to your firewall pair, or your Server Load Balancer pair, or your cluster, if the link between sites goes flaky but not down? I've heard horror stories about CheckPoint firewalls and Microsoft Exchange clusters where both sides thought they were primary. The issue seems to be that packet loss is not expect by the vendor, and when the link comes back up ... SURPRISE! In the case of CheckPoint, they both update each other's policy, which can lead to corrupted policy (i.e. missing ACL rules). In the case of Exchange, I've heard from one person who had experienced it that the servers started to re-synch their databases of email, but for 6 hours they then did not respond to email clients. I freely admit, I do not know nor have tested current versions of either.
If you have experience with situations like this (where the app continued to work correctly, or where there were problems), please add a comment -- more data is needed on this phenomenon, and collectively we have a lot more experience than I by myself do!
For some prior musings about this issue, see
- http://www.netcraftsmen.net/user-group/c-mug-archive/343-solving-real-data-center-design-problems.html
- http://www.netcraftsmen.net/archived-documents/petes-archives/doc_download/187-solving-real-data-center-design-problems-.html
Prior DCI and OTV Articles
Here are just the recent ones, see also the archives.
- Cisco Overlay Transport Virtualization (OTV) -- 2 comments
- First Hop Routing Protocol (FHRP) Info -- 4 comments
- OTV Optimal Routing -- 2 comments
- Exploiting Layer 2 over Layer 3 -- 4 comments
- Understanding Layer 2 over Layer 3 (Part 2) -- 2 comments
- Understanding Layer 2 over Layer 3 (Part 1)
- Working with EoMPLS Part 2 -- 5 comments
- Working with EoMPLS -- 5 comments

written by Dmitri Kalintsev, February 25, 2011
What about involving a "cloudy services" specialist provider, with big fat DCs and interconnects, and migrating your workloads there? And of course, the provider must be able to meet all the necessary regulatory and governance requirements before being considered.














Thanks for mentioning my article (in case your readers are interested in more details, they can find them in my blog @ http://blog.ioshints.info/search/label/Data Center). There are two reasons I'm so opposed to L2 DCI:
* It cannot be made reliable. Bridging was not designed for long distances or unreliable/low(er)-speed links. As you pointed out, the WAN link becomes the weakest link in the system and can (with 'proper' design) bring down the whole data center;
* There are other, way better, architectural options, including proper application architecture and load balancers. L2 DCI is just a kludge that networking vendors are promoting to make us spend more money.
Last but definitely not least, I do have a customer that experienced a split-brain cluster. Net result: two active database servers writing to the disk and totally corrupting the database. They spent several hours restoring the data from backups and lost a significant number of transactions.