Hello to anyone looking for a Friday afternoon diversion (or whenever your time zone puts you).... I have a small network where continuous uptime is *critical*. It's worked well for several years, but is in bad need of an overhaul.
Currently, I'm running my entire LAN (one segment) over 4 venerable ProCurve 4000m switches in a fully meshed topology, each connected with 1GB-SX "mesh" trunks. Two are in the data center, where I've got teamed NICs in my main servers connected one to each switch. If one fails, the servers are still connected through the survivor. The other two switches are in the closet serving the desktops. If one fails there, I move the wires off the dead switch into the spare capacity on the survivor, and am back at full strength quickly. I've never had a switch fail in 7+ years, but the resiliency has given me (and management) a lot of comfort. As a layer 2 network on one segment, it's really easy to manage.
Now, I need to upgrade for (1) more bandwidth, especially in the data center; (2) better security, by moving some functions out of the LAN into screened subnet/DMZs. We'll be deploying VoIP on the LAN soon, so high availability will be even more important, if that's possible.
My proposed solution to this is to (1) replace the 4000m with something faster, and (2) get a solid routing firewall box (ideally redundant) to manage the routing and access to the various DMZs and the Internet. (At the moment, I've got the Juniper SSG 140 in mind, but that's another topic.)
So, putting aside the firewall issue for the moment, I see these options to upgrade the LAN (1):
(a) Drop in faster switches with mesh capability. The pros are that this maintains the robust availability and simple administration. The cons are that HP has limited this function to the 5400zl and 5300xl series, which are on the high-end of what we can afford. Plus, meshing knocks out all the (potentially) valuable routing features of the box. If I'm using the firewall to route between my few VLANs, though, I'm not sure this is a big deal. Just seems like a waste to me. (HP - how about including mesh functions in the 4200vl ?!?) It's also interesting to me that, unless I've missed it, HP's competition doesn't offer anything this simple and effective for the small environment. You've got Foundry's VSRP, for example, but it's necessarily more complex....
(b) Create a more generic topology with 2 switches in the data center, and 2 in the closet. Run trunked 1Gb links (10GbE is too expensive or N/A) between the four in a ring topology for redundancy, and use Spanning Tree to manage the redundant links. So, this may be cheaper out of pocket to start up (4 x 4200vl-48G vs. 4 x 5400zl-48G ~ $10K), but it's tougher to configure, and seems like fail-over performance is not necessarily fast or guaranteed. Also, I appears that teamed NIC members can connect to different physical switches in this environment, but can anyone confirm that to me?
(c) Do like (b), but use layer 3 facilities like OSPF to manage the routes. Of course, this doesn't save money (in the HP line, anyway), and it certainly complicates things.
So, if I understand these options correctly - and not being too experienced with (b) or (c) - it sounds like (a) is the way to go, even if it comes at a relatively steep price to start. Am I missing something simple; is there a flaw back in my basic topology (LAN/firewall/DMZ); or does this sound reasonable?
I'd really appreciate your feedback. Thank you!!
Gary