[UFO Chicago] help in investigating a possible packet storm

Politik Durden politikdurden at yahoo.com
Thu Apr 1 21:15:24 PDT 2010


Hello all, 

Going to a client site at 6 AM tomorrow because at about 5 PM today (Thursday) all network traffic started getting really really slow.

Here's what I know:

- no recent changes (no new switch, NIC, changes to static routes, config changes, patches/upgrades, etc)

- about a dozen switches feed into a 3COM switch (no model #s yet). ballpark of 2 to 3 hundred nodes total

- no protocols are used, all devices are in "dumb" mode and act as just a plain 'ol switch. some can be managed but no features (snmp, etc) are turned on.

- most nodes *seem* to be pingable from both sides of the firewall, but everything is just crawling. 

- nothing (reports, scripts, etc) is timing out, but everything is just super super slow.

They tried swapping out switches one at a time to narrow down the culprit and that helped for a bit, but then traffic slowed down again and they couldn't really do any more during production hours.

Theories: 

- Can one bad port cause this kind of a traffic jam ? They started diags on all the major nodes (server NICs, the central 3COM switch, etc) but nothing obvious so far. 

- Some sort of protocol/feature was turned on by mistake and now all the switches are confused ? A quick "topeka" (ha!!) points to stories of spanning tree causing these kinds of traffic jams.

- Somehow a loop got introduced ? 

What I really need is suggestions on a good free traffic tool, something we can install on two or three laptops and put each switch through its paces. Any ideas ? 

Thanks in advance for your comments. This lot always points me in the right direction :-)



      
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://ufo.chicago.il.us/pipermail/ufo/attachments/20100401/e9495585/attachment.htm 


More information about the ufo mailing list