Hello, we have a network with a core of HP E3800 stacks (4 switches in each stack) as its core component, all the core stacks (4, 2 at each site) and the spur switches (Dell power Connect) are connected together with 10Gb SR Fibre links in DT-LACP lag groups. Please see attached picture.
We have noticed some poor performance recently with some types of traffic across the network (CIFS/SMB for example) and iperf3 shows lots of TCP retransmissions when we run an iperf3 test. Across any of the HP E3800 cores.
Port utilisation is generally low with most of the 10GB links at or below 10% most of the time.
If we look at interfaces queues for the 10Gb links I can see we are getting dropped packets (Q8) on most of them.
show interfaces queues ethernet 1/49-1/52,2/49-2/52,3/49-3/52,4/49-4/52
Status and Counters - Port Counters for port 1/49
Name :
MAC Address : 00fd45-66438f
Link Status : Up
Port Enabled : Yes
Port Totals (Since boot or last clear) :
Rx Packets : 496,270,407 Tx Packets : 1,534,515,450
Rx Bytes : 735,054,609 Tx Bytes : 3,558,745,591
Rx Drop Packets : 522,011,516 Tx Drop Packets : 4,051,605
Rx Drop Bytes : 794,834,275,823 Tx Drop Bytes : 6,101,769,119
Egress Queue Totals (Since boot or last clear) :
Tx Packets Dropped Packets Tx Bytes Dropped Bytes
Q1 73,803,186 0 9,323,942,197 0
Q2 140,429,853,917 21,786 153,316,962,222,997 32,796,738
Q3 1,053,845,077,266 6,259 1,987,368,459,433,373 17,106,239
Q4 223,757 0 53,005,164 0
Q5 219,900 0 223,715,201 0
Q6 30,173 0 3,389,376 0
Q7 27,198 0 2,978,783 0
Q8 370,553,375,856 4,023,560 397,647,087,041,912 6,051,866,142
Is this port queue / memory exhaustion or something else? Has anyone seen something similar or can provide further troubleshooting steps?
Is it worth dropping the number of queues from 8 to 4 or 2?
#e3800#packetloss#dt-lacp