Wireless Access

Reply
Contributor I

Missed heartbeat on 6.4.2.6, APs rebootstrap

Wondering if anyone else is seeing this problem.

 

We've experienced problems with after updating controllers to 6.4.2.6 

All APs on a controller will rebootstrap after a period of time, usually less than 48 hours. Ap debug info shows heartbeats missed and the reason for disconnect as: controller aged out

 

We've rolled our live controllers back to the previous version (with the ARM memory leak) for now and have a couple controllers on test. The problem occurred on one of the test controllers over the weekend, the rest of our 1900 APs were just fine on the older code.

 

We believe there's a bug in 6.4.2.6 and we're in conversation with support about this. Currently we're being told nobody else has reported this problem, so thought I'd ask here... Shout up if you've seen it.

Guru Elite

Re: Missed heartbeat on 6.4.2.6, APs rebootstrap

Ultimate_Fish,

 

How many controllers do you have serving those 1900 access points, and how are they connected to your switched network?

 



Colin Joseph
Aruba Customer Engineering

Looking for an Answer? Search the Community Knowledge Base Here: Community Knowledge Base

Contributor I

Re: Missed heartbeat on 6.4.2.6, APs rebootstrap

Up until recently we've had a pair of 7200 controllers with a VRRP address. As we're nearing the limit of one controller's ability to sustain our AP estate we've added another two controllers. 

 

As of now, we're back to two live, rolled back to 6.4.2.4, and the second pair are running 6.4.2.6 with a small number of APs.

 

The controllers each have two 10Gbe ports trunked to our procurve switches.

 

All was working just fine on 6.4.2.4, and indeed the original pair of controllers are now fine on that code with about 1897 APs. 

 

The APs we moved over to the test controllers running 6.4.2.6 all rebootstrapped over the weekend, so it isn't load dependent. When the problem occurs we lose all APs on the affected controller.

 

 

Super Contributor I

Re: Missed heartbeat on 6.4.2.6, APs rebootstrap

 

Anyway per my post on the other thread, you may be seeing what we were seeing.

 

We were told this time the fixed-in version to be 6.4.2.4.  We're currently testing on 6.4.2.5.

If you have TAC working on this, please tell them to review case #1552028.

 

Super Contributor I

Re: Missed heartbeat on 6.4.2.6, APs rebootstrap

...and we can now confirm we are seeing this behavior on 6.4.2.5 as well.

 

Contributor I

Re: Missed heartbeat on 6.4.2.6, APs rebootstrap

OK, well that's sort of good to know... Which version were you running when you first had the problem?

Super Contributor I

Re: Missed heartbeat on 6.4.2.6, APs rebootstrap

Ever since 6.4.1 but not before that.

 

Note our controllers are connected directly with nothing but a MAS between them, on the same VLAN, and often show no missed heartbeats when the problem happens.

 

We have other reasons to be going to EA again so we'll be trying this out on 6.4.3.1 soon.

 

 

Super Contributor I

Re: Missed heartbeat on 6.4.2.6, APs rebootstrap

So, almost a week now with HA intercontroller heartbeats enabled on 6.4.3.1 and no mass failover events.

 

Other than the software upgrade (from 6.4.2.5), the only other change we made was raising the port MTUs on the MAS directly connected to the controller to 9216 from 9000.

 

Search Airheads
cancel
Showing results for 
Search instead for 
Did you mean: