Wireless Access

Reply
Highlighted
Occasional Contributor I

Issues with 215s going down

Good day,

 

We are having an issue where 215s in the cluster go down. 115s will continue to run. If you restart the controller the 215s come back up but will go down again after some time (this time apparently varies, and can be as much as a few days, as little as a few hours). While the APs are listed as down in the controller you are still able to ping them. Also after the reboot, the listed uptime is not consistent with the time they were rebooted.

 

They are using a 7005 on Aruba OS 8.3.0.3

 

Does anyone know what could be causing this?

Guru Elite

Re: Issues with 215s going down

Don't reboot the controlller, because that will make all of the APs reboot.  On the controller commandline, type "show log system all".  When APs go down and come up, it will tell you why.


*Answers and views expressed by me on this forum are my own and not necessarily the position of Aruba Networks or Hewlett Packard Enterprise.*
ArubaOS 8.4 User Guide
InstantOS 8.3 User Guide
Airheads Knowledgebase
Airheads Learning Videos
Aruba Central Documentation
Sign up for Security Alerts
Aruba Technical Webinars
Regular Contributor I

Re: Issues with 215s going down

what is the output of the following command?

 

" show ap debug system-status ap-name NMB-EXCEL-BDLE | include Reboot "

 

--Give Kudos: found something helpful, important, or cool? Click Kudos Star in a post.
--Problem Solved? Click "Accepted Solution" in a post.


Ajay Kumar Ravipati
ACMA (V8) | ACMP (V8) | CCENT | CCNA (R&S) | PAN-OS 8.0 ACE
Occasional Contributor I

Re: Issues with 215s going down

Thank you, I will try it

Occasional Contributor I

Re: Issues with 215s going down

Thank you, im going onsite a bit later, so ill run it then and log the output.

Occasional Contributor I

Re: Issues with 215s going down

 


@A_RAK wrote:

what is the output of the following command?

 

" show ap debug system-status ap-name NMB-EXCEL-BDLE | include Reboot "

 

--Give Kudos: found something helpful, important, or cool? Click Kudos Star in a post.
--Problem Solved? Click "Accepted Solution" in a post.



Good morning, here is the output of the command:

 

Reboot Information

------------------

AP rebooted Fri Dec 31 16:45:49 PST 1999; SAPD: Unable to contact switch: HELLO-TIMEOUT. Last rebootstrap reason: HELLO-TIMEOUT, 229 sec before: Last Ctrl msg: HELLO len=1447 dest=192.168.254.24 tries=10 seq=0

-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

 

Rebootstrap Information

-----------------------

Date       Time     Reason (Latest 10)

--------------------------------------

2000-01-03 02:49:51 Switching to LMS 192.168.254.24: Missed heartbeats: Last Sequence Generated=9 Sent=9 Rcvd=0; eth Sent=29230 Drop=0; gre Sent=29236 Drop=0 First=1; ipsec Sent=0 Drop=0. Last Ctrl message: STATUS_REPORT len=77 dest=192.168.254.24 tries=1 seq=2

2000-01-03 02:50:02 Switching to LMS 192.168.254.24: Missed heartbeats: Last Sequence Generated=9 Sent=9 Rcvd=0; eth Sent=29239 Drop=0; gre Sent=29245 Drop=0 First=1; ipsec Sent=0 Drop=0. Last Ctrl message: STATUS_REPORT len=77 dest=192.168.254.24 tries=1 seq=2

2000-01-03 02:50:13 Switching to LMS 192.168.254.24: Missed heartbeats: Last Sequence Generated=9 Sent=9 Rcvd=0; eth Sent=29248 Drop=0; gre Sent=29254 Drop=0 First=1; ipsec Sent=0 Drop=0. Last Ctrl message: STATUS_REPORT len=77 dest=192.168.254.24 tries=1 seq=2

2000-01-03 02:50:24 Switching to LMS 192.168.254.24: Missed heartbeats: Last Sequence Generated=9 Sent=9 Rcvd=0; eth Sent=29257 Drop=0; gre Sent=29263 Drop=0 First=1; ipsec Sent=0 Drop=0. Last Ctrl message: STATUS_REPORT len=77 dest=192.168.254.24 tries=1 seq=2

MVP Guru

Re: Issues with 215s going down

The APs rebooted due to being unable to reach the controller.

 

AP rebooted Fri Dec 31 16:45:49 PST 1999; SAPD: Unable to contact switch: HELLO-TIMEOUT. Last rebootstrap reason: HELLO-TIMEOUT, 229 sec before: Last Ctrl msg: HELLO len=1447 dest=192.168.254.24 tries=10 seq=0

Do you have any packet loss between the APs and the controllers? Reason being is you can see there is missed heartbeats between the controller and the AP.

 

2000-01-03 02:50:13 Switching to LMS 192.168.254.24: Missed heartbeats: Last Sequence Generated=9 Sent=9 Rcvd=0; eth Sent=29248 Drop=0; gre Sent=29254 Drop=0 First=1; ipsec Sent=0 Drop=0. Last Ctrl message:

Also, do you have NTP configured?


ACMP, ACSA, ACDX #985
If my post addresses your query, give kudos:)
Regular Contributor I

Re: Issues with 215s going down

What is the topology? Do you have a standalone setup or a redundant setup?

Do you have redundancy setup?(vrrp)

APs exchange keepalives (heartbeats ) with the controller every second.

If they miss 8 heartbeats (8 seconds in total) they rebootstrap.(basically a reboot with some nuances)

I am assuming that you have sufficient licenses or they would have come up with an IL flag if you don't have sufficient licenses.

Do you see any errors in the configuration profiles?

Show profile-errors.

Finally how are the APs setup to discover their master? Is the Masters ip hardcoded or are you using DHCP/DNS.

What is the configuration for the ap system-profile?

Show ap system-profile (name of the profile used)

Could you post the boot log of any one ap.

Ajay Kumar Ravipati
ACMA (V8) | ACMP (V8) | CCENT | CCNA (R&S) | PAN-OS 8.0 ACE
Occasional Contributor I

Re: Issues with 215s going down

Good morning all,

 

My apologies for not getting back on this sooner. 

 

We were able to fix the issue with a controller upgrade. We have also since configured an NTP server.

 

I appreciate all the responses that we got on the post. Thank you for your assistance.

Search Airheads
cancel
Showing results for 
Search instead for 
Did you mean: