Hi there...
I have a customer where I just installed an HPE 10508 chassis, running the latest firmware version of v7. In this next weekend I am going to add an slave member, forming the IRF Stack that has been previiously configured.
The first chassis is running for the last 10 days without problems, while the second rack is being fixed, where the second chassis will be installed. But yesterday a strange error started to appear in the log, without any clue of which could be the root cause:
% Jan 28 12: 06: 03: 519 2020 ESACSE_DC_CORE DEV / 4 / DEV_FAULT_TOOLONG: Card in chassis 1 slot 10 is still in Fault state for 10200 minutes.
The slot 10 refers to the first fabric card. This message was repeated every 60 minutes until this morning, when we pulled out the card and inserted it again, back to normal operation.
As I said before, there is no clue of what could be the root cause of this alarm in the log - just the error itself. I am not sure if I can swap the card with another fabric from the second switch, which is already configured, so I didn't made any offline test with the card. I know that the control plane and data plane runs separately in MPU and the fabric cards, but I am not sure if I could swap cards from different switches that are already configured.
Any idea of what is this error? I could not find any document referred to somtehing like that.
Thanks in advance for any help.