Wired Intelligent Edge

 View Only
  • 1.  8320 VSX Split recovery

    Posted Mar 10, 2022 05:36 AM

    Hi

    Had a split brain on our 8320 cluster running 10.05.0020.

    Split Recovery Mode: Enabled

    All MC-LAG interfaces on the secondary node remain down.

    ISL channel: Out-Of-Sync
    ISL mgmt channel: inter_switch_link_down
    Config Sync Status: out-of-sync
    NAE: peer_unreachable
    ...
    ISL version: same in Primary & secondary
    Software version: ame in Primary & secondary
    ...
    Device Role: Primary = Primary & Secondary = Secondary (Device roles inconsistent)

    I have tried to find examples on how to manually recover the current split situation but cannot find any information. 

    Since the secondary node is obviously not forwarding any traffic I thought that i could just go ahead and reboot it, BUT not sure what's gonna happen if nodes are not in sync.

    Appreciate any relevant advice.

    Cheers

    Timo Krjukoff



    ------------------------------
    Timo Krjukoff
    ------------------------------


  • 2.  RE: 8320 VSX Split recovery

    Posted Mar 11, 2022 03:51 AM
    What is strange is that ISL is seen as down.
    Could you try toggling the LAG used for ISL on both primary and secondary ?
    like checking LACP state.

    A very important note:
    In November 2020, Aruba communicate that all AOS-CX systems should be upgraded to at least 10.05.0021 for 10.05 (or 10.04.3031 for 10.04).
    due to a bug impacting the life of SSD. I see you still run 10.05.0020. I would strongly recommend to upgrade to the latest maintenance release of 10.08
    (which brings lot of new features and bug fixes).

    ------------------------------
    Vincent Giles
    ------------------------------



  • 3.  RE: 8320 VSX Split recovery

    Posted Mar 11, 2022 06:27 AM

    Hi Vincent

    I've toggled the ISL link, even rebooted the secondary node. State remains not-in-sync. Have opened a case with TAC.

    Cheers

    /timo



    ------------------------------
    Timo Krjukoff
    ------------------------------



  • 4.  RE: 8320 VSX Split recovery

    Posted Mar 11, 2022 08:19 AM
    ISL LAG LACP state is ALFNCD ?

    ------------------------------
    Vincent Giles
    ------------------------------



  • 5.  RE: 8320 VSX Split recovery

    Posted Mar 11, 2022 08:36 AM
    Yes, itis.

    ------------------------------
    Timo Krjukoff
    ------------------------------



  • 6.  RE: 8320 VSX Split recovery

    Posted Mar 14, 2022 05:13 AM
    ok, then TAC is the right approach.
    Since 10.05.0020 lot of bug fixes, I would strongly suggest to upgrade if you have a coming maintenance window.

    ------------------------------
    Vincent Giles
    ------------------------------



  • 7.  RE: 8320 VSX Split recovery

    Posted Mar 16, 2022 02:29 AM

    We did an upgrade to 10.06.0180 and rebooted the cluster. Did fix the problem :).

    /timo



    ------------------------------
    Timo Krjukoff
    ------------------------------