Greetings, Wizards!
I have been battling an issue with microbranch-VPNC communications for a customer. What appeared at first to be a end-device issue (ip-phone on a wired port, CL2 to the DataCenter VPNC), now appears to be the microbranch connection to the VPNC itself. It seemed to be just one of the microbranch APs, but I have replicated the issue on my own (303H).
What I see in the logs is the tunnel not establishing, or trying to re-establish often. The wired client is put into a "deny all" role since the VPNC is not responding for the auth-request (show ap debug auth-trace-buf shows "server out-of-service"). The AP "show ata state" information reveals something like the following:
ub_303h# show ata state-transition-history
Tunnel State Transition History
-------------------------------
Timestamp Peer IP UUID Current State Event Next State
--------- ------- ---- ------------- ----- ----------
2024-02-13 20:53:21 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 INIT TUN_RECV CONNECTING
2024-02-13 20:53:27 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 CONNECTING TUN_RECV CONNECTING
2024-02-13 20:53:30 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 CONNECTING PROBE_OK CONNECTED
2024-02-13 20:53:56 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 CONNECTED TUN_REKEY REKEYING
2024-02-13 20:54:27 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 REKEYING HB_TIMEOUT CONNECTING
2024-02-13 20:54:28 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 CONNECTING HB_TIMEOUT INIT
2024-02-13 20:57:00 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 INIT TUN_RECV CONNECTING
2024-02-13 20:58:12 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 CONNECTING TUN_RECV CONNECTING
2024-02-13 20:59:55 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 CONNECTING TUN_RECV CONNECTING
2024-02-13 21:01:58 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 CONNECTING PROBE_TIMEOUT INIT
2024-02-13 21:06:58 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 INIT TUN_RECV_TIMEOUT SURVIVING
2024-02-13 21:07:58 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 SURVIVING TUN_RECV CONNECTING
2024-02-13 21:08:12 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 CONNECTING TUN_RECV CONNECTING
2024-02-13 21:08:14 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 CONNECTING TUN_RECV CONNECTING
2024-02-13 21:08:16 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 CONNECTING TUN_RECV CONNECTING
2024-02-13 21:09:32 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 CONNECTING TUN_RECV CONNECTING
2024-02-13 21:11:35 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 CONNECTING PROBE_TIMEOUT INIT
2024-02-13 21:11:35 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 INIT TUN_RECV_TIMEOUT SURVIVING
2024-02-13 21:11:52 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 SURVIVING TUN_RECV CONNECTING
2024-02-13 21:13:55 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 CONNECTING PROBE_TIMEOUT INIT
2024-02-13 21:18:56 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 INIT TUN_RECV_TIMEOUT SURVIVING
2024-02-13 21:20:01 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 SURVIVING SUR_TUN_ERR SURVIVING
2024-02-13 21:21:07 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 SURVIVING SUR_TUN_ERR SURVIVING
2024-02-13 21:22:14 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 SURVIVING SUR_TUN_ERR SURVIVING
2024-02-13 21:22:45 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 SURVIVING TUN_RECV CONNECTING
2024-02-13 21:24:45 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 CONNECTING PROBE_TIMEOUT INIT
2024-02-13 21:24:45 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 INIT TUN_RECV_TIMEOUT SURVIVING
2024-02-13 21:25:53 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 SURVIVING SUR_TUN_ERR SURVIVING
2024-02-13 21:27:02 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 SURVIVING SUR_TUN_ERR SURVIVING
2024-02-13 21:28:12 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 SURVIVING SUR_TUN_ERR SURVIVING
2024-02-13 21:29:23 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 SURVIVING SUR_TUN_ERR SURVIVING
2024-02-13 21:30:36 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 SURVIVING SUR_TUN_ERR SURVIVING
2024-02-13 21:31:49 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 SURVIVING SUR_TUN_ERR SURVIVING
2024-02-13 21:33:03 x.x.x.x 317de5bb-646c-4f30-adef-744b204dfe92 SURVIVING SUR_TUN_ERR SURVIVING
ub_303h#
However, when it is finally established it may stay good for a long (30-90 minutes) period of time, or it may break before the client device can complete DHCP. When the connection is up, pings reliably pass through the tunnel at a stable latency (~100ms) and no drops over 100 pings.
Some of the microbranch APs are up and stay up. Some do not.
In my lab (no internet) I am able to set up a microbranch-to-VPNC setup with no issues and it is completely stable. Aruba TAC has been no help in the many requests I've made to have them look at it.
Any suggestions? It appears to be an IKE-related issue, but could it be a routing issue or packet drop at the VPNC side?
------------------------------
ryh
------------------------------