Troubleshoot RCM Based UPF Upgrade Failure - Configmgr Missing Host

Available Languages

Download Options

PDF (6.4 KB)
View with Adobe Reader on a variety of devices
ePub (83.2 KB)
View in various apps on iPhone, iPad, Android, Sony Reader, or Windows Phone
Mobi (Kindle) (68.9 KB)
View on Kindle device or Kindle app on multiple devices

Updated:February 12, 2024

Document ID:221676

Bias-Free Language

The documentation set for this product strives to use bias-free language. For the purposes of this documentation set, bias-free is defined as language that does not imply discrimination based on age, disability, gender, racial identity, ethnic identity, sexual orientation, socioeconomic status, and intersectionality. Exceptions may be present in the documentation due to language that is hardcoded in the user interfaces of the product software, language used based on RFP documentation, or language that is used by a referenced third-party product. Learn more about how Cisco is using Inclusive Language.

Introduction

Problem

Solution

Introduction

This document describes RCM based UPF upgrade failure due to configmgr is missing the host entry

Problem

When RCM (Redundancy Configuration Manager) controller initiates a planned UPF (User Plane Function) switchover from UPF 1(Active) to UPF 2(Standby), configmgr is expected to have both the UPF 1 and UPF 2 in its host list. But due to some reason configmgr does not have the Active UPF 1 in its active host list, contradicting with host list.on controller

And when RCM triggers UPF 1 switchover to UPF 2 in such condition, switchover process gets initiated. During switchover process configmgr tries to find the Active UPF 1 host details in its host list, but fails to find.

UPF switchover process fails with reason "Old Active moved from PendingStandby to Active due to timeout in receiving Standby state (planned switchover)" and UPF1 gets moved from PendingStandby to Active and UPF 2 from PendingActive to Standby.

//How to detect switchover failure is due to configmgr missing host details in its host list

In the RCM tac dbg covering such switchover failure times, look for log event in the configmgr pod log.

2024/01/12 09:08:26.878 rcm-configmgr [DEBUG] [sshclient.go:980] [rcm_grpc_ep.msg-process.Int] [RcmGenTrap]: SNMP Trap Raised: (SwitchoverFailure) - Switchover from 10.248.187.151:22 to 10.248.187.153:22 in Group:1 Failed! Reason : Active Not Found

If rcm tac dbg is NOT present, you can also confirm the UPF switchover failed due to this issue by looking for snmp trap from RCM controller ops-center.

a) Login to Active RCM ops-center

b) Run command rcm show-snmp-trap history

c) Look in the snmp traps trap present

SwitchoverFailure 2024-01-18T05:19:45.Z 2024-01-18T05:19:45.Z rcm-configmgr Switchover from 10.244.127.23:22 to 10.244.127.29:22 in Group:1 Failed! Reason : Active Not Found

Solution

Untill permanent fix comes via Cisco bug ID CSCwi70133 Work around is to delete the configmgr pod from the corresponding AIO (All In One) K8s master node, using kubectl delete <configmgr-pod-name> -n <k8-name-space>

Example :

1. As part of pre-checks of the UPF upgrade automation work flow, checks to compare the controller and configmgr host list canbe done. If a host is missing in configmgr host list, configmgr pod delete can be done so that configmgr gets complete hosts list freshly from controller.

2. If UPF switchover is being given manually, collect 2 CLI commands outputs from active RCM and compare them to find if any Host(Active/Standby) is missing in the configmgr host output. If any host missing, issue configmgr pod delete from RCM AIO K8s master node & recheck the controller and configmgr host list. If the hosts are matching on controller and configmgr, proceed to manaul switchover of UPFs from controller.

a) rcm show-statistics controller

b) rcm show-statistics configmgr

Revision History

Revision	Publish Date	Comments
1.0	14-Feb-2024	Initial Release

Contributed by Cisco Engineers

Narender Tummala

Was this Document Helpful?

Feedback

Contact Cisco

Open a Support Case
(Requires a Cisco Service Contract)

Troubleshoot RCM Based UPF Upgrade Failure - Configmgr Missing Host

Available Languages

Download Options

Bias-Free Language

Contents

Introduction

Problem

Solution

Revision History

Contributed by Cisco Engineers

Was this Document Helpful?

Contact Cisco