Logs and Information Required in the Event of a QPS System Failure

Available Languages

Download Options

PDF (31.3 KB)
View with Adobe Reader on a variety of devices
ePub (104.5 KB)
View in various apps on iPhone, iPad, Android, Sony Reader, or Windows Phone
Mobi (Kindle) (80.1 KB)
View on Kindle device or Kindle app on multiple devices

Updated:August 10, 2014

Document ID:117999

Bias-Free Language

The documentation set for this product strives to use bias-free language. For the purposes of this documentation set, bias-free is defined as language that does not imply discrimination based on age, disability, gender, racial identity, ethnic identity, sexual orientation, socioeconomic status, and intersectionality. Exceptions may be present in the documentation due to language that is hardcoded in the user interfaces of the product software, language used based on RFP documentation, or language that is used by a referenced third-party product. Learn more about how Cisco is using Inclusive Language.

Diagnostics and Debug Logs

QPS License Information

System Statistics

Thread Configuration in Policy Builder

Fatal Error Log

Introduction

This document describes the steps that must be completed in order to capture information when a Quantum Policy Suite (QPS) system failure or crash occurs. If the hardware, software, and virtual machine requirements are met, it is unlikely that the QPS will crash.

Prerequisites

Requirements

There are no specific requirements for this document.

Components Used

The information in this document is based on these software and hardware versions:

QPS Release 5.5 and later.

Note: Certain logs will not appear in QPS releases older than QPS Release 5.5.

The information in this document was created from the devices in a specific lab environment. All of the devices used in this document started with a cleared (default) configuration. If your network is live, make sure that you understand the potential impact of any command.

Capture Information

If a QPS system failure happens, collect this information:

Diagnostics and Debug Logs

Log in to the Policy and Charging Rules Function (PCRF) client virtual machine (for example, pcrfclient01) and collect diagnostic information (for example, /opt/broadhop/installer/diag/diagnostics.sh).
Log in to the PCRF client virtual machine and collect debug information. Debug information includes consolidated QNS log, svn repo, and QNS configuration details. Make sure that the consolidated logs cover the time of the system failure and that the debug level is set in the logback.xml file.
Collect this output from your QPS (for example, Run /opt/broadhop/installer/diag/zip_debug_info.sh and the output is stored in /var/tmp/debug_info<date>.zip).

QPS License Information

Log in to the PCRF client virtual machine and collect QPS license information. A QPS is usually licensed for a specific feature and there is a maximum number of concurrent sessions it supports. The QPS also has an expiration date for this feature.
Navigate to this directory: /etc/broadhop/license and capture the output of the license (.lic) file. (for example,cat /etc/broadhop/license/QUANTUM201311210402429360.lic).

System Statistics

Capture the system statistics (Example: CPU, memory, disk utilization).
Log in to the PCRF client virtual machine and collect the output. Example: /opt/broadhop/control/top_qps.sh
Log in to virtual machine that corresponds (for example, pcrfclient0x, lb0x, qns0x) and capture these system statistics:
cat /proc/meminfo > Allocated memory information
free –s 60 > Memory statistics for every single minute
vmstat 1 > CPU status for every single minute
ps –aux | head -10 > Top 10 process details which consumes most of CPU utilization
swapon –s > swap usage summary per device
. du -a | sort -n -r | head -n 10 > Top 10 files / directories consuming more space
Login to sessionmgr virtual machine and collect the outputs mongostat and mongotop, which will help in order to troubleshoot whether the issue is related to the database or not.

Thread Configuration in Policy Builder

The number of threads might range from 40 to 50 for TPS, but is less than 1,000. The maximum number of threads you can configure is 50. If you increase the number of threads, this impacts system performance.

Fatal Error Log

When a system failure occurs, the QPS generates a fatal error log, which contains the state of the process at the time the fatal error occured. Fatal error or fatal exception errors cause the program to abort.

The fatal error log includes this information:

The operating exception or signal that provoked the fatal error
Version and configuration information
Details on the thread that provoked the fatal error and the thread's stack trace
The list of running threads and their state
Summary information about the heap
The list of native libraries loaded
Command-line arguments
Environment variables
Details about the operating system (OS) and central processing unit (CPU)

The default log file name follows this format: hs_err_pid<pid>.log and is generated in the working directory where the corresponding Java processes started. Example: the working directory of the user when the user started the QNS process.

If you do not know the working directory, search the system for the file with the the name hs_err_pid*.log and examine the file for a time that matches when the error occured.

Complete these steps in order to specify the location for the fatal error:

Log in to the pcrfclient01 virtual machine
Open jvm.conf (for example,vi /etc/broadhop/pcrf/jvm.conf).
Add the option: -XX:ErrorFile=<directory>/<file-name>%p.log to the list and make sure that the specified directory path exists and that the user QNS has full permission over that directory. Example: -X:ErrorFile=/home/qns/fatal_error%p.log
The “syncconfig.sh” command can cause a lot of problems if the conf files in pcrfclient01:/etc/broadhop are not in synch with the conf files in /etc/broadhop on the VMs running the QNS service. The syncconfig.sh will take the pcrfclient01:/etc/broadhop conf files and over write the conf files in /etc/broadhop on the VMs running the QNS.
Warning: The command synconfig.sh will take the pcrfclient01:/etc/broadhop conf files and overwrite all the conf files in /etc/broadhop on the virtual machines running the QNS service (ifor example, iomgr01, iomgr02, qns01, qns02, etc.)
Restart the QNS application and enter the command restartall.sh

Contributed by Cisco Engineers

Aravindhan Balasubramian
Cisco TAC Engineer
Tony Pina
Cisco TAC Engineer
Vinodkumar Tiwari
Cisco TAC Engineer

Was this Document Helpful?

Feedback

Contact Cisco

Open a Support Case
(Requires a Cisco Service Contract)

Logs and Information Required in the Event of a QPS System Failure

Available Languages

Download Options

Bias-Free Language

Contents

Introduction

Prerequisites

Requirements

Components Used

Capture Information

Diagnostics and Debug Logs

QPS License Information

System Statistics

Thread Configuration in Policy Builder

Fatal Error Log

Contributed by Cisco Engineers

Was this Document Helpful?

Contact Cisco

This Document Applies to These Products