MSC Server
Troubleshooting Guide
Version: V4.13.10
ZTE CORPORATION
No. 55, Hi-tech Road South, ShenZhen, P.R.China
Postcode: 518057
Tel: +86-755-26771900
Fax: +86-755-26770801
URL: http://ensupport.zte.com.cn
E-mail: support@zte.com.cn
LEGAL INFORMATION
Copyright © 2013 ZTE CORPORATION.
The contents of this document are protected by copyright laws and international treaties. Any reproduction or
distribution of this document or any portion of this document, in any form by any means, without the prior written
consent of ZTE CORPORATION is prohibited. Additionally, the contents of this document are protected by
contractual confidentiality obligations.
All company, brand and product names are trade or service marks, or registered trade or service marks, of ZTE
CORPORATION or of their respective owners.
This document is provided “as is”, and all express, implied, or statutory warranties, representations or conditions
are disclaimed, including without limitation any implied warranty of merchantability, fitness for a particular purpose,
title or non-infringement. ZTE CORPORATION and its licensors shall not be liable for damages resulting from the
use of or reliance on the information contained herein.
ZTE CORPORATION or its licensors may have current or pending intellectual property rights or applications
covering the subject matter of this document. Except as expressly provided in any written license between ZTE
CORPORATION and its licensee, the user of this document shall not acquire any license to the subject matter
herein.
ZTE CORPORATION reserves the right to upgrade or make technical change to this product without further notice.
Users may visit ZTE technical support website http://ensupport.zte.com.cn to inquire related information.
The ultimate right to interpret this product resides in ZTE CORPORATION.
Revision History
Intended Audience
This manual is intended for maintenance engineers.
Chapter Summary
2, Hardware Faults Describes how to handle hardware faults, including board, power
supply and fan.
Related Documentation
The following documentation is related to this manual:
Conventions
This manual uses the following typographical conventions:
Typeface Meaning
Italics Variables in commands. It may also refer to other related manuals and
documents.
Bold Menus, menu options, function names, input fields, option button names, check
boxes, drop-down lists, dialog box names, window names, parameters, and
commands.
Constant width Text that you type, program codes, filenames, directory names, and function
names.
[] Optional parameters.
{} Mandatory parameters.
Instrument-Using Skills
Skilled in using various instruments and meters. The most commonly used tools include
multi-meters and SS7 signaling analyzers.
1. Understand the situation: When a fault occurs, run simple service tests to assess
the fault.
2. Collect fault information: Collect or record detailed information about the fault,
including the symptom, alarms and working information displayed on the NMS,
operations you have made to handle this fault, and other information that you can
collect with the maintenance tools (such as signaling trace, failure observation, and
performance management).
3. Classify the fault: Based on the symptom and the information that you have collected,
analyze the fault initially and classify it.
4. Locate the fault: Locate the fault and find out the possible causes by analyzing the
work flows and NEs.
5. Remove the fault: Handle according to the fault location and possible causes.
6. Record the fault handling information: Record details about the fault handling,
including the symptom and the operations. Such information may be a helpful
reference for handling similar faults. You can design your own table to record the details or
use the recommended table in Appendix “B Troubleshooting Records”.
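Step 6 can be supported by a small structured record. The following is a minimal Python sketch only. The class name, the field names, and the sample values are illustrative assumptions and do not reproduce the table in Appendix B.

```python
from dataclasses import dataclass, field, asdict
from typing import List
import json


@dataclass
class FaultRecord:
    """One troubleshooting record (all field names are hypothetical)."""
    occurred_at: str                                      # when the fault was noticed
    symptom: str                                          # what was observed
    alarms: List[str] = field(default_factory=list)       # alarms shown on the NMS
    operations: List[str] = field(default_factory=list)   # handling actions, in order
    classification: str = ""                              # for example "hardware" or "system"
    resolved: bool = False

    def add_operation(self, description: str) -> None:
        """Append one handling action so that the order of actions is preserved."""
        self.operations.append(description)

    def to_json(self) -> str:
        """Serialize the record for archiving or for a support request."""
        return json.dumps(asdict(self), indent=2)


# Sample values for illustration only.
record = FaultRecord(occurred_at="2013-01-01 10:00", symptom="H/S indicator solid on")
record.add_operation("Secured the extractors")
record.resolved = True
print(record.to_json())
```

Keeping the operations in an ordered list matches the advice to record what was done and in what sequence, which is also the information requested when contacting ZTE.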
Precautions
When you are troubleshooting, note the following:
l Establish rules and regulations for fault handling and tracing, and make them known
to all maintenance personnel. Only authorized and relevant personnel are allowed to
participate in troubleshooting, to prevent misoperations from making faults
worse.
l Maintenance personnel should follow the instructions in the manuals of
the ZXUN iCX(MSCS). Before touching any hardware device, wear an
antistatic wrist strap to avoid hazards and accidents caused by human factors.
l Back up service data and system running settings. Make a detailed record of
the fault symptom, versions, configuration changes, and operations that you have
performed. Also collect other data about the fault. All of this data may be used for analyzing
and removing the fault.
l Trace and record in detail the handling of each fault. For a fault that may last for days,
keep detailed shift handover records so that responsibilities
are clear.
l Handle every fault promptly. In case of any fault that you cannot remove, contact ZTE
promptly.
In any of the following situations, you should contact ZTE for technical support.
à Emergency faults, for example, all services or some services interrupted
à Faults that you cannot remove with the methods described in this manual
à Faults that you cannot remove with your own knowledge
à Faults that you cannot remove by referring to the cases of successfully removing
similar faults
l Post a list of ZTE contacts in a conspicuous place, and remember to confirm and
update the contacts frequently.
ZTE Global Customer Support Center (GCSC) provides 7×24 technical support.
Contacts of GCSC:
à Tel: +86-755-26771900
à Fax: +86-755-26770801
à URL: http://ensupport.zte.com.cn
à E-mail: support@zte.com.cn
l When you are contacting ZTE for technical support, you may be required to provide
the following information:
à Detailed symptom about the fault, including time, place and events
à The calling number and the called number, and the time when the call was made
à Relevant alarms, performance statistical data, signaling trace result, and failure
observation result
à Operations that you have done after the fault occurred
à The way to remotely log in to the system and the telephone numbers of persons
for contact
System faults may comprise complete or partial power failures, network failures, database
faults and other faults. You can focus troubleshooting on these key aspects. When you have
confirmed that power and communication are normal, you can use the alarm management
function of the NMS to locate the node where the problem possibly lies.
l OMM
The OMM provides signaling trace, failure observation, performance management,
alarm management and log management tools. These tools may be useful for you to
locate and remove a fault.
à With the alarm management function, you can query and view the alarms that
have occurred due to a fault. Many alarms can directly tell you the causes of a
fault.
à Signaling trace and failure observation can help you to locate a current service
fault.
à With the log management function, you can see the operation logs of the OMM.
You can find out what operations were performed before an alarm, and then
analyze whether the fault is related to these operations.
à You may be unable to directly locate a fault with the performance statistics
function, but analysis of performance statistical data is a very useful aid.
l Ethereal network packet sniffing tool
Ethereal is a free program. You can use it to capture and save the network
interface data that you need and convert the data into a readable format.
l Instruments and meters
Commonly used instruments and meters including (flat-head and cross-head)
screwdrivers, a voltage tester, a signalling analyzer, network cable pliers and a
multi-meter should be ready at hand.
checking for the device that when it is installed the system becomes faulty again. This
device may be faulty, and you can replace it.
Note:
When you do troubleshooting with the minimum system method, be cautious that this
method might influence the services.
You can use the minimum system method for troubleshooting during the
commissioning procedure, and this method is not recommended if the equipment is
in commercial use.
l Signaling analysis
With the signaling trace function of the OMM, you can obtain and analyze the signaling.
By referring to the standard signaling, you can use the signaling analysis method to
locate a fault in inter-office signaling coordination.
l Failure code analysis
Failure code analysis is helpful for you to locate a service fault on the local office. The
system provides a failure code and reasons for every service failure.
l Performance statistics
Performance statistics help you to find out when a fault occurred and which
services and devices are affected.
Recommendations
l To troubleshoot a hardware fault, you can also check the indicators when you are
using a troubleshooting method such as comparison or replacement.
l To troubleshoot a software or service fault, you are recommended to use the
maintenance tools of the OMM, while analyzing logs and other data.
l It may be difficult to tell whether a fault is a hardware fault or a software fault. In this
case, you need to use multiple troubleshooting methods. Therefore, proficiency in all
these troubleshooting methods helps you to quickly remove a fault.
Indicator  Meaning                    Color      State
H/S        Hot-swap state indicator   Blue       Off: The device is working, and you should not unplug the board.
                                                 Solid on: The device is not working, and you can swap the board.
                                                 Flashing: The device is being activated or deactivated.
HD1/HD2    Hard disk state indicator  Red/green  Flashing in green: The disk is reading or writing.
                                                 On in red: The hard disk has failed or is not in position.
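The indicator states listed above lend themselves to a simple lookup. The following Python sketch is illustrative only. The dictionary and function names are assumptions, and only the states described in this table are covered.

```python
# Indicator states as described in the table above (names are hypothetical).
INDICATOR_STATES = {
    ("H/S", "off"): "The device is working; do not unplug the board.",
    ("H/S", "solid on"): "The device is not working; the board can be swapped.",
    ("H/S", "flashing"): "The device is being activated or deactivated.",
    ("HD1/HD2", "flashing green"): "The disk is reading or writing.",
    ("HD1/HD2", "on red"): "The hard disk has failed or is not in position.",
}


def describe_indicator(name: str, state: str) -> str:
    """Return the meaning of an indicator state, ignoring letter case."""
    key = (name.upper(), state.lower())
    return INDICATOR_STATES.get(key, "State not described in this table.")


print(describe_indicator("H/S", "solid on"))
```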
Table of Contents
Board Faults
Power Supply Faults
Fan Faults
Symptom Reference
The H/S indicator of a board is solid on 2.1.1 The H/S Indicator Is Solid On
The H/S indicator of a board is flashing 2.1.2 The H/S Indicator Is Flashing
The HOST indicator of a board is flashing in red 2.1.5 The HOST Indicator Is Flashing in Red
The HD1/HD2 indicator of a board is solid on in red 2.1.6 The HD1/HD2 Indicator Is Solid On in Red
Failure in board powering on/off when operated with the extractors 2.1.7 Failure in Board Powering On/Off When Operated with the Extractors
A board is abnormally powered off with the H/S indicator solid on 2.1.8 A Board Is Abnormally Powered Off with the H/S Indicator Solid On
Analysis
When a board is working properly, the hot-swap (H/S) indicator is off. If the H/S indicator
is solid on, the board is not in the working state.
Do the following to locate the fault:
l Check whether the extractors are secured.
l Check the contact between the front board and the rear board.
Handling
You are recommended to handle this fault in the following procedure.
1. Check whether the extractors of the board are secured, that is, whether the sliders of
the extractors are fastened.
l Yes→Go to Step 3.
l No→Go to Step 2.
2. Secure the extractors, and check whether the H/S indicator flashes for a little while
and turns off.
l Yes→End.
l No→Go to Step 3.
3. Open the extractors and then fasten them, and check whether the H/S indicator
flashes for a little while and turns off.
l Yes→End.
l No→Go to Step 4.
Note:
If the H/S indicator is always flashing, handle by referring to “2.1.2 The H/S Indicator
Is Flashing”.
Confirmation
The H/S indicator of a board is off.
Analysis
When a board is working, the H/S indicator is off. If the indicator is flashing, the board has
not started up.
Handling
You are recommended to handle this fault in the following procedure.
1. Open the extractors of the board and then fasten them, and check whether the
indicator turns off after flashing for a little while.
l Yes→End.
l No→Go to Step 2.
2. Open the extractors of the NCMM to do active/standby switchover and then secure
the extractors after the switchover. See whether the indicator turns off after
flashing for a little while.
l Yes→End.
l No→Go to Step 3.
3. Open the extractors of the board and then fasten them, and check whether the
indicator turns off after flashing for a little while.
l Yes→End.
l No→Go to Step 4.
4. Contact ZTE for help.
Confirmation
The H/S indicator is solid off.
Analysis
If the OK indicator is off, the board has not started up successfully.
Handling
You are recommended to handle this fault in the following procedure.
1. Check whether the IPMC version of this board is being updated.
l Yes→Go to Step 2.
l No→Go to Step 3.
2. Wait for the update to complete, and then check whether the OK indicator is flashing
in green.
l Yes→End.
l No→Go to Step 3.
3. Check whether this board is being powered off or on.
l Yes→Go to Step 4.
l No→Go to Step 5.
4. Wait for the board to start up, and then check whether the OK indicator is flashing in
green.
l Yes→End.
l No→Go to Step 5.
5. Reboot the board and check whether the OK indicator is flashing in green.
l Yes→End.
l No→Go to Step 6.
6. Contact ZTE for help.
Confirmation
The OK indicator of the board is flashing in green.
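The Yes/No procedures in this chapter can be read as small decision tables. As a minimal sketch, the procedure above for an OK indicator that is off could be encoded as follows. The names OK_INDICATOR_OFF and walk, and the markers "END" and "ZTE", are illustrative assumptions and are not part of any ZTE tool.

```python
from typing import Callable, Dict, List, Tuple, Union

# Each step maps to (question, next step if Yes, next step if No).
# "END" means the fault is removed; "ZTE" means contact ZTE for help.
Target = Union[int, str]
Step = Tuple[str, Target, Target]

OK_INDICATOR_OFF: Dict[int, Step] = {
    1: ("Is the IPMC version of the board being updated?", 2, 3),
    2: ("After the update completes, is the OK indicator flashing in green?", "END", 3),
    3: ("Is the board being powered off or on?", 4, 5),
    4: ("After the board starts up, is the OK indicator flashing in green?", "END", 5),
    5: ("After rebooting the board, is the OK indicator flashing in green?", "END", "ZTE"),
}


def walk(steps: Dict[int, Step], answer: Callable[[str], bool],
         start: int = 1) -> Tuple[List[int], str]:
    """Follow the Yes/No branches and return the visited steps and the outcome."""
    path: List[int] = []
    current: Target = start
    while isinstance(current, int):
        question, on_yes, on_no = steps[current]
        path.append(current)
        current = on_yes if answer(question) else on_no
    return path, current


# Example: every check is answered "No", so the procedure ends by escalating.
print(walk(OK_INDICATOR_OFF, lambda question: False))
```

Writing a procedure this way makes it easy to verify that every branch leads either to an end state or to an escalation.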
Analysis
If the OK indicator is on in red, there are alarms for this board.
Handling
You are recommended to handle this fault in the following procedure.
1. In the Fault Management tab page of the Local Maintenance Terminal, check
whether there are alarms for this board.
l Yes→Go to Step 2.
l No→End.
2. Handle and remove the alarms by referring to the recommended measures attached
to the alarm information, and see whether this fault is removed.
l Yes→End.
l No→Go to Step 3.
3. Contact ZTE for help.
Confirmation
The OK indicator is flashing in green.
Analysis
If the HOST indicator flashes in red, the board may be working abnormally or have alarms.
The faster the indicator flashes, the higher the alarm level.
Handling
You are recommended to handle this fault in the following procedure.
1. In the Fault Management tab page of the Local Maintenance Terminal, check
whether there are alarms for this board.
l Yes→Go to Step 2.
l No→End.
2. Handle and remove the alarms by referring to the recommended measures attached
to the alarm information, and see whether this fault is removed.
l Yes→End.
l No→Go to Step 3.
3. Contact ZTE for help.
Confirmation
The HOST indicator is flashing in green.
Analysis
If the HD1/HD2 indicator of a GPBB0/GPBX1 board is solid on in red, the corresponding
hard disk is faulty.
Handling
You are recommended to handle this fault in the following procedure.
1. Replace the faulty hard disk, and then check whether the HD1/HD2 indicator becomes
normal.
l Yes→End.
l No→Go to Step 2.
2. Contact ZTE for help.
Confirmation
The HD1/HD2 indicator flashes in green when the board is reading data from or writing
data into the hard disk. At other times, the indicator is off.
Analysis
If a board cannot power on after the extractors are closed or cannot
power off after the extractors are opened, the sliders of the extractors
may be damaged.
Handling
You are recommended to handle this fault in the following procedure.
Confirmation
The board can successfully power on and the HOST indicator shows no alarm.
Analysis
The fault may be caused by the internal control mechanism. If the temperature is too
high, the self-protection process starts and powers off the board.
Handling
You are recommended to handle this fault in the following procedure.
1. In the Fault Management tab page of the Local Maintenance Terminal, check
whether there are temperature related alarms.
l Yes→Go to Step 2.
l No→Go to Step 4.
2. Check the equipment room temperature and see whether the environmental
temperature is higher than the required temperature.
l Yes→Go to Step 3.
l No→Go to Step 4.
3. Lower the environmental temperature, and check whether the board is powered
on and the H/S indicator is off.
l Yes→End.
l No→Go to Step 4.
4. Contact ZTE for help.
Confirmation
The board is powered on and the H/S indicator is off.
Symptom Reference
All the devices or half of the devices in a shelf are powered off 2.2.3 All the Devices or Half of the Devices in a Shelf Are Powered Off
Half of the devices in a shelf are powered off after a power supply module is removed 2.2.4 Half of the Devices in a Shelf Are Powered Off After a PEM Is Removed
Some boards are suddenly powered off 2.2.5 Some Boards Are Suddenly Powered Off
Analysis
When a power supply module is working properly, the H/S indicator is off. If the H/S
indicator is solid on, the power supply module is not in the working state.
Handling
You are recommended to handle this fault in the following procedure.
1. Press the recessed H/S key on the panel with a pointed tool (such as forceps), and
check whether the H/S indicator flashes and then turns off.
l Yes→End.
l No→Go to Step 2.
2. Remove and then install the power supply module, and check whether the H/S
indicator flashes and then turns off.
l Yes→End.
l No→Go to Step 3.
Note:
If the H/S indicator is always flashing, handle by referring to “2.2.2 The H/S Indicator
Is Flashing”.
Confirmation
The H/S indicator of the power supply module is off.
Analysis
When a power supply module is working, the H/S indicator is off. If the indicator is flashing,
the power supply module has not started up.
Handling
You are recommended to handle this fault in the following procedure.
1. Press the recessed H/S key on the panel with a pointed tool (such as forceps), and
check whether the H/S indicator flashes and then turns off.
l Yes→End.
l No→Go to Step 2.
2. Open the extractors of the NCMM to do active/standby switchover and then secure the
extractors after the switchover. See whether the indicator of the power supply module
flashes and then turns off.
l Yes→End.
l No→Go to Step 3.
3. Press the recessed H/S key on the panel with a pointed tool (such as forceps), and
check whether the H/S indicator flashes and then turns off.
l Yes→End.
l No→Go to Step 4.
4. Contact ZTE for help.
Confirmation
The H/S indicator is solid off.
2.2.3 All the Devices or Half of the Devices in a Shelf Are Powered
Off
Symptom
All the devices or half of the devices in a shelf are suddenly powered off.
Analysis
The external power supply may have failed.
Handling
You are recommended to handle this fault in the following procedure.
1. Check whether the UPS is normal.
l Yes→Go to Step 2.
l No→Check and restore the power supply.
2. Check whether any power state indicator (-48/-60 VA or -48/-60 VB) on the panel of
the power supply module is on in red.
l Yes→Go to Step 3.
l No→Go to Step 5.
3. Check whether the switch of the power supply module is turned off.
l Yes→Go to Step 4.
l No→Go to Step 5.
4. Turn on the two switches, and check whether the fault is resolved.
l Yes→End.
l No→Go to Step 5.
5. Check the -48 V/-60 V and RTN terminals of the PEM with a multimeter and see
whether the voltages are normal and whether the polarities are correct.
l Yes→Go to Step 8.
l No→Go to Step 6.
6. According to the engineering documentation, check whether the switches or lines that
provide power to the cabinet have a problem.
l Yes→Go to Step 7.
l No→Go to Step 8.
7. Remove the problem in the switches or lines, and then check whether the fault is
resolved.
l Yes→End.
l No→Go to Step 8.
8. Contact ZTE for help.
Confirmation
The devices can be successfully powered on and operate properly.
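Step 5 of the procedure above checks both the voltage and the polarity at the PEM terminals. The following Python sketch shows one way to classify a multimeter reading. The function name is hypothetical, and the acceptable voltage limits are not stated in this manual, so they must be supplied by the caller from the equipment specification. The limits in the example call are placeholders only.

```python
def check_pem_input(measured_v: float, min_abs_v: float, max_abs_v: float) -> str:
    """Classify a reading taken between the -48 V/-60 V terminal and RTN.

    measured_v is the meter reading with the reference lead on RTN, so a
    correctly wired input reads as a negative value.
    min_abs_v and max_abs_v are the acceptable magnitudes (assumed inputs).
    """
    if measured_v > 0:
        return "reversed polarity"
    if abs(measured_v) < min_abs_v:
        return "voltage too low"
    if abs(measured_v) > max_abs_v:
        return "voltage too high"
    return "normal"


# Placeholder limits for illustration; use the values from the equipment specification.
print(check_pem_input(-53.5, min_abs_v=40.0, max_abs_v=57.0))
```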
2.2.4 Half of the Devices in a Shelf Are Powered Off After a PEM Is
Removed
Symptom
Half of the devices in a shelf are powered off after a power supply module is removed.
Analysis
The two power supply modules work in hot-backup redundancy mode. If half of the devices
in a shelf are powered off after a power supply module is removed, the wiring may be
incorrect.
Handling
You are recommended to handle this fault in the following procedure.
1. Check the engineering documentation and see whether the wiring of power supply
lines is correct.
l Yes→Go to Step 3.
l No→Go to Step 2.
2. Correct the wiring according to the engineering documentation, and check whether the
fault is resolved.
l Yes→End.
l No→Go to Step 3.
3. Contact ZTE for help.
Confirmation
The devices in the cabinet work properly after a power supply module is removed.
Analysis
This fault may be due to the following causes:
l The voltage and power of the power supply to this shelf are abnormal.
l The contacts of power supply lines are of poor quality.
Handling
You are recommended to handle this fault in the following procedure.
1. Check whether voltage and power of the power supply to this shelf are high enough.
l Yes→Go to Step 2.
l No→Correct the power supply problem.
2. Check whether the contacts of the power supply lines are in good condition. There
should be no hot spots and no loose connections, especially at the crimp connections in the
distribution box.
l Yes→Go to Step 4.
l No→Go to Step 3.
3. Correct the wiring by referring to the engineering documentation, and check whether
the fault is resolved.
l Yes→End.
l No→Go to Step 4.
4. Contact ZTE for help.
Confirmation
The boards in the shelf are powered on, and the devices function properly.
Symptom Reference
The RUN indicator is flashing in green but the fans always run in the full speed 2.3.3 The RUN Indicator Is Flashing in Green but the Fans always Run in the Full Speed
The RUN indicator is off but the fans always run in the full speed 2.3.4 The RUN Indicator Is Off but the Fans always Run in the Full Speed
Analysis
When a fan module is working properly, the H/S indicator is off. If the H/S indicator is solid
on, the fan module is not in the working state.
Handling
You are recommended to handle this fault in the following procedure.
1. Press the recessed hot-swap key on the panel with a pointed tool (such as forceps),
and check whether the H/S indicator flashes and then turns off.
l Yes→End.
l No→Go to Step 2.
2. Remove and then install the fan module, and check whether the H/S indicator flashes
and then turns off.
l Yes→End.
l No→Go to Step 3.
Note:
If the H/S indicator is always flashing, handle by referring to “2.3.2 The H/S Indicator
Is Flashing”.
Confirmation
The H/S indicator of the fan module is off.
Analysis
When a fan module is working, the H/S indicator is off. If the H/S indicator is flashing, the
fan module has not started up.
Handling
You are recommended to handle this fault in the following procedure.
1. Press the recessed H/S key on the panel with a pointed tool (such as forceps), and
check whether the H/S indicator flashes and then turns off.
l Yes→End.
l No→Go to Step 2.
2. Open the extractors of the NCMM to do active/standby switchover and then secure
the extractors after the switchover. See whether the H/S indicator of the fan module
flashes and then turns off.
l Yes→End.
l No→Go to Step 3.
3. Press the recessed H/S key on the panel with a pointed tool (such as forceps), and check
whether the H/S indicator flashes and then turns off.
l Yes→End.
l No→Go to Step 4.
4. Contact ZTE for help.
Confirmation
The H/S indicator is solid off.
2.3.3 The RUN Indicator Is Flashing in Green but the Fans always
Run in the Full Speed
Symptom
The RUN indicator on the panel of the fan module is normal (flashing in green), but the
speed of the fans cannot be adjusted and the fans always run at full speed.
Analysis
The system may have an over-temperature alarm, or parts of the fan may be damaged.
Handling
You are recommended to handle this fault in the following procedure.
1. In the Fault Management tab page of the Local Maintenance Terminal, check
whether there is any over-temperature alarm.
l Yes→Go to Step 2.
l No→Go to Step 3.
2. Handle and remove the alarms by referring to the recommended measures attached
to the alarm information, and then check whether the fault is resolved.
l Yes→Go to Step 3.
l No→Go to Step 4.
3. Check whether the parts of the fan are damaged.
l Yes→Go to Step 4.
l No→Go to Step 5.
4. Replace the fan, and then check whether the fault is resolved.
l Yes→End.
l No→Go to Step 5.
5. Contact ZTE for help.
Confirmation
The fans are working properly and the rotation speed can be adjusted.
2.3.4 The RUN Indicator Is Off but the Fans always Run in the Full
Speed
Symptom
The fans always run at full speed, but the RUN indicator is not flashing in green.
Analysis
The fault is caused by a failure of the NCMM to control the fan module.
Handling
You are recommended to handle this fault in the following procedure.
1. Remove the fan module and plug it in again. Check whether the fault is resolved.
l Yes→End.
l No→Go to Step 2.
2. Contact ZTE for help.
Confirmation
The fans are working properly and the rotation speed can be adjusted.
Table of Contents
Communication Faults
Authority Faults
Database Faults
Performance Management Faults
Alarm Management Faults
Symptom Reference
Timeout when the LMT transfers data to the OMP 3.1.1 Timeout When the LMT Transfers Data to the OMP
IE cannot access the OMM server 3.1.2 IE Cannot Access the OMM Server
Analysis
The cable connection between the OMM server and the OMP is faulty or the link has not
been created.
Handling
You are recommended to handle this fault in the following procedure.
1. On the OMM server, ping the IP address of the OMP and see whether the
communication is through.
l Yes→Go to Step 6.
l No→Go to Step 2.
2. Check whether the OMM server and the OMP are connected through the network cable
of a GPI1 card.
l Yes→Go to Step 3.
l No→Go to Step 6.
3. Check whether any network cable connector is loose between the OMM server and
the OMP.
l Yes→Go to Step 4.
l No→Go to Step 5.
4. Secure the cable connection, and see whether the fault is resolved.
l Yes→Go to Step 5.
l No→Go to Step 6.
5. In the Terminal tab page of the Local Maintenance Terminal, run the
SYNA:STYPE="ALL" command to transfer the data to the OMP, and see whether the system
returns a timeout error.
l Yes→Go to Step 6.
l No→End.
6. In the Terminal tab page of the Local Maintenance Terminal, run the CHECK OMC
LINK command (for showing the links between the OMM server and the OMP) and
see whether the returned result is null (no link).
l Yes→Go to Step 7.
l No→Go to Step 8.
7. In the Terminal tab page of the Local Maintenance Terminal, run the
SET OMP:RELINK="YES" command to recreate the link, and see whether the fault is resolved.
l Yes→End.
l No→Go to Step 8.
8. Contact ZTE for help.
Confirmation
Run the SYNA:STYPE="ALL" command to transfer the data to the OMP. The operation
should succeed.
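The ping check in Step 1 can also be scripted from the OMM server. The following Python sketch is an illustration under stated assumptions: the function names are hypothetical, the address 192.0.2.1 is a documentation placeholder and not a real OMP address, and an exit code of 0 from ping is treated as "communication is through".

```python
import platform
import subprocess
from typing import Callable, List, Optional


def build_ping_command(host: str, count: int = 3) -> List[str]:
    """Build a ping command; Windows uses -n for the count, other systems use -c."""
    count_flag = "-n" if platform.system() == "Windows" else "-c"
    return ["ping", count_flag, str(count), host]


def _run_quietly(command: List[str]) -> int:
    """Run a command, discard its output, and return its exit code."""
    return subprocess.run(
        command, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL
    ).returncode


def is_reachable(host: str, run: Optional[Callable[[List[str]], int]] = None) -> bool:
    """Return True if ping exits with code 0."""
    runner = run if run is not None else _run_quietly
    return runner(build_ping_command(host)) == 0


# Example with a stand-in runner, so that no real network access is needed.
print(is_reachable("192.0.2.1", run=lambda command: 0))
```

Passing the runner as a parameter keeps the check testable on a machine that has no route to the OMP. In normal use, call is_reachable with the OMP address only.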
Analysis
The possible causes of this fault are the following:
l The network communication between the OMM server and the computer where
IE runs is broken.
l The OMM server processes have not started up successfully.
l The HTTP service on the OMM server is not started.
Handling
You are recommended to handle this fault in the following procedure.
1. On the computer of the IE, ping the IP address of the OMM server and see whether
the communication is through.
l Yes→Go to Step 4.
l No→Go to Step 2.
2. Check whether the network communication between the OMM server and the
computer is normal and whether the network cable is loose.
l Yes→Go to Step 4.
l No→Go to Step 3.
3. Secure the cable connection and see whether the fault is resolved.
l Yes→Go to Step 4.
l No→Go to Step 5.
4. Access the OMM server with the IE, and check whether the OMM interface appears.
l Yes→End.
l No→Go to Step 5.
5. In the Terminal window of the OMM server, run the ps -ef|grep service command to check
whether all OMM server processes have started up.
l Yes→Go to Step 7.
l No→Go to Step 6.
6. Restart the OMM server and see whether the fault is resolved.
l Yes→Go to Step 7.
l No→Go to Step 8.
7. Access the OMM server with the IE, and check whether the OMM interface appears.
l Yes→End.
l No→Go to Step 8.
8. In the Terminal window of the OMM server, run the service httpd status command to
check whether the HTTP service has started up.
l Yes→Go to Step 10.
l No→Go to Step 9.
9. In the Terminal window of the OMM server, run the service httpd start command to
start the HTTP service and see whether the fault is resolved.
l Yes→Go to Step 10.
l No→Go to Step 11.
10. Access the OMM server with the IE, and check whether the OMM interface appears.
l Yes→End.
l No→Go to Step 11.
11. Contact ZTE for help.
Confirmation
The OMM interface appears when Internet Explorer is used to access the OMM
server.
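Before opening the browser, it can be useful to confirm that the HTTP service on the OMM server accepts connections at all. The following Python sketch tests whether a TCP port is open. The function name is hypothetical, and port 80 is an assumption because this manual does not state which port the OMM web interface uses.

```python
import socket


def http_port_open(host: str, port: int = 80, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port can be established."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False
```

A successful connection only shows that something is listening on the port. It does not prove that the OMM server processes have started, so the process and service checks in Steps 5 and 8 are still needed.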
Analysis
The permissions may not be correctly assigned to this user, for example:
Handling
You are recommended to handle this fault in the following procedure.
1. In the Terminal tab page of the Local Maintenance Terminal, run the SHOW USER
command to check whether this user is valid.
l Yes→Go to Step 3.
l No→Go to Step 2.
2. In the Terminal tab page of the Local Maintenance Terminal, run the SET USER
command with the Valid User parameter set to Yes. Check whether the fault is
resolved.
l Yes→End.
l No→Go to Step 3.
3. Assign permissions to this user (if you are allowed to do so). In the Terminal tab page
of the Local Maintenance Terminal, run the SHOW USER ROLE command to check whether
related roles are assigned to this user.
l Yes→Go to Step 7.
l No→Go to Step 4.
4. Assign permissions to this user (if you are allowed to do so). In the Terminal tab page
of the Local Maintenance Terminal, run the SHOW ROLE command to check whether this role
is valid.
l Yes→Go to Step 6.
l No→Go to Step 5.
5. Assign permissions to this user (if you are allowed to do so). In the Terminal tab
page of the Local Maintenance Terminal, run the ADD ROLE command to add the role. Check
whether the fault is resolved.
l Yes→End.
l No→Go to Step 6.
6. In the Terminal tab page of the Local Maintenance Terminal, run the ADD USER
ROLE command to assign the related roles to the user. Check whether the fault is resolved.
l Yes→End.
l No→Go to Step 7.
7. Assign permissions to this role (if you have the permission to do so). In the Terminal
tab page of the Local Maintenance Terminal, run the SHOW ROLE CMDSET command to
check whether related operation permissions are assigned to the role.
l Yes→Go to Step 9.
l No→Go to Step 8.
8. In the Terminal tab page of the Local Maintenance Terminal, run the ADD ROLE CMDSET
command to assign operation permissions to the role. Check whether the fault is resolved.
l Yes→End.
l No→Go to Step 9.
9. Contact ZTE for help.
Confirmation
The user can operate the LMT with the assigned permissions.
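The procedure above checks a chain: the user must be valid, a role must be assigned, the role must exist, and the role must include the required operation permission. The following Python sketch models that chain and reports the first broken link. The function name, the data structures, and the sample user and role names are hypothetical. In practice the data comes from the output of the SHOW USER, SHOW USER ROLE, SHOW ROLE and SHOW ROLE CMDSET commands.

```python
from typing import Dict, Set


def first_permission_gap(
    user: str,
    command: str,
    valid_users: Set[str],
    user_roles: Dict[str, Set[str]],
    existing_roles: Set[str],
    role_commands: Dict[str, Set[str]],
) -> str:
    """Return the first broken link in the user -> role -> command chain."""
    if user not in valid_users:
        return "user is not valid"
    roles = user_roles.get(user, set())
    if not roles:
        return "no role is assigned to the user"
    usable_roles = roles & existing_roles
    if not usable_roles:
        return "the assigned role does not exist"
    if not any(command in role_commands.get(role, set()) for role in usable_roles):
        return "no assigned role grants the command"
    return "ok"


# Sample data for illustration only.
valid_users = {"alice"}
user_roles = {"alice": {"maintainer"}}
existing_roles = {"maintainer"}
role_commands = {"maintainer": {"SHOW TIME"}}
print(first_permission_gap("alice", "UPD TIME", valid_users, user_roles,
                           existing_roles, role_commands))
```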
Symptom Reference
Analysis
Possible causes:
l The Firebird service on the OMM server has not started up.
l The Firebird database is faulty and inaccessible.
Handling
You are recommended to handle this fault in the following procedure.
1. In the Terminal window of the OMM server, run the service firebird status command
and check whether the database service has started up.
l Yes→Go to Step 3.
l No→Go to Step 2.
2. Run the service firebird start command to start the database service, and check
whether the fault is resolved.
l Yes→End.
l No→Go to Step 3.
3. Enter the /opt/firebird/bin directory and run the following command to check
whether the database is accessible.
For how to install the Firebird database, refer to Section “Configuring System Services”
of ZXUN iCX(MSCS) MSC Server Software Installation Guide.
l Yes→End.
l No→Go to Step 5.
5. Contact ZTE for help.
Confirmation
Commands can be executed on the OMM interface.
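Steps 1 and 2 of this procedure can be sketched as a small script. This is only an illustration: it assumes that the service firebird status command returns exit code 0 when the service is running, which is a common init-script convention and is not stated in this guide. The function name and the injectable runner are hypothetical.

```python
import subprocess
from typing import Callable, Optional, Sequence

# A runner takes a command line and returns its exit code.
Runner = Callable[[Sequence[str]], int]


def _default_runner(argv: Sequence[str]) -> int:
    # Run the command on the OMM server and return its exit code.
    return subprocess.run(list(argv), capture_output=True).returncode


def check_firebird(run: Optional[Runner] = None) -> str:
    """Return 'running', 'started' or 'failed' for the Firebird service."""
    run = run or _default_runner
    # Step 1: check the service status (assumption: exit code 0 means running).
    if run(["service", "firebird", "status"]) == 0:
        return "running"
    # Step 2: try to start the service, then check the status again.
    run(["service", "firebird", "start"])
    if run(["service", "firebird", "status"]) == 0:
        return "started"
    # The service could not be started; continue with the next step.
    return "failed"
```

A result of "failed" corresponds to continuing with the database access check in Step 3.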
Analysis
The export function of the Local Maintenance Terminal requires the ActiveX controls
option to be enabled, but this option is disabled in IE by default.
Handling
1. Open the IE on the Local Maintenance Terminal.
2. Enter the address of the OMM server in the Address Bar of the IE. A prompt is
displayed; see Figure 3-1.
Note:
The dialog box is displayed only the first time you visit the OMM server address.
3. Click the dialog box and select Add-on Disabled > Run Add-on from the shortcut menu.
4. Click the Running button to install “Zte OMM Assistant ActiveX Control Module” in the
message box.
5. After it is installed, log in to the OMM server again and check whether the export
function is available.
l Yes→End.
l No→Go to Step 6.
6. Contact ZTE for help.
Confirmation
After a Local Maintenance Terminal logs in to the OMM server, the export function is
available.
Symptom                                    Reference
Time delay in performance data reporting   3.4.1 Time Delay in Performance Data Reporting
Analysis
The time on the OMP module differs from the time on the OMM server.
Handling
You are recommended to handle this fault in the following procedure.
1. In the Terminal tab page of the Local Maintenance Terminal, run the SHOW TIME
command to check whether the OMP time is the same as the OMM server time.
l Yes→Go to Step 3.
l No→Go to Step 2.
2. In the Terminal tab page of the Local Maintenance Terminal, run the SYNC NTPTIME
command to synchronize the OMP time with the OMM server, or run the UPD TIME command
to change the OMP time. Check whether the fault is resolved.
l Yes→End.
l No→Go to Step 3.
3. Contact ZTE for help.
Confirmation
After a measurement task is activated, wait for several collection granularities and then
check the collected performance data. The collected performance data should be consistent
with the collection granularities.
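The time comparison in Step 1 can be sketched as follows. This is a minimal illustration, not part of the product: the function name and the one-second tolerance are assumptions, and the two timestamps are expected to be read manually from the SHOW TIME output and from the OMM server clock.

```python
from datetime import datetime, timedelta


def clocks_match(omp_time: datetime, omm_time: datetime,
                 tolerance: timedelta = timedelta(seconds=1)) -> bool:
    """Return True if the two clocks differ by no more than the tolerance.

    The default tolerance is an assumed value; choose one that is small
    compared with the collection granularity in use.
    """
    return abs(omp_time - omm_time) <= tolerance
```

If the function returns False, the OMP time should be synchronized or corrected as described in Step 2.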
Analysis
Possible causes:
l The link between the OMP and the OMM server is abnormal.
l The time difference between the OMP and the OMM server is too big.
l The OMM server processes are abnormal.
Handling
You are recommended to handle this fault in the following procedure.
1. In the Terminal tab page of the Local Maintenance Terminal, run the SHOW TIME
command to check whether the OMP time is the same as the OMM server time.
l Yes→Go to Step 3.
l No→Go to Step 2.
2. In the Terminal tab page of the Local Maintenance Terminal, run the SYNC NTPTIME
command to synchronize the OMP time with the OMM server, or run the UPD TIME command
to change the OMP time. Check whether the fault is resolved.
l Yes→End.
l No→Go to Step 3.
3. In the Terminal tab page of the Local Maintenance Terminal, run the CHECK OMC LINK
command to check whether the link between the OMP and the OMM server is created.
The returned result should not be null.
l Yes→Go to Step 5.
l No→Go to Step 4.
4. In the Terminal tab page of the Local Maintenance Terminal, run the SET OMP:RELINK="YES"
command to create the link again. Check whether the fault is resolved.
l Yes→End.
l No→Go to Step 5.
5. In the Terminal window of the OMM server, run the ps -ef|grep service command to check
whether all OMM server processes have started up.
l Yes→Go to Step 7.
l No→Go to Step 6.
6. Restart the OMM server and check whether the fault is resolved.
l Yes→End.
l No→Go to Step 8.
7. Reboot the board and check whether the fault is resolved.
l Yes→End.
l No→Go to Step 8.
8. Contact ZTE for help.
Confirmation
Create and activate a performance measurement task. After 10 data collection
granularities, performance data should be reported.
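The process check in Step 5 can be sketched as a small parser for the ps -ef output. This is an illustration only: this guide does not list the OMM server process names, so the names and the path in the usage example are hypothetical placeholders.

```python
from typing import Iterable, List


def missing_processes(ps_output: str, expected: Iterable[str]) -> List[str]:
    """Return the expected process names that do not appear in the ps output."""
    # Drop the grep process itself, which always matches its own pattern.
    lines = [ln for ln in ps_output.splitlines() if "grep" not in ln.split()]
    return [name for name in expected
            if not any(name in ln for ln in lines)]


# Hypothetical sample output of "ps -ef|grep service".
sample = (
    "root 101 1 0 10:00 ? 00:00:01 /opt/omm/service_a\n"
    "root 202 1 0 10:00 pts/0 00:00:00 grep service\n"
)
not_running = missing_processes(sample, ["service_a", "service_b"])
```

An empty result corresponds to the Yes branch of Step 5. Any name in the result corresponds to the No branch.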
Symptom                                          Reference
Alarm filtering rules do not work                3.5.1 Alarm Filtering Rules Do Not Work
Alarms cannot reach a well-connected alarm box   3.5.2 Alarms Cannot Reach a Well-Connected Alarm Box
Analysis
An alarm filtering rule can filter only the alarms that occur after the rule is created and
activated.
Handling
You are recommended to handle this fault in the following procedure.
1. In the Fault Management tab page of the Local Maintenance Terminal, manually
filter the alarms and then check whether the same alarms are filtered. The same
alarms may also occur but should be filtered automatically.
l Yes→End.
l No→Go to Step 2.
2. Contact ZTE for help.
Confirmation
After an alarm filtering rule is created and activated, newly generated alarms that meet the
conditions of the rule are automatically filtered.
Analysis
An alarm box receives only the alarms that occur after the alarm box is configured.
Handling
You are recommended to handle this fault in the following procedure.
1. Check whether new alarms are reported to the alarm box.
l Yes→End.
l No→Go to Step 2.
2. Contact ZTE for help.
Confirmation
New alarms generated after an alarm box is configured are sent to the alarm box.
Table of Contents
Basic Call Service Faults
SMS Faults
Location Update Service Faults
1. The calling subscriber MS1 dials the phone number of MS2, and MSCS1 is informed
through the RNS/BSS system.
2. MSCS1 analyzes the phone number of the called subscriber MS2, finds the HLR to which
MS2 is homed, and sends a routing request to that HLR.
3. HLR queries the current location information of MS2, and finds that MS2 is served by
MSCS2/VLR2. HLR sends a request for the routing information to MSCS2/VLR2.
4. MSCS2/VLR2 allocates the routing information (that is, the roaming number MSRN)
and sends the MSRN to HLR.
5. HLR sends the MSRN to the calling party MSCS1.
6. MSCS1 establishes the call with MSCS2 according to the MSRN.
7. MSCS2/VLR2 sends the paging message to the called subscriber MS2.
8. MSCS2/VLR2 receives a message indicating that MS2 is accessible.
9. MSCS2 sends a request for establishing the bearer terminal to the MGW2.
10. MSCS2 sends an APM signal to MSCS1, carrying the call bearer address information.
11. MSCS1 sends the bearer address information of the call and the request for
establishing the bearer terminal to MGW1.
12. The bearer between MGW1 and MGW2 is successfully established. Meanwhile, the
call circuit between MGW and MS is successfully set up.
13. MSCS1 sends the ring back tone to the calling subscriber. If the called subscriber
goes off-hook at this time, the called office sends the answer signal to the calling office.
The MSCS of each party then asks its MGW to connect the speech channel.
The calling subscriber and the called subscriber can communicate normally.
A basic call procedure involves such NEs as RNS/BSS, MSCS, MGW and HLR.
In CS domain, the CN-related NEs are MSCS and MGW. The MSCS is responsible for
call connection and control. The MGW is responsible for establishing and maintaining the
bearing channel, and providing various resources.
Fault Phenomena
l After successfully updating the location, the subscriber cannot originate a call
normally.
l After successfully updating the location, the subscriber cannot receive a call normally.
l There are a great number of call losses in the Failure Observation tab page.
Fault Handling
For the flow of troubleshooting the call service fault, see Figure 4-2.
3. If some signaling can be successfully traced, analyze it. If the signaling analysis
suggests that the fault is caused by the local office, open the Failure Observation
tab page to check the failure reason in the system, and locate the internal fault.
4. If it is an interconnection fault, contact the opposite-end office to handle it cooperatively.
5. If the call failure happens to many subscribers, check whether the links between the
local office and important office directions are normal, and whether all SMPs in the local
office are normal.
6. Contact ZTE for help.
1. Check the MM message in the ZXUN iCX(MSCS), and find that the message call has
been established with normal signaling flow.
2. Check the IAM field in the ZXUN iCX(MSCS) and find that the sent roaming number is
1585310034FF, which contains FF (two Fs). F is the flag that marks the end of the
valid digits.
3. The roaming number contained in the SRI ack message is 1585310034F in the ZXUN
iCX(MSCS), so the extra F in the IAM must have been added by the opposite-end MSCS.
Analysis flow:
l The roaming number of the opposite end is 11 digits, but that of the ZXUN iCX(MSCS) is
10 digits. The opposite-end MSCS receives 1585310034F from the ZXUN iCX(MSCS).
After receiving 11 digits, the opposite-end MSCS adds F as the 12th digit. That is why
there are two Fs in the IAM message.
l This number is sent to the ZXUN iCX(MSCS) for the number analysis. Because F
is the end flag, the ZXUN iCX(MSCS) analyzes 1585310034F during the number
analysis, but there is no matched number analysis data. Therefore, the system
considers it a vacant number and sends a REL to the opposite-end office, causing
the service failure.
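The digit handling described in this analysis can be reproduced with a small model. This is a simplified sketch, not the actual number analysis of either office: the two functions are hypothetical, and only the digit strings are taken from the case.

```python
END_FLAG = "F"


def peer_append_end_flag(received: str, peer_msrn_length: int) -> str:
    """Model the opposite-end MSCS: treat the first peer_msrn_length symbols
    as the roaming number and append the end flag."""
    return received[:peer_msrn_length] + END_FLAG


def digits_for_analysis(called_number: str) -> str:
    """Model the local number analysis: drop one trailing end flag."""
    if called_number.endswith(END_FLAG):
        return called_number[:-1]
    return called_number


local_msrn = "1585310034"          # 10-digit roaming number of the local office
sri_ack_number = "1585310034F"     # number carried in the SRI ack message

# The opposite end expects 11 digits, so it keeps the F and adds another one.
iam_number = peer_append_end_flag(sri_ack_number, 11)

# The local office strips one end flag and analyzes what is left.
analyzed = digits_for_analysis(iam_number)
is_vacant = analyzed != local_msrn
```

In this model the analyzed number still carries one F, so it does not match the 10-digit roaming number and is treated as vacant, which matches the REL observed in the case.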
Fault Handling
You are recommended to handle this fault in the following procedure.
1. In the Terminal tab page, run the ADD MRNPFX command to configure the roaming
number of the local end as 11 digits.
2. Test again and the dial-up test is successful. The problem is solved.
Check the GT data about this subscriber on the tandem office and the local gateway office,
and find that the GT translation incorrectly points to another local HLR.
Analysis flow:
l When a subscriber homed to another mobile or fixed network calls this subscriber,
the signaling should be transferred to the local gateway office. The local gateway office
initiates a routing information request to the HLR. Therefore, there is probably a problem
with the local gateway office, whose GT analysis for this subscriber does not point to
the HLR.
l The local MSCS end office is connected to the HLR in an associated mode, while the
local gateway office is connected to the HLR in a quasi-associated mode, with the
signaling forwarded by the local tandem office. Therefore, the problem may also be located
in the local tandem office, whose GT analysis does not point to the HLR.
Fault Handling
You are recommended to handle this fault in the following procedure.
1. In the Terminal tab page of the tandem office, run the SET GT command to modify
the GT translation so that it points to the local HLR.
2. In the Terminal tab page of the local gateway office, run the SET GT command to
modify the GT translation so that it points to the local HLR.
3. Test again and the dial-up test is successful. The problem is solved.
Fault Handling
You are recommended to handle this fault in the following procedure.
1. Set these numbers to "Not trigger caller IN service" at the SSP side.
2. In the Terminal tab page of the ZXUN iCX(MSCS), run the SET TPDNAL command
to enable NTC option for these numbers.
SET TPDNAL:ENTR=1,DIGIT="110",ENOPT="NTC";
3. Test again and the dial-up test is successful. The problem is solved.
Fault Phenomena
l The MO SM always fails.
l The MT SM fails.
Fault Handling
For the flow of handling the SMS fault, see Figure 4-5.
Fault Handling
You are recommended to handle this fault in the following procedure.
1. After confirming there are errors in the SC configuration, add the office number of the
IW/GMSCS in the SC configuration.
2. Besides adding the office number of the IW/GMSCS, it is required to modify the SMP
module of the corresponding IW/GMSCS in the SC system configuration.
3. Test again and find that MS/UE can receive the short message. The problem is solved.
Fault Handling
1. On the HLR and PVMSCS side, trace the number of the incoming short message
simultaneously. The routing request message of this incoming short message can
be traced on the HLR, and the HLR has returned a routing response message.
However, the short message delivery message cannot be traced on the PVMSCS,
which indicates that the PGMSCS can normally forward the SCCP-layer message to
the HLR. The problem probably occurs when the PGMSCS forwards a message to
the PVMSCS or STP forwards a message to the PVMSCS.
2. Check the GT configuration of PGMSCS, and find that all the GT data are configured,
including that of PVMSCS, HLR, extranet WGMSCS, and SMC. Particularly, the
configured PGMSCS GT is directly connected to the PVMSCS, not transferred
through STP. Therefore, something is wrong when the PGMSCS forwards an
SCCP-layer message to the PVMSCS.
3. Check the PVMSCS-related configuration on the PGMSCS, and find that the SCCP
protocol between PGMSCS and PVMSCS is not configured in the SIO-locate-AS
configuration.
Both PGMSCS and PVMSCS adopt the full-IP networking mode, and all the MAP and
SCCP messages are forwarded through STP. Before cutover, this problem did not
happen because the short message of the extranet did not use the SCCP forwarding.
After cutover, this problem happened because the short message of the extranet
needed to use the SCCP forwarding but the SCCP between PGMSCS and PVMSCS
was not configured.
Fault Handling
You are recommended to handle this fault in the following procedure.
1. In the Terminal tab page of the PGMSCS office, run the ADD SIOLOCAS command
to add the SIO-locate-AS configuration between PGMSCS and PVMSCS, with SCCP
selected as the service indicator.
2. Test again and find that MS can receive the short message. The problem is solved.
Summary
When the IP networking is adopted, the inter-office signaling configuration of the adjacent
office should be selected in the SIO-locate-AS configuration as required.
In the location update flow, MGW implements the SG function, switching the signaling of
the access side to the MSCS without any processing.
Fault Phenomena
l The subscriber cannot log in to the network after updating the location.
l The subscriber cannot be found in the VLR after the MS is powered on.
l The location update failure results in the call fault.
Fault Handling
You are recommended to handle this fault in the following procedure.
1. Analyze the faulty subscriber to determine whether it is a single-subscriber fault
or a multi-subscriber fault, and then open the Signaling Trace tool to trace the faulty
subscriber.
2. For the single-subscriber fault, trace the subscriber signaling. If no signaling is
successfully traced, the fault possibly lies in the subscriber side. In this case, contact the
subscriber to handle it. If the signaling of the subscriber location update request is
successfully traced, but the request is refused by HLR, check whether this subscriber
has subscribed to the roaming restriction data.
3. For the multi-subscriber fault, analyze the information of faulty subscribers. The
analysis method is shown as follows.
l Analyze whether the subscribers are located in the same location area or cell. If
they are, it indicates that the fault is related to the geographical position where
the subscribers are located. Ask service personnel at the wireless side to check
whether the radio-side equipment at that location area is normal. Check whether
the radio-related configuration data at the local office has been modified and is
correct.
l Check whether the IMSIs of the faulty subscribers follow a regular pattern, whether
the SMP corresponding to the IMSI load sharing of the faulty subscribers is normal,
and whether the service fault is caused by an SMP fault.
l Check whether these subscribers are homed to the same HLR. If they are, check
the mobile number analysis, GT and MTP configuration of the faulty subscribers.
Check the HLR status. Contact the HLR maintenance personnel to perform the
signaling trace, and solve the problem cooperatively.
l If the joint location update fails, check whether the Gs interface between the SGSN
and the MSCS/VLR is normal.
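The multi-subscriber analysis above looks for an attribute that all faulty subscribers have in common. A minimal sketch of that grouping follows. The record fields (location_area, smp and hlr) and the sample values are hypothetical and would have to be collected manually for each faulty subscriber.

```python
from typing import Dict, Iterable, List


def shared_attributes(subscribers: List[Dict[str, str]],
                      keys: Iterable[str]) -> Dict[str, str]:
    """Return the attributes that have the same value for every subscriber."""
    shared = {}
    for key in keys:
        values = {sub[key] for sub in subscribers}
        # Exactly one distinct value means all subscribers share it.
        if len(values) == 1:
            shared[key] = values.pop()
    return shared


# Hypothetical records for three faulty subscribers.
faulty = [
    {"location_area": "LA1", "smp": "SMP3", "hlr": "HLR-A"},
    {"location_area": "LA2", "smp": "SMP5", "hlr": "HLR-A"},
    {"location_area": "LA1", "smp": "SMP4", "hlr": "HLR-A"},
]
common = shared_attributes(faulty, ["location_area", "smp", "hlr"])
```

In this sample only the HLR is shared, which would point to the HLR-related checks in the list above.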
Check the VLR system data configuration. At the CN side, the period of general location
update is 45 minutes by default (VLR location update protection duration + periodic
location update duration), whereas at the RNC side the duration is set to 60 minutes.
Analysis flow:
l Initially, the duration of general location update at the CN side is 45 minutes,
less than the 60 minutes set at the RNC side. When the location update timer at the
CN side expires while the subscriber is inactive and has not yet performed the periodic
location update, the VLR marks this subscriber as “IMSI detach”, so the subscriber
cannot be paged.
l After the MS is powered off and on again, location update is performed again,
and the VLR marks the subscriber as “ATTACH”. The subscriber can then be paged
and can make calls normally.
Fault Handling
You are recommended to handle this fault in the following procedure.
1. At the CN side, set the periodic location update duration as 60 minutes, which is
specified in the VLR system parameters. So, the general location update duration
at the CN side is 75 minutes (the periodic location-update duration is 60 minutes, and
the VLR location-update protective duration is 15 minutes), larger than that at the RNC
side (60 minutes).
2. Test again and the dial-up test is successful. The problem is solved.
Summary
In the OMM system of CN, the default duration of periodic location update is 30 minutes,
and the duration of the VLR location update protection is 15 minutes. Adjust these
parameters according to actual conditions to make the duration of general location update
at the CN side larger than that at the RNC side.
The duration of periodic location update originated by UE is determined by the related
configuration of RNC. RNC data configuration must be consistent with that of CN.
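The timer relationship in this case can be checked with simple arithmetic. The sketch below uses the values quoted in the case. The function names are illustrative and are not OMM commands.

```python
def general_lu_duration(periodic_min: int, protection_min: int) -> int:
    """General location update duration at the CN side, in minutes."""
    return periodic_min + protection_min


def cn_timer_is_safe(periodic_min: int, protection_min: int,
                     rnc_periodic_min: int) -> bool:
    """True if the CN-side duration is larger than the RNC-side duration."""
    return general_lu_duration(periodic_min, protection_min) > rnc_periodic_min


# Default CN settings against an RNC setting of 60 minutes.
before = general_lu_duration(30, 15)       # 45 minutes
safe_before = cn_timer_is_safe(30, 15, 60)

# CN periodic duration raised to 60 minutes, protection unchanged.
after = general_lu_duration(60, 15)        # 75 minutes
safe_after = cn_timer_is_safe(60, 15, 60)
```

With the default values the CN-side timer (45 minutes) expires before the UE performs its periodic update (60 minutes), so the check fails. After the change the CN-side duration (75 minutes) exceeds the RNC-side value and the check passes.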
2. Open the Signaling Trace tab page and perform the location update test. The
engineer finds that the MSCS prompts an RNC_id ERR message after receiving an
SCCP message, and then directly returns a Disconnect message. This is related to
the RNC-related data configured on the MSCS. The problem probably lies in the
inconsistency between the data configured on the MSCS and that on the RNC.
Fault Handling
You are recommended to handle this fault in the following procedure.
1. In the Terminal tab page, run the SHOW RNCOFC command to check the RNC ID
configuration on the MSCS and find that the RNC ID is set to 2, while it is set to 1
on the RNC side.
2. In the Terminal tab page, run the SET RNCOFC command to modify the RNC ID to
1 on the MSCS.
3. Perform the location update test. The SCCP normally processes the subscriber
message from RNC. The problem is solved.
Summary
The RNC ID configuration on the MSCS is inconsistent with that on the RNC, which causes
the MSCS not to process subscriber information reported by the RNC. As a result, the
location update fails.
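A consistency check of this kind can be sketched as a comparison of two parameter sets. This is only an illustration: the dictionaries are filled in by hand from the SHOW RNCOFC output and from the RNC configuration, and the key name rnc_id is a hypothetical label.

```python
from typing import Dict, Tuple


def config_mismatches(mscs: Dict[str, int],
                      rnc: Dict[str, int]) -> Dict[str, Tuple[int, int]]:
    """Return the parameters present on both sides whose values differ,
    as (MSCS value, RNC value) pairs."""
    return {key: (mscs[key], rnc[key])
            for key in sorted(mscs.keys() & rnc.keys())
            if mscs[key] != rnc[key]}


# Values from this case: RNC ID is 2 on the MSCS and 1 on the RNC.
mismatches = config_mismatches({"rnc_id": 2}, {"rnc_id": 1})
```

An empty result means the compared parameters agree on both sides.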
Board-related operations
Do not unplug an online active board.
l Data synchronization between the active and standby boards takes a certain period,
even when the standby board is operating properly. If the active board is unplugged
while online, the latest data on it cannot be completely and automatically backed up
to the standby board, although the system switches over the boards automatically.
This can easily cause statistical errors and data loss.
l If the standby board is operating abnormally, this operation interrupts all the service
processing of the corresponding module, resulting in a partial or global service block
in the system.
Do not press the RST button on the board panel at will.
l Pressing the RST button on the board panel forces a hardware reset of the board.
Only qualified maintenance personnel may press it, and only when a serious fault
occurs in the system.
l Pressing the RST button on the active board panel resets the active board. The
consequence is the same as that of unplugging an online active board.
Do not plug or unplug boards without wearing antistatic wrist straps.
Static electricity on the human body may cause great damage to the components. If you
plug or unplug boards without wearing antistatic wrist straps, the boards may easily suffer
electrostatic damage. As a result, the boards are damaged or run unstably.
Cable-related operations
Do not plug or unplug network cables inside the cabinet at will.
This operation is likely to interrupt services and the communication between the
front-end and back-end systems.
Power-supply-related operations
Do not operate the power switches on the cabinet at will.
Operating the power switches stops the system and interrupts services. Operate them,
following the operation procedure, only when a serious fault occurs in the system, or
during network upgrade, network expansion, and component replacement.
Type of fault:
Origin of fault:
Symptom:
Handling:
Conclusion:
HTTP
- Hypertext Transfer Protocol
IMSI
- International Mobile Subscriber Identity
IP
- Internet Protocol
IPMC
- Intelligent Platform Management Controller
LAN
- Local Area Network
LMT
- Local Maintenance Terminal
MAP
- Mobile Application Part
MO
- Mobile Originated
MS
- Mobile Station
MSRN
- Mobile Subscriber Roaming Number
MT
- Mobile Terminated
MTP
- Message Transfer Part
NCDM
- New Chassis Data Module
NCMM
- New Chassis Management Module
NMS
- Network element Management System
OMM
- Operation & Maintenance Module
PEM
- Power Entry Module
RNS
- Radio Network Subsystem
RTN
- Return
SCP
- Service Control Point
SG
- Signaling Gateway
SGSN
- Serving GPRS Support Node
SIGTRAN
- Signalling Transport
SMS
- Short Message Service
SS7
- Signaling System No. 7
SSP
- Service Switching Point
TCP
- Transmission Control Protocol
UMTS
- Universal Mobile Telecommunication System
UPS
- Uninterruptible Power Supply