You are on page 1of 32

Security Level:

2013/10/10
www.huawei.com
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Introduction to the Fault
Management Assistant (FMA)
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 2

Content
Background
Key Function
Scenario
Detailed Function
Accident Recovery SOP
Download
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 3

Background
Longer Restored
Time of Accident
Inefficient
Location
Various Tools
1. Times of accident (include
non-quality) for 2010
reaches about 238, and
pressure of production line
is very higher
2. Restored time of accident
is too longer, and average
value is 136 minutes by
statistic in last year. Now,
in Canada, the accident
recovery SOP has been
deployed(use FMA tool),
and time decreases as 40
minutes.
3. Quickly restored accident
becomes a key task for
production
1. Lacked method of location,
and inefficient for
maintenance
2. Various type of log. The
size of log is larger for
download by hand and
time is longer
3. Generous information. It is
inefficient to gather
information of accident

OMSTARInsightSharpNICUMATPRESTAR
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 4

Function
Alarm and
Operation
log Analysis
DashBoard
Real-time
UKPI
Monitor
CHR&PCHR
Fault
Diagnosis
Performance
Browsing
&Compariso
n
MML
Comparison
and Feature
Scan
FMA
Accident Log Collect
Quickly
Effectual relationship
Quickly Location
Experience integration
Uniform maintenance
plane
Convenience
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 5

Scenario
1MML Scripts
2Alarm Log
3Performance
4Operation
5CHR&PCHR

Fault Diagnosis
DashBoard
Performance Analysis
Alarm Analysis
1. Accident Log
Collection
2. Performance
Comparison Online
3. Real-time UKPI
Monitor

Diagnosis Report
1.Phenomena
2.Result
3.Workaround
4.Information
MML Comparison
Feature Scan
PC
FMA
Tool
CHR&PCHR Analysis
Operation Analysis
Fault Assistant
Analysis
Function
Online
Function
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 6

FMA

Scenario
Accident happens in commercial
network
1. Accident log collection
2. Fault diagnosis and workaround
3. Dashboard, associate performance, alarm
and operation log
4. Performance browsing and comparison
online, recognize fault point quickly
5. Alarm and operation log analysis
6. CHR&PCHR analysis
Degraded KPI of network
1. CHR&PCHR analysis
2. Performance browsing, and TOPN Cell
3. MML Comparison
Safeguard for holiday or cell
Real-Time UKPI monitor
Which feature is opened?
Feature & License Scan in MML scripts
What is network?
MML parsing and exported key information
Fault Analysis and Location
MML parsing, alarm, performance, operation log,
CHR log Analysis, MML comparison

Scenario
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 7

Detailed Function Introduction (1)
Functio
n
Detailed Information Remark Level
Accident
Log
Collectio
n
1. Accident log collection, and
divide for two batches. The size of
the first batch is less than 5M, and
30M for second batch,
2. Provide to collect transmission
log
3. Provide to collect SOP log
1. The function has been
deployed with accident
recovery SOP for global
operators (about 53 operators
has been used)
2. During the accident, it takes
about 10 minutes to feedback
the accident log

1. Collect expediently, and do not
worry about missing log
2. The size of collection log is
small, and easy to deliver to HQ
by Email
Collection data
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 8

Detailed Function Introduction (2)
Functio
n
Detailed Information Remark Level
Fault
Diagnosi
s
1. Associate the alarm, performance and operation log
by MML script
2. Provide the key KPI and alarm statistic of different
SPU and INT board
3. Provide the visual plane of MML script, and
relationship information of cell, link and neighboring
cell
4. Based on the various original rules, and draws a
workaround
1. Cover about 40~50%
Scenario of accident
2. The function has
been deployed with
accident recovery
SOP for global
operators (about 53
operators has been
used)


The FMA has been deployed in
accident SOP of Canada
Run the FMA to check whether result is right or not
when accident occurs
1. Get conclusion
quickly
2. Classification
3. Impaction
clearly
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 9

Detailed Function Introduction (2)
MML Script Parsing and Visual Display
Visual display for Plane
Extract MML Scripts
Zoom Figure of Subrack
It is not painful to extract MML scripts
of Node and information link now!
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 10

Detailed Function Introduction (3)
Function Detailed Information Remark Level
DashBoar
d
1. Relationship display for performance,
alarm and operation
2. The counter can be queried and drawn as
curve and recognize the impaction of KPI
3. Frequency of alarm statistic and chart
It can be used for
analysis of accident,
and recognize the
impaction of alarm
and operation log for
accident.

1. This function is edge tool to analyze
the accident log. It is convenience to
browse the performance (KPI) ,
alarm and operator log, and
relationship with them.
2. If the SR of RRC or RAB is
deteriorated, this function can check
the alarms and operator log during
the worsen period or KPI.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 11

Detailed Function Introduction (4)
Functio
n
Detailed Information Remark Level
Alarm
Analysis
1. Alarm log parses and display quickly
2. Filter, classification and highlight
3. Relationship between alarm and MML scrip, and
provide the SPU subsystem, port and Node
information for each fault alarm
4. Statistic for alarm, provide the proportion of fault
alarm for SPU subsystem or port, and frequency of
alarm to analyze the accident log
It has been used
widely in
maintenance, test
and other
department

1. Relationship between alarm
and MML scrip is a light
spot
2. Recognize the issue quickly,
and check whether the issue
in happened on SPU or
interface board

Frequency of Alarm
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 12

Detailed Function Introduction (5)
Functio
n
Detailed Information Remark Level
Operatio
n Log
Analysis
1. Normal and BAK operation log
browsing
2. Filter
3. Priority of command (Critical/Normal)
4. Backup operator log to browse
It has been used widely
in maintenance
department


The traditional accident is caused by the
wrong MML command easily, and
whether this issue is caused by
command or not?
Use the function and browse or filter the
commands quickly.
The backup operation log for several months ago can be analyzed by FMA
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 13

Detailed Function Introduction (6)
Functio
n
Detailed Information Remark Level
Real-Time
UKPI
Monitor
1. Connect with OMU online, and get UKPI
file and User number information
2. Chart to display the UKPI and user
number information, convenience to monitor
3. Cluster Cell to monitor hotspot cell
It has been used widely
in safeguard for the
South Africa World Cup,
Asia Sport Game in
Guangzhou, Hajj of
Saudi Arabia

The FMA can help you to
monitor performance of
system during the holiday
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 14

Detailed Function Introduction (7)
Function Detailed Information Remark Level
MML
Compariso
n
1. Comparison for two scripts of different
RNC or different version of RNC
2. Color to denote the results
3. Filter and extract
It has been used
widely in maintenance,
performance and other
department

The result of comparison with two types
1. Comparison for two
scripts of different/same
RNC or different version
of RNC(V2 and V9), and
display the difference
2. The function can be used
for degraded KPI caused
by wrong parameter
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 15

Detailed Function Introduction (8)
Function Detailed Information Remark Level
NodeB
XML2MML
Convert the NodeB XML configuration
file to MML commands. The user needs to
browse the XML file to confirm the
configuration by CME tool.

It has been used
widely in maintenance,
performance and other
department

XML configuration MML Commands
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 16

Detailed Function Introduction (9)
Functio
n
Detailed Information Remark Level
Performa
nce
Analysis
1. Support browsing quickly for about
maximal 200 files, and take about 3
minutes
2. Normal KPI browsing, query, and
chart to display
3. TOPN cell analysis, including access,
drop call
4. Provide KPI analysis for cluster cell,
and counter query
5. Health check, and provide about 300
rules
6. Defined counter, support expression
and logical operation
7. Voice model
1. It has been used
widely in
maintenance, test and
other department
2. The efficiency of
analysis for about one
week is more higher
than other tool, such
as OMSTAR,
NASTAR

HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 17

Detailed Function Introduction (9)
Performance Analysis(to be)
1. FMA can analyze for about 200 performance file(1~2M zip) on normal PC with 2G
memory, and it takes about 1.2s to parse one file averagely.
2. Much experience has been integrated into FMA, and user can analyze TOPN cell,
heath check and voice model expediently.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 18

Detailed Function Introduction (9)
Performance Analysis(to be)
1. TOPN Analysis
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 19

Detailed Function Introduction (10)
Function Detailed Information Remark Level
Performance
Comparison
1. Different period of performance to compare
for same RNC and same or different version of
RNC
2. Draw the chart quickly for normal KPI
3. Collection performance files online
The function has
been deployed in
Canada

HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 20

Detailed Function Introduction (11)
Function Detailed Information Remark Level
CHR&PCH
R Analysis
1. CHR&PCHR browsing quickly
2. Classification of fault for CHR and PCHR
3. Filter, filter by column value or filter by
condition
4. Statistic for point code
5. Statistic for parameter
It has been used
widely in
maintenance
department

Analyze and browse CHR&PCHR
log expediently and quickly, and
easy to locate KPI issue
1. About 0~1s to parse one
CHR log file
2. About 2~3s to parse one
PCHR log file
3. About 0~1s to filter one
CHR/PCHR log file
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 21

Detailed Function Introduction (11)
CHR&PCHR Analysis (to be)
The Fault Classification based on the CHR or PCHR log, and
analyze the KPI issue quickly by the function
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 22

Detailed Function Introduction (11)
CHR&PCHR Analysis (to be)
The chart is shown as the trend of statistic for RRC attempts times. The
FMA can provide the other statistic, such as RAB attempts /Succ times,
or given condition
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 23

Detailed Function Introduction (11)
CHR&PCHR Analysis (to be)

The chart is shown as the trend of CPU with second period
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 24

Detailed Function Introduction (11)
CHR&PCHR Analysis (to be)

The statistic of given parameter
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 25

Detailed Function Introduction (11)

Functio
n
Detailed Information Remark Level
Feature &
License
Scan
1. Feature scans for MML scripts and License
feature scans for License file
2. Feature compares for MML scripts or
between MML script and License file
3. Rule of feature is defined by user in excel file
It has been firstly
used in test
department


Result of License Scan
It is quick to known which feature is open for some operator?
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 26

MML&License Feature Scan
Feature
definition
Scan
Result in
Excel
Dialog of feature scan
Result of License
1. MML Feature Scan
2. License File Scan
3. Feature Comparison
between MML and
License File
It is quick to known which feature is open for
some operator?
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 27

Detailed Function Introduction (12)
Feature &
License
Scan
1. Feature scans for MML scripts and License
feature scans for License file
2. Feature compares for MML scripts or
between MML script and License file
3. Rule of feature is defined by user in excel file
It has been firstly
used in test
department


Functio
n
Detailed Information Remark Level
Node B
main
board log
parse
1. Included of run log, alarm log, call log, cell
log, operation log .etc parse function.
2. The configure file figure shown.
3. DRD configure compared between RNC
script and Node B script.
4. Transmission configure compared between
RNC script and Node B script.
It has been used in
department.
Provide fast parse
function and
RNC/Node B script
compare function.




It is quick and convenient to known the site configuration.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 28

Detailed Function Introduction (12)
It is quick and convenient to known where is the problem of
DRD configuration.
It is quick and convenient to known where is the problem of
transmission configuration.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 29

Accident Recovery SOP
The accident recovery SOP is guide to prevent
or recover the accident quickly for the
front, and provides technology and support.
1)Precaution and preparative operation for
accident
2) The network and guide for accident
collection tool
3) The FMA guide book
4) The emergency solution for accident, and
recover the accident by the guide
Benefit
1) The time of collection of accident is
saved by using the tool
2) The efficiency of analysis for accident
is improved. FMA can display the key
information of accident and provide the
result of diagnose quickly.
3) The front can recover the accident based
on the emergency solution
4)The average recovery time of accident of
UMTS is decreased as 50% with last year
Application :
The FMA tools has been deployed for about 53
operators. The following table is sample for
Canada, and the recovery time of accident are
listed.




Op Time Phenomena Recover
time(min)
Canada
Sasktel
2011-8-4 It is different to connect
user
15
Canada
Dry Run
2011-7-6 CS RAB SR is decreased as
80%
30
Canada
Dry Run
2011-6-3 PS RAB SR is worse quickly 50
Canada
Telus
2011-4-27 RRC SR is decreased as 90%
90%
37
Canada
SASKTEL
2011-4-6 RRC SR is decrease as 50% 90
Canada
Bell
2011-2-17 The traffic with 72 NodeB
interrupted under one RNC
3811
10
Canada
Bell
2011-1-28 The PS traffic for 6 RNC
impacted
24
Canada
Bell
2011-1-27 The traffic with 789 NodeB
are interrupted
62
Canada
Telus
2011-1-6 The SPU Boards in Subrack
1,2,3 are reset
50
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 30

Download
The download time of FMA tool in Support network, and the comparison
with the other tools
Up to 2011-8-30the download time of FMA has reached more than
1000 times. The tool has been widely to use by Maintenance, R&D, Test,
NTS, GTAC and the front engineer.
HUAWEI TECHNOLOGIES CO., LTD. Huawei Confidential
Page 31

Download
1. UMTS Accident Log Collection Tool
2. UMTS FMAanalysis tool
Thank you
www.huawei.com

You might also like