communications
performance analyzer
Product Note
Protection-switching time
measurements:
Understanding the OmniBER analyzer’s
measurement technique and the
alternative test methods
Introduction
The OmniBER analyzer’s service
disruption measurement is designed to
accurately measure the time taken by a
transmission system to perform an
automatic protection switch when a
transmission defect is detected by the
system. The measurement supports
testing of all protection switching
architectures deployed in today’s
transmission networks. Result accuracy
and reliability are based on a simple test
principle, coupled with a specially
designed error detector that measures
the duration of error bursts associated
with a protection switch event.
Contents
Introduction
Test configuration and setup
Measurement principle
Service disruption test results
Understanding the test results
Case study of protection-switching time measurements on a STM-64 dual-ring
System configuration
OmniBER setup
Summary of test procedure and results
Appendix 1: Protection switching time test methods and associated error sources
Protection switching time measurements - the alternative solution
(1) Service disruption time with simulated LOS failure (minimize detection time)
(2) Service disruption time with SF trigger by excessive errors (eliminate detection time)
Summary and conclusions
Test configuration and setup

Figure 1: Test configuration for measuring protection-switching time

As shown in figure 1, the OmniBER analyzer measures protection-switching time from the tributary-side of the transmission system under test. This point is significant for two reasons:

1. The measurement is totally independent of the protection switching architecture and all optional settings, so it supports all configurations.
2. The performance of the system under test cannot be affected by the test set, since results are obtained through passively monitoring a PRBS for errors.

The measurement is performed by inserting a PRBS test pattern into a tributary feeding the transmission system under test, looping this signal at the associated drop-side tributary, then monitoring the PRBS for errors. This PRBS can be inserted either as part of a PDH signal or as a mapped service within an SDH signal.

Measurement principle

The protection-switching time measurement is an out-of-service test that is typically performed during verification, installation and commissioning of new transmission systems. As such, the measurement principle assumes that the system under test will be operating error-free before the test is performed. This assumption is important since the presence of background errors can affect test results (see Understanding the Test Results for details).

The measurement is performed by first connecting the OmniBER analyzer as previously described and verifying error-free reception of the PRBS test pattern, then invoking a protection switch on a working section of the transmission system that is transporting the PRBS. Typical methods used to trigger this protection switch are disconnecting a fibre (see notes 1 and 2) to simulate a fibre-break, or removing power from a transmission element to simulate a node-failure. As will be shown in the following STM-64 Ring case study, there is value in verifying the system’s performance when tested separately using a simulated fibre-break and a simulated node-failure.

Note 1: Exercise extreme caution when disconnecting an optical fibre; follow your organization’s standard safety procedures.
Note 2: Refer to Appendix 1 for a discussion on the issues associated with generating a loss-of-signal by disconnecting a fibre.

Irrespective of how the protection switch is triggered, it results in the PRBS test pattern being corrupted for a short period. As shown in figure 2, the duration of this corruption is controlled by three factors:

1. The system’s fault detection time
2. The protection-switching time
3. The time taken by the OmniBER analyzer to re-align to the pointers (SDH tributary only) and test pattern

Figure 2: Contributors to the protection switching time as measured by OmniBER

Since the objective is to measure a transmission system’s protection-switching time (specified as less than 50 ms in ITU-T standards), it is essential that the contributions made by these additional factors are minimized. For fault detection time, this is achieved by triggering the protection switch using a failure that results in a LOS defect. Although ITU-T G.783 (2000) defines LOS detection time as being “in the province of regional standards”, it provides an example based on a value of less than 100 µs (less than 0.2% of the maximum acceptable protection-switching time). In the case of pointer and pattern acquisition, the required times are as shown in figure 2. These values represent the following percentage errors for PDH and SDH tributaries (with 140M & 2M mappings) when related to the maximum acceptable switching time:

PDH: +0.1%
SDH (140M): +0.35% to +0.85%
SDH (2M): +1.35% to +3.85%
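The percentages quoted above are each time contribution divided by the 50 ms limit. The following Python sketch reproduces that arithmetic and adds the LOS detection figure to the acquisition figures. The function and variable names are illustrative only and are not part of the OmniBER analyzer; the acquisition percentages are copied from the text rather than derived from figure 2.

```python
from fractions import Fraction

MAX_SWITCH_TIME_US = 50_000  # 50 ms limit quoted from ITU-T, in microseconds


def percent_of_limit(duration_us):
    """Express a time contribution as a percentage of the 50 ms limit."""
    return float(Fraction(duration_us, MAX_SWITCH_TIME_US) * 100)


# LOS detection time, using the 100 us example value from G.783.
los_detection_pct = percent_of_limit(100)

# Pointer/pattern acquisition errors (min %, max %) as listed in the text.
acquisition_pct = {
    "PDH": (0.1, 0.1),
    "SDH (140M)": (0.35, 0.85),
    "SDH (2M)": (1.35, 3.85),
}

# Total systematic error = LOS detection + pointer/pattern acquisition.
total_pct = {
    name: (round(los_detection_pct + lo, 2), round(los_detection_pct + hi, 2))
    for name, (lo, hi) in acquisition_pct.items()
}

for name, (lo, hi) in total_pct.items():
    print(f"{name}: +{lo}% to +{hi}%")
```

Across all three tributary types the totals span +0.3% (PDH) to +4.05% (SDH with 2M mapping).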
When measuring a system’s protection-switching time, the above discussion shows that the total systematic error associated with the OmniBER analyzer’s service disruption measurement can be restricted to between +0.3% and +4.05% of the maximum acceptable switching time. Consequently, it can be relied on to accurately evaluate this important system specification.

Service disruption test results

Three separate results are provided by the OmniBER analyzer’s existing service disruption measurement:

Longest: The duration of the longest error burst detected during the test
Shortest: The duration of the shortest error burst detected during the test
Last: The duration of the most recent error burst detected during the test

These result fields are reset to 0 ms at the beginning of a protection-switching time test by pressing the instrument’s START button. When the protection switch is triggered, the duration of the resulting error burst is measured and displayed. For the system under test to pass, a single error burst of duration less than 50 ms should be detected. Detection of a single error burst is indicated by an identical value being displayed in the three result fields.

New developments in the OmniBER 718 analyzer have been designed to make analysis of a protection-switch faster and easier, including:

Timestamping: Displaying a relative timestamp of the beginning of each service disruption event.

History: A record of the first ten service disruption measurements.

While an Alarm Indication Signal (AIS) is not a pure protection-switching measurement, it is closely related. AIS is activated as a result of any physical layer failure, such as a fibre break. The OmniBER 718 can now provide details of AIS duration measurements, timestamping and history. When used in conjunction with the OmniBER analyzer’s service disruption measurements, it becomes possible to show a relationship between alarms within a device and automatic protection switches in a network. This means you can quickly and accurately debug network elements. For example, using the timestamping and AIS duration measurements makes it easy to see if a device fails due to AIS not being raised quickly enough, or taking too long to be removed after the switch has taken place.

Understanding the test results

As is true for any measurement, having a working understanding of how the measured values are derived is helpful when attempting to identify the root-cause of unexpected results. In the case of the OmniBER analyzer’s service disruption measurement this means understanding the rules associated with the analysis of error-burst duration.

The OmniBER analyzer’s service disruption test measures the elapsed time between the first and last error in an error-burst that consists of two or more errors. The error-burst is taken as having ended when no errors are detected during a period of greater than 200 to 300 ms following the last error. Single errors that are separated by more than 200 to 300 ms are not considered as being part of an error-burst (no result is returned).

Figure 3 illustrates the effect these simple rules have on measurement results when different error distributions are present in the received test pattern. In case 1 and case 2, there are single errors due to a low background error rate in the transmission system, plus an error-burst associated with a protection-switching event.
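The error-burst rules described above can be sketched in a few lines of Python. This is an illustration of the stated rules only, not the analyzer’s actual implementation: the function names, the fixed 300 ms guard time (the text gives a range of 200 to 300 ms) and the error timestamps are all assumptions made for the example.

```python
GUARD_MS = 300.0  # assumed guard time; the text quotes a range of 200 to 300 ms


def burst_durations(error_times_ms, guard_ms=GUARD_MS):
    """Group error timestamps into bursts and return each burst's duration.

    A burst ends when no error arrives within guard_ms of the previous one.
    Isolated single errors produce no result, as described in the text.
    """
    durations = []
    first = last = None
    count = 0
    for t in sorted(error_times_ms):
        if last is not None and t - last > guard_ms:
            # Gap exceeded the guard time: close the current group.
            if count >= 2:
                durations.append(last - first)
            first, count = None, 0
        if first is None:
            first = t
        last = t
        count += 1
    if count >= 2:
        durations.append(last - first)
    return durations


def summarize(durations):
    """Return the Longest, Shortest and Last results (None if no burst)."""
    if not durations:
        return None
    return {
        "longest": max(durations),
        "shortest": min(durations),
        "last": durations[-1],
    }


# Hypothetical timestamps: two isolated background errors (0 ms and 5000 ms)
# and one burst from 1000 ms to 1030 ms.
errors_ms = [0.0, 1000.0, 1010.0, 1030.0, 5000.0]
print(summarize(burst_durations(errors_ms)))
# prints {'longest': 30.0, 'shortest': 30.0, 'last': 30.0}
```

The three identical values correspond to the pass condition of a single burst shorter than 50 ms. Under the same rules, a background error that falls within the guard time of a burst would be counted as part of it and lengthen the reported duration, which is why the measurement assumes an error-free system before the test.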
Summary of test procedure and results

Protection switch trigger event                  Transmission direction of test pattern (1)   Measured results
Normal operation                                 R1A1 → R1A4 → R2A1 → R2A3                    -
Node failure (2): Ring #1, ADM 4                 R1A1 → R1A2 → R1A3 → R2A2 → R2A1 → R2A3      100 ms (4)
Node failure (2): Ring #2, ADM 1                 R1A1 → R1A2 → R1A3 → R2A2 → R2A3             30 ms
Node recovery (3): Ring #1, ADM 4                R1A1 → R1A4 → R1A3 → R2A2 → R2A3             174 & 18 ms (5), then 34 ms (6)
Node recovery (3): Ring #2, ADM 1                R1A1 → R1A4 → R2A1 → R2A3                    14 & 11 ms (5), then 34 ms (6)
Fibre-disconnect: between ADM 1 & 4 on Ring #1   R1A1 → R1A2 → R1A3 → R1A4 → R2A1 → R2A3      30 ms
Fibre-reconnect: between ADM 1 & 4 on Ring #1    R1A1 → R1A4 → R2A1 → R2A3                    30 ms

Notes:
1. Indicates the direction of transmission of the test pattern after completion of the protection switch [Rn = Ring number (1 or 2); An = ADM number (1 to 4 on R1; 1 to 3 on R2)]. Go and return paths for the test pattern are the same (i.e. bi-directional protection switching is used).
2. Caused by removing power from the selected ADM.
3. Power restored to failed node.
4. The customer accepted this result because switching events must occur on both rings in order to restore error-free transmission.
5. These results were measured approximately 90 seconds after power was restored to the ADM, before the wait-to-restore period was complete. On further examination using OmniBER’s graphical measurement results, it was found that these unexpected results were caused by an AU-AIS alarm being output for a short time period at the tributary connected to OmniBER’s receiver. The only possible explanation for this is that the ring’s operation is being corrupted while the failed node re-establishes itself as an active node on the ring.
6. This result was measured 5 minutes after power was restored to the ADM; it is therefore the actual protection-switching result for this test.

Initially, the wide variation in the results obtained from these tests caused questions to be raised concerning the reliability of the OmniBER analyzer’s service disruption measurement. When examined more closely, however, with an understanding of the measurement technique (as explained in this Product Note), it became clear that the analyzer was accurately measuring the error burst activity associated with each test. In doing so, the OmniBER analyzer’s service disruption measurement provided the evaluation team with a detailed knowledge of the actual protection-switching performance supported by the system under test.
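As a small illustration of how the results in the table above compare with the 50 ms limit, the following Python sketch lists the measured values and flags any that exceed it. The variable names are illustrative; for the two node-recovery rows the value measured after 5 minutes (note 6) is used, since note 6 identifies it as the actual protection-switching result.

```python
LIMIT_MS = 50  # ITU-T protection-switching limit quoted in this note

# (trigger event, measured result in ms) taken from the table above.
results_ms = [
    ("Node failure: Ring #1, ADM 4", 100),
    ("Node failure: Ring #2, ADM 1", 30),
    ("Node recovery: Ring #1, ADM 4", 34),
    ("Node recovery: Ring #2, ADM 1", 34),
    ("Fibre-disconnect: ADM 1 & 4, Ring #1", 30),
    ("Fibre-reconnect: ADM 1 & 4, Ring #1", 30),
]

# Collect every result that exceeds the limit.
over_limit = [(event, ms) for event, ms in results_ms if ms > LIMIT_MS]

for event, ms in over_limit:
    print(f"{event}: {ms} ms exceeds {LIMIT_MS} ms")
```

Only the node failure on Ring #1, ADM 4 is flagged. That is the result covered by note 4, where switching events had to occur on both rings.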
Appendix 1:
Protection switching time test methods and associated error sources
Protection switching time measurement - the problem

Many of today’s high-reliability transmission networks are founded on SDH/SONET technology, which provides built-in fault restoration known as Automatic Protection Switching (APS). This general term covers a range of different protection schemes designed for use in Linear and Ring network topologies, and includes linear Multiplex Section Protection (MSP), Multiplex Section Protected Rings (MSPRING) and Path Protection. However, regardless of the network topology and specific protection scheme, the basic principles behind the OmniBER analyzer’s service disruption measurement, and its application in verifying a transmission system’s protection switching time, remain valid.

Figure A1

This example shows the state of the nodes after a switch has taken place. The typical sequence of events is:

• The tail-end node detects the failure and signals the head-end to request a protection switch.
• The head-end performs a bridge or bridge and switch operation, and sends back an acknowledgement.
• The tail-end receives the acknowledgement and performs a bridge and switch operation, then finishes by sending a status message to the head-end.
• The head-end finishes by performing a switch operation if necessary.

Following a failure, full service is not restored until all the bridge and switch operations are completed. A key design goal for Network Equipment Manufacturers (NEMs) is to keep this service disruption as short as possible, as their customers (Network Operators) will demand that all systems deployed in the network meet or exceed the specification published by the governing standards body (Telcordia or ITU-T). This appendix deals with the challenge of making meaningful and repeatable measurements of protection switch time.

Figure A2 shows the two main components of a service disruption following any kind of failure:

The first part is the time taken to detect a failure. Protection switching can be initiated by either of two events:

1. Signal Fail (SF): usually loss of signal, loss of framing, or a very high error ratio such as 1 x 10^-3 or greater.
2. Signal Degrade (SD): a persistent background error rate that exceeds a provisioned threshold in the range 1 x 10^-5 to 1 x 10^-9. Note that, at the multiplex section level, ITU-T G.806 (October 2000 draft) specifies the ‘detection time’ for these error rates as 1 second for 1 x 10^-5 to 10,000 seconds for 1 x 10^-9.

The second part is the time taken for the actual switching process to complete as described above. This is normally dominated by the protocol processing time at each node on the Protection Circuit.

Figure A2
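The two-part breakdown above can be expressed as a simple sum. The Python sketch below shows how strongly the detection time can dominate the total service disruption. The names are illustrative; the 1 s and 10,000 s end points are the values quoted above, while the intermediate decades are an assumed log-linear interpolation and should be checked against ITU-T G.806 before being relied on. The 100 µs LOS value is the G.783 example quoted elsewhere in this note.

```python
# Signal-degrade detection times in seconds, keyed by threshold exponent
# (for example -5 means a threshold of 1 x 10^-5). Only the -5 and -9
# entries are quoted in the text; the others are ASSUMED by interpolation.
SD_DETECTION_S = {-5: 1.0, -6: 10.0, -7: 100.0, -8: 1_000.0, -9: 10_000.0}

LOS_DETECTION_S = 100e-6  # example LOS detection time (100 us)


def service_disruption_s(detection_s, switching_s):
    """Total disruption = time to detect the failure + time to switch."""
    return detection_s + switching_s


switching_s = 0.030  # hypothetical 30 ms protection-switching time

print(service_disruption_s(LOS_DETECTION_S, switching_s))
print(service_disruption_s(SD_DETECTION_S[-5], switching_s))
print(service_disruption_s(SD_DETECTION_S[-9], switching_s))
```

With a LOS trigger the total stays within a fraction of a millisecond of the switching time, whereas with a signal-degrade trigger the detection time swamps it.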
Although users of protected services are more interested in the total service disruption time, the ITU-T standards provide separate specifications for protection switch time and the ‘detection times’ associated with the various SF and SD conditions. While this division of service disruption time into its component parts may seem unhelpful, it is necessary due to the wide variation in detection time associated with the different SF/SD conditions. These range from approximately 100 µs for a LOS failure to 10,000 seconds for a signal degrade that has a provisioned threshold of 1 x 10^-9 error rate. Another good reason for treating detection time separately is that the nature of some fault conditions can be very unpredictable. For example, when a fibre is damaged during construction work it may not break cleanly. Instead, the optical signal may fade over several tens of milliseconds or vary erratically before finally disappearing. So the ITU-T standards require that, once SF/SD is detected, a protection switch event must be completed in 50 milliseconds or less. This is a tough requirement, but if it is met, end-users will not normally notice a protection switch event even allowing for a realistic SF/SD detection time.

Now, having looked at the general processes associated with a Protection Switch event, it is time to address the central question: How can the protection switch time of a transmission system be measured to ensure that it meets the ITU-T specification (equal to or less than 50 milliseconds)?

you obtain a reliable measure of a system’s protection switching time’.

Protection switching time measurements - the alternative solutions

Two different approaches can be used to obtain measurement results that are (or should be) closely related to a system’s protection switching time, namely, measure the service disruption time associated with a SF/SD condition that either:

1. Minimizes the ‘detection time’ (create a LOS failure, typically detected in less than 100 µs), or
2. Eliminates the ‘detection time’ (generate control parity errors* on the entity being protected)

* B2 errors for multiplex section protected systems; HP-B3 errors for High-order Path protected systems; LP-B3/BIP-2 errors for Low-order Path protected systems

This document now provides a discussion on the merits of each of these approaches.

(1) Service disruption time with simulated LOS failure (minimize detection time)

As discussed earlier in this product note, the principles behind the service disruption measurement are simple:

• Insert a PRBS test pattern at a tributary port feeding the protected transmission system
• Perform a loopback on this test pattern at the associated drop-side tributary port
• Induce a failure on the ‘working’ line system that affects the test pattern
• Measure the duration of the resulting error-burst in the PRBS test pattern at the tributary port.

Although ITU-T G.783 (2000) defines LOS detection time as being “in the province of regional standards”, it provides an example based on a value of 100 µs or less. This is less than the detection times specified for all other SF conditions. Consequently, if it is assumed that the system under test detects LOS within this ‘example’ time, then inducing an ‘instantaneous’ LOS failure and measuring the resulting service disruption time will provide an accurate estimate of the system’s protection switching time. In theory, this method will yield a result that over-estimates the protection switching time by a maximum of 100 µs, since the measured service disruption time will include both the LOS detection time and the protection switching time.

Figure A3
Figure A7

Within 10 ms of injecting the B2 errors, the tail-end node (the NE receiving the B2 errors) will detect the excessive error condition. This causes the NE to declare a SF and to initiate a protection switch sequence. In addition, the tail-end node is required to insert an AIS alarm in all down-stream traffic channels within 250 µs of declaring SF. And since this AIS will overwrite the PRBS test pattern that is transmitted and monitored by test set#2, it causes the service disruption measurement to be triggered (started).

For standards compliant network elements, this method will yield accurate and repeatable results when measuring protection switching time. Its main advantage over the ‘LOS methods’ discussed earlier is that it eliminates the ‘SF detection time’ error from the measured result. The only technical drawback is that its results slightly under-estimate a system’s protection switching time, but only by up to 250 µs (assuming that the tail-end node inserts the downstream AIS within the 250 µs period specified in ITU-T G.783). Possibly the most serious ‘drawback’ associated with this measurement method is a commercial one: it requires two transmission test sets (one covering the required tributary rates, the other covering required line rates).
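The error bounds of the two methods can be compared with a short Python sketch. It turns a measured service disruption time into the range of protection-switching times consistent with it, using the 100 µs LOS detection figure and the 250 µs AIS insertion figure quoted in this note. The function name and the method labels "los" and "sf_errors" are hypothetical, chosen only for this example.

```python
LOS_DETECTION_MAX_US = 100  # LOS method: result includes up to 100 us detection
AIS_INSERTION_MAX_US = 250  # SF-by-errors method: timing starts up to 250 us late


def switching_time_bounds_us(measured_us, method):
    """Return (min, max) protection-switching time implied by a measurement."""
    if method == "los":
        # The measured time over-estimates by at most the LOS detection time.
        return (measured_us - LOS_DETECTION_MAX_US, measured_us)
    if method == "sf_errors":
        # The measured time under-estimates by at most the AIS insertion delay.
        return (measured_us, measured_us + AIS_INSERTION_MAX_US)
    raise ValueError(f"unknown method: {method!r}")


# Hypothetical 30 ms measurement interpreted under each method.
print(switching_time_bounds_us(30_000, "los"))        # (29900, 30000)
print(switching_time_bounds_us(30_000, "sf_errors"))  # (30000, 30250)
```

Both bounds assume a standards-compliant network element, as the text states. For either method the uncertainty is well under 1% of the 50 ms limit.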
Agilent Technologies aims to maximize the value you receive, while minimizing your risk and problems. We strive to ensure that you get the test and measurement capabilities you paid for and obtain the support you need. Our extensive support resources and services can help you choose the right Agilent products for your applications and apply them successfully. Every instrument and system we sell has a global warranty. Support is available for at least five years beyond the production life of the product. Two concepts underlie Agilent’s overall support policy: “Our Promise” and “Your Advantage.”

Your Advantage means that Agilent offers a wide range of additional expert test and measurement services, which you can purchase according to your unique technical and business needs. Solve problems efficiently and gain a competitive edge by contracting with us for calibration, extra-cost upgrades, out-of-warranty repairs, and on-site education and training, as well as design, system integration, project management, and other professional engineering services. Experienced Agilent engineers and technicians worldwide can help you maximize your productivity, optimize the return on investment of your Agilent instruments and systems, and obtain dependable measurement accuracy for the life of those products.

Online assistance:
www.agilent.com/find/assist

Phone or Fax
United States:
(tel) 1 800 452 4844
Canada:
(tel) 1 877 894 4414
(fax) (905) 282 6495
China:
(tel) 800 810 0189
(fax) 1 0800 650 0121
Other Asia Pacific Countries:
(tel) (65) 375 8100
(fax) (65) 836 0252
Email: tm_asia@agilent.com

Product specifications and descriptions in this document subject to change without notice.

© Agilent Technologies UK LTD. 2002
Printed in USA, 8 March, 2002
5988-5122EN