Professional Documents
Culture Documents
Abstract: The development of multimedia propagations and The clear picture is an example of a 100 minutes movie
applications has led to a greater expansion in the field of video displayed at 30 frames per second with width of frame
transmission over a heterogeneous media as well as iterative 640x480 pixels with each pixel taking 3 bytes of memory.
delivery platforms with dedicated content requirements. It is The reality shows that, for each second of the movie, the
known that conventional video coding systems encode video requirement be at least 27MB of memory; as a result, the
content with given bitrates adapted to a specific function or entire movie will need almost 162GB of memory. If this
application. As a result, conventional video coding does not meet
movie were stored on DVD’s, then considered the current
the fundamental requirements of the state-of-the-art flexible
digital media application. The newly technology based on
DVD capacity of 4.7 GB, would roughly require 35 DVD’s.
scalable video coding appears as a new modus operandi that has Therefore, video needs to be compressed considerably for
the ability to satisfy the underlying requirements. In this work, efficient storage and sharing over the web [1]
a multi-users scenario was considered for an optimum However, there are a lot of redundancies within the video
performance between multiple streams. A rate distortion data that can be eliminated yielding file size reduction or
optimized video frame dropping strategy which can be applied compression.
on active network nodes during high traffic intensity was
developed. The concept of scalability here, come to introduce the 2. H.264/AVC Scalable Video Coding
operability of high level of suppleness coding and decoding
systems. A base layer which can display the suitable quality of
the premium file was considered and take care of the
2.1 Basic H.264/AVC structure
improvement of video quality. The H.264/AVC standard has a range of coding tools
contributing to its high compression performance, flexibility
Keywords: Bitrates, PSNR, bandwidth, multi-users scenario and robustness. However, the performance improvements
and RDO. come at a cost of significantly high computational
complexity. Therefore, encoder implementations should
1. Introduction make use of the available coding tools effectively to achieve
the desired compression performance with the available
The past few decades, starting in the early nineties, a processing resources.
remarkable development has been achieved in the field of H.264/AVC is an extremely scalable video codec,
video compression. A lot of efforts were and still are being delivering excellent quality across the entire bandwidth
exerted for compressing, storing data in digital medium and spectrum, from high definition television to the video
allocation over the web. conferencing and 3G mobile multimedia. The following can
It is very crucial to have the idea of monochrome digital thusly be summarized as the important differences.
video data sequence which is a set of individual pictures • Enhanced motion prediction capability
called frames happening at predetermined time increments. • Use of a small block-size exact match transform
This frame needs to be considered as a light intensity of two
• Adaptive in-loop deblocking filter
dimensions in terms of function of variable x and y; f(x, y),
• Enhanced entropy coding methods
where x and y denote special coordinates and the value off at
any point (x, y) is proportional of the brightness of the frame
or the gray level at the point for monochrome. The normal
standard speed at which these frames are displayed is 30
frames per second.
This representation is called canonical representative.
However, this canonical representation has negative impact
because it needs very huge amounts of memory, resulting in
impracticality of being stored or shared on the web or to be
launched into digital channel. The fact may seem as an
amusing game when we try to illustrate how it could be
done.
2.2 Scalable Video Coding of a given video. Doing this, the encoder would be looking
to minimize distortion in a sequence of particular video.
Scalable video coding is desirable in heterogeneous and Rate-distortion Optimisation (RDO) methods used in video
error-prone environments for various reasons. For example, compression are discussed in [6] [2], which include dynamic
scalable coding helps streaming servers avoid congestions in programming and Lagrange optimisation methods.
network by allowing the server to reduce the bitrate of A Lagrange optimisation method, which is also known as
bitstreams whilst still transmitting a useable bitstream. Lagrange multiplier method, offer computationally less
One application for scalability is to improve error complex (although sometimes sub-optimal) solutions to the
resilience in transport systems that allow different qualities optimisation problem was proposed. Due to its less complex
of service. nature, a specific form of the Lagrange optimisation method
For example, the essential information could be delivered has been used in rate-distortion optimisation of H.264/AVC
through a channel with high error protection. Scalability can [10].
also be used to enable different quality representations
depending on playback devices processing power. 3.2 Constrained Optimisation Problem
Devices with better processing power can decode and
display the full quality version, whereas devices having The objective function within source constraints is
lower processing power decode the lower quality version. minimized or maximized by the constrained optimization.
In the case here of video coding standards, this issue of
2.3 Types of SVC constrained optimization can be considered as reducing the
There are three conventional types of scalability: temporal, amount of distortion of a given video sequence meaning to
quality and spatial. Temporal scalability enables adjustment strive looking to increase the number of bits that can be
of picture rate. encoded in exactly that particular coding sequence[4].
a) This is commonly carried out with either disposable Below is the mathematical representation of the constrained
pictures or disposable sub-sequences, which are optimization unit;
explained later on. Picture rate adjustment is then Let S represent all the allowable vectors and let B an
simply done by removing these disposable parts element of S, (BЄS). The objective function is defined for all
from the coded sequence thus lowering the frame B in S as D(B) and the constraint function R(B) is defined
rate. for all B in S . The constrained problem can be presented as:
b) In conventional quality scalability, also known as Given a constraint Rc, find
SNR scalability, an enhancement layer is achieved
with pictures having finer quantizers than the
particular picture in the lower reference layer[3]. BЄS
4. Methodology
Now, for the above subset and for any λ≥0:
4.1 Objective video quality measurement
5. Implementation
This may be written as an unconstrained problem using a For the objectives to be achieved, software video simulation
Lagrange multiplier: tool JSVM was used to implement and test the algorithms.
There are plenty and different H.264/AVC reference
software. JSVM was chosen for this research due to its
flexibility of varying parameters.
Where the solution to (4-11), M*, would satisfy, JSVM codec is commonly used to test new algorithms in
the video community. The use of this reference software
4 (IJCNS) International Journal of Computer and Network Security,
Vol. 2, No. 4, April 2010
enables realistic comparison of the performance of different slices one by one. If the link is still overloaded, the spatial
algorithms developed by different researchers. The source enhancement layers are dropped next in the same spirit, i.e.,
code is mainly the same as the one used in the C scale out the enhancement layers completely sticking only to
programming language [8]. the base layer. The optimized SVC offers better quality than
the unoptimized SVC one
6. Results analysis
Based on Lagrangian Cost Function, if a video frame is to Flower 40.3215 39.0235 37.5468 37.6256
be sent on the outgoing link, it is first placed in the output Claire 36.2597 36.4566 31.2564 32.2564
buffer. Note that, for simplicity, we don’t consider the buffer
Carphone 41.3255 38.4552 38.2545 39.2545
limitations for the simulations in here.
If the outgoing link cannot accommodate all the video
packets, it will first drop the additional enhancement PR
(IJCNS) International Journal of Computer and Network Security, 5
Vol. 2, No. 4, April 2010