Professional Documents
Culture Documents
Authorized licensed use limited to: Maharaja Surajmal Institute of Technology. Downloaded on March 25, 2009 at 00:34 from IEEE Xplore. Restrictions apply.
EUROCON 2003 Ljubljana, Slovenia
For the set of distorted images, the MOS values were impairment scale method with fivc-grade impairment s c a k
obtained from an experiment involving 20 non-expert described in ITU-R BT Rec. 500 [7].
viewers. The testing methodology was the double-stimulus
TAULE I
PICTUREQUAI.ITY MEASURES
Nonnalised Cross-Correlation
Average Difference
Structural Content
Maximum Difference
When the tests span the full range of impairments (as in Viewing distance was 4.H, where H is image height
our experiment) the double-stimulus impairment scale displayed on monitor in full resolution. In addition to MOS:
method should be used. The method uses the five-grade we used PQS methodology proposed in [ 6 ] . The PQS has
impairment scale with proper description for each gradc: 5 - been developed for evaluating the pcrceived quality of
impcrceptible, 4-perceptible, but not annoying, 3-slightly compressed images. It combines various perceived
annoying, 2-annoying and I-very annoying. At the end of the distortions into a single quantitative measure. To do so, PQS
series of sessions, MOS for each test condition and test image methodology uses some of the properties of HVS relevant to
are calculated: global image impairments, such as random errors, and
emphasizes the perceptual importance of structured and
localized errors. PQS is constructed by regressions with
MOS, which is 5-level grading scale. PQS closely
where i is grade andp(i) is grade probability. approximates the MOS in the middle of the quality range [XI.
To perform subjective assessment o f picture quality we For very high quality images it is possible to obtain values of
developed an application in Visual Basic, which enables PQS larger than 5 . At the low end of the image quality scale,
equal viewing conditions for all viewers in our laboratory PQS can obtain negative values (meaningless results).
environment and oreciselv follows recommendation 171.
. ., . .. .
..
( e ) Lena
SFM= l6.l67,SAM= 126.77 SFM= 14.019,SAM=227.43
Fig. I . Tesl images
234
Authorized licensed use limited to: Maharaja Surajmal Institute of Technology. Downloaded on March 25, 2009 at 00:34 from IEEE Xplore. Restrictions apply.
EUROCON 2003 Ljubljana, Slovenia
111. COMPRESSION
TECHNIQUES IV. TESTIMAGES
To produce test images for our objective and subjective The fundamental difficulty in testing image compression
picture quality assessments we used three different image system is how to decide which test images to use for the
compression systems: JPEG [9], [IO], JPEG2000 [II],[I21 evaluations. The image content being viewed influences the
and SPIHT [13]. JPEG (Joint Photographic Experts Group) perception of quality irrespective of technical parameters of
corresponds to the ISOilEC international standard 10928-1 the system [19]. We have selected three test images
for digital compression and coding of continuous-tone (512x512, 8 bitdpixel) with different spatial and
(multilevel) still images. Image compression scheme in JPEG frequency characteristics: Baboon, Goldhill and Lena, Figure
is based on Discrete Cosine Transform (DCT) [14]. I . The spatial frequency measure (SFM) indicates Ihe overall
JPEG2000 should provide low bit-rate operation (below activity level in an image [20]. SFMis defined as follows:
0.25 bitsipixel) with subjective picture quality performance
superior to existing standards. Image compression scheme in S F M = m ,
JPEGZOOO Part I is based on discrete wavelet transform
(DWT) [ I S ] , [16]. In our experiment 152000 implementation
of JPEG2000 codec is used [ 171.
Set Partitioning in Hierarchical Trees (SPIHT) coding
-
aleorithm introduced by Said and Pearlman is a vew efficient
technique for wavelet image compression. SPIHT is
improved and extended version of Embedded Zerotree where R is IOW frequency, C iS column frequency and x,:~
Wavelet (EZW)
~, - _
codine aleorithm introduced bv J.M. Shaoiro denotes the samples of image; M a n d N are numbers of pixels
in horizontal and vertical directions.
[I81 and it is one ofthe best wavelet coder today.
(a) Baboon
(b) Goldhill
(C) Lena
Fig. 2. PSNR (in dB), PQS and MOS mulls for test images (a) Baboon, (b)Goldhill. (c) Lena.
and compression sysremr denoted as (+ - JPEG2000: 0 - SPIHT; -
JPEG)
23 5
Authorized licensed use limited to: Maharaja Surajmal Institute of Technology. Downloaded on March 25, 2009 at 00:34 from IEEE Xplore. Restrictions apply.
EUROCON 2003 Ljubljana, Slovenia
Spectral activity measure (SAM) is a measure of image where x and x' are two series between which correlation has
prcdictability and it is evaluated in frequency domain [ I 1: to be found. The possible values o f r are between - I and + I ;
the cluser r is to -1 or + I , the better the correlation is. The
last row in the Table I I contains average absolute values of
correlation coefficients for each objective measure. The
values of correlation coefficients indicate that commonly
used measures of visual quality PSNR and MSE cannot be
where Ffj,k) is fj,k)-th DFT coefficient of image. SAM has a reliably used with all techniques, because they have poor
dynamic range of [ I , a).Higher values of SAM imply higher correlation with MOS. PQS incorporates model of H V S and
predictability. Active imagcs (SAM close to 1) are in gcneral leads to the best correlation with MOS for all three
difficult to code. coinpression systems and all test images but it needs too
Test image Baboon has large SFM and small SAM. Large much time to be evaluated (approximately 15 sec per image
value of SFM means that image contain components in high in our test). Beside PQS, measures with good correlation with
f q u c n c y area and small value of SAM mcaiis low MOS are MD, LMSE, NAE and NK (scc nvcragc absolutc
predictability. For typical natural image largest value of SFM values of r in Table 11). MSE, PSNR and SC cannot be
implies smaller value of SAM. Images Goldhill and Lena are reliably used with all techniques, because they have poor
images with less detail (smaller SFM) than Baboon. correlation with MOS for some of them. The poorest
correlation has AD.
V. RESULTS To evaluate usefulness of each quality measure in tested
compression systems we found average absolute values of
Test images Baboon, Lena and Goldhill are coded using correlation coefficients for each compression system. Results
JPEG, JPEG2000 and SPIHT. For each test image and are shown in Table 111. Table 111 indicates that PQS is
compression method, nine different bit rates are selected: 0.1; excellent measure of picture quality for all compression
0.2; 0.3; 0.4; 0.5; 0.75; I ; 1.5 and 3 bits per pixel (bpp). systems. In PEG2000 and SPIHT compression systems MSE
Objective and subjective picture quality measures are
should be used instead of PSNR because of its bctter
calculated for all images. Results for PSNR. PQS and MOS correlation with MOS. For JPEG2000 and SPIHT
are presented in Figure 2. For some very low quality images comprcssion systems MD, MSE, LMSE and NAE measurcs
PQS is out o f range.
demonstrate very good results. For JPEG compression system
PQS and MOS use the same quality scale so direct good results are achieved using PSNR, MD and LMSE.
comparison between these two measures is possible. On the In some image coding application, it is not appropriate to
other hand, PSNR values depend very much on image compute PQS because o f its time expensiveness. Maximum
content. For example, PSNR of image Lena is through all difference (MD)has a good correlation with MOS for all
compression ratios for about 8-1 I dB higher than PSNR for tested compression techniques so w e propose this very simple
image Baboon. PSNR can not be used for quality comparison measure as a reference for measuring compressed picture
of different images. If we consider only PSNR values (Figure quality in different compression systems. LMSE has also
2) we can conclude that JPEGZOOO and SPlHT provides good correlation with MOS but this measure is not so simple
better picture quality than JPEG. If we take into account as MD (see equations in Table I for MD and LMSE).
visual picture quality quantified by MOS, the conclusions are
quite different. At high and moderate bitrates (above 0.75 VI. CONCLUSION
bpp) for all test images JPEG produces better visual picture
quality than wavelet-based techniques (JPEG2000 and The results of an evaluation conceming the usefulness of a
SPIHT). At low bitrates (below 0.5 bpp) JPEG picture number of objective quality measures in image compression
quality degrades below SPIHT and JPEGZOOO picture systems have been presented, In addition, picture quality is
quality, because of the artefacts introduced by block-based measured subjectively using perceived picture quality. The
DCT scheme. It is clear example that PSNR can not be used correlation between each objective measure and subjective
as definitive picture quality measure. PQS grades follow the measure is found. We demonstrated that for a given
trend of MOS'grades but MOS results show that human compression system a group of numerical objective measures
observers have more tolerance for moderately distorted could reliably be used to specify the magnitude of
images than PQS. degradation in reconstructed images. We also demonstrated
that this group of objective measures is different for different
Table II shows the correlation between the numerical
compression systems. We proved that MSE and PSNR, as
objective quality measures introduced in Table I and MOS.
traditionally used objective measures o f picture quality, are
As a measure of the extent of the linear relationship, the
not adequate as perceptually meaningful measures in all
Pearson product-moment (r) was used [ZO]. Correlation
tested compression systems. We found out that PQS is the
coefficient is defined as:
most correlated measure with MOS for all compression
techniques. In some image compression application, i t is not
236
Authorized licensed use limited to: Maharaja Surajmal Institute of Technology. Downloaded on March 25, 2009 at 00:34 from IEEE Xplore. Restrictions apply.
EUROCON 2003 Ljubljana, Slovenia
possible 10 compute PQS because of' its time expensivcness. tested compression techniques. So, we propose use o f M D for
So we considered other objective measures o f picture quality comparisoii of picture quality across different compression
for each compression technique and we found that maximum systems because of its good cnrrelation with MOS and
difference (MD) has a good correlation with MOS for all computing simplicity.
TABLEII
C o K n e i ~ T l o NC O E F F ~ C W T SFOR EACH COMPRESSIONTECllNlQUE ANn TEST IMAGE
~ -
Test
Codcc MSE PShR AD SC NK ,MD LMSE NAE PQS
Image
JPEG2000 -0.95506 0.94071 0.84244 -0.92320 0.942M -0.97625 -0.98814 -0.98912 0.99054
Baboon SPlHT -0.961 17 0.93149 ~0.50867 -0.95273 0.95803 -0.98784 -0.98947 -0.98905 0.98951
JPEC -0.89973 0.88491 0.38663 -0.94490 0.92001 -0.91574 -0.90406 -0.93453 0.97878
JPEGZOOO -0.97097 0.83227 0.69434 -0.96565 0.97000 -0.96746 -0.93256 -0.95399 0.97033
Goldhill SPlHT -0.96723 0.86626 -0.80212 -0.97618 0.97247 -0.96992 -0.96155 -0.96573 0.94765
JPEG -0.74839 0.89574 0.50969 0.42746 0,80153 -0.94067 -0.90946 -0.84936 0.97073
JPEG2000 -0.98327 0.88481 -0.71231 -0.95673 0.97326 -0.97585 -0.95612 -0.97600 0.991 I 1
Lena SPlHT -0.97636 0.88636 0.85532 -0.97525 0.97609 -0.97765 -0.95429 -0.97363 0.98234
JPEG -0.68045 0 91077 -0.43969 0.56470 0.80798 -0.85259 -0.90469 -0.78024 0.98867
Avcrage absolute
0.90 0.89 0.64 0.85 0.92 0.95 0.94 0.93 0.98
YnlUCs "fr
TABLEIII
AVERAGE AUSOLUTE VALUES OFCORRELATION COEFFICIENTS FOR EAClt COMPKlPlON SYSTEM
Codec
-MSE PSNR AD
~
SC NK MD
- LMSE NAE PQS
lPEG2WO 0.97 0.89 0.75 0.95 0.96 0.97 0.96 0.97 0.98
SPlHT 0.97 0.89 0.72 0.97 0.97 0.98 0.97 0.98 0.97
JPEG 0.78 0.9 0.45 0.65 0.84 0.9 0.91 0.85 0.98
231
Authorized licensed use limited to: Maharaja Surajmal Institute of Technology. Downloaded on March 25, 2009 at 00:34 from IEEE Xplore. Restrictions apply.