You are on page 1of 5

EUROCON 2003 Ljubljana, Slovenia

Picture Quality Measures in


Image Compression Systems
Marta Mrak, Sonja Grgic, and Mislav Grgic
University of Zagreb, Faculty of Electrical Engineering a n d Computing
Unska 3 / XII, HR-10000 Zagreb
CROATIA
E-mail: marta.mrak@fer.hr

The evaluation of lossless image compression techniques


Ahslroet----l major problem in evaluating picture quality in
is a simple task where compression ratio and cxecutioii lime
image compression systems is the extreme difficulty in
describing the type and amount of degradation in reconstructed are employed as standard criteria. The picturc qiinlity bcforc
image. Because of the inherent drawbacks associated with the and after compression is unchanged. Contrary, the evaluation
subjective measures of picture quality there has been a great of lossy techniques is difficult task because of inherent
deal of interest in developing an objective measure that can be drawbacks associated with both objective and subjective
used as B substitute. The aim of this paper is to examine a set of measures of picture quality. Objective measures of picture
objective picture quality measures for application in still image quality do not correlate well with subjective quality measures
compression systems and to highlight the correlation of these [3], [4]. Subjective assessment of picture quality is time
measures with subjective picture quality measures. Picture consuming process and results of measurements should be
quality is measured using nine different objective picture processed very carefully.
quaiity measures and subjectively using Mean Opinion Score
(MOS) as measure of perceived picture quality. The correlation In this paper we attempt lo evaluate and compare objective
bctwecn each objective measure and MOS is found. The cffccts and subjective picture quality measures. As lest images we
of different image compression algorithms, image contents and used images with different spatial and frequency
compression ratios are assessed and the best Objective measures characteristics. Images are coded using three different
are proposed. Our results show that some objectivc measures
compression algorithms. The paper is structured as follows.
correlate well with the perceived picture quality for B given
In section 2 we define picture quality measures. In section 3
compression algorithm but they are not reliable for an
evaluation across different algorithms. So, we compared we briefly present image compression systems used in our
objective picture quality measures B C ~ D S Sdifferent algorithms experiment. In Section 4 we evaluate statistical and
and we found measures, which sewe well in all tested image frequency properties o f test images. Section 5 contains
compression systems. numerical results of picture quality measures. In this section
we analyse correlation of objective measures with subjective
Index Terms--Correlation, JPEG, JPEG2000, Objective grades and we propose objective measures which should be
Assessment, Picture Quality Measures, SPIHT. used in relation to each image compression system and
1. INTRODUCTION objective measures which are suitable for the comparison of
picture quality between different compression systems.
W, 1TH the increasing use of multimedia technologies,
image compression requires higher performance. To
address needs and requirements of multimedia and
II. PICTUREQUALtTY MEASURES
Among many objective numerical measures of picture
Internet applications, many efficient image compression quality, that are based on computable distortion measures, we
techniques, with considerably different features, have have chosen those listed in'Table 1.
recently been developed. Image compression techniques In our analysis digital image is represented as MxN matrix,
exploit a common characteristic of most images that the
where M denotes number of columns and N number of rows.
neighbouring picture elements (pixels, pels) are highly
and x ; , ~denote pixel values o f original image before
correlated [I]. It means that a typical still image contains a
compression and degraded image after compression.
large amount of spatial redundancy in plain areas where
adjacent pixels have almost the same values. In addition, still Mean squared error (MSE) and Peak Signal to Noise Ratio
image can contain subjective redundancy, which is (PSNR) are the most common measurcs of picture quality,
determined by properties of human visual system (HVS). despite the fact that they are not adequate as perceptually
HVS presents some tolerance to distortion depending upon meaningful measures [ 5 ] . In addition to objective measures
the image content and viewing conditions. Consequently, listed in Table I, we chose to use perception based objective
pixels must not always be reproduced exactly as originated evaluation, quantified by Picture Quality Scale (PQS)[6] and
and HVS will not detect the difference behveen original a perception based subjective evaluation, quantified by Mean
image and reproduced image [2]. Opinion Score (MOS) [7].

0-7803-7763-W03/$17.0002003 IEEE 233

Authorized licensed use limited to: Maharaja Surajmal Institute of Technology. Downloaded on March 25, 2009 at 00:34 from IEEE Xplore. Restrictions apply.
EUROCON 2003 Ljubljana, Slovenia

For the set of distorted images, the MOS values were impairment scale method with fivc-grade impairment s c a k
obtained from an experiment involving 20 non-expert described in ITU-R BT Rec. 500 [7].
viewers. The testing methodology was the double-stimulus
TAULE I
PICTUREQUAI.ITY MEASURES

Mean Square Error

Peak Signal to Noise Ratio

Nonnalised Cross-Correlation

Average Difference

Structural Content

Maximum Difference

Laplacian Mean Square Error

Normalised Absolute Error

Picture Quality Scale

When the tests span the full range of impairments (as in Viewing distance was 4.H, where H is image height
our experiment) the double-stimulus impairment scale displayed on monitor in full resolution. In addition to MOS:
method should be used. The method uses the five-grade we used PQS methodology proposed in [ 6 ] . The PQS has
impairment scale with proper description for each gradc: 5 - been developed for evaluating the pcrceived quality of
impcrceptible, 4-perceptible, but not annoying, 3-slightly compressed images. It combines various perceived
annoying, 2-annoying and I-very annoying. At the end of the distortions into a single quantitative measure. To do so, PQS
series of sessions, MOS for each test condition and test image methodology uses some of the properties of HVS relevant to
are calculated: global image impairments, such as random errors, and
emphasizes the perceptual importance of structured and
localized errors. PQS is constructed by regressions with
MOS, which is 5-level grading scale. PQS closely
where i is grade andp(i) is grade probability. approximates the MOS in the middle of the quality range [XI.
To perform subjective assessment o f picture quality we For very high quality images it is possible to obtain values of
developed an application in Visual Basic, which enables PQS larger than 5 . At the low end of the image quality scale,
equal viewing conditions for all viewers in our laboratory PQS can obtain negative values (meaningless results).
environment and oreciselv follows recommendation 171.
. ., . .. .
..

( e ) Lena
SFM= l6.l67,SAM= 126.77 SFM= 14.019,SAM=227.43
Fig. I . Tesl images

234

Authorized licensed use limited to: Maharaja Surajmal Institute of Technology. Downloaded on March 25, 2009 at 00:34 from IEEE Xplore. Restrictions apply.
EUROCON 2003 Ljubljana, Slovenia

111. COMPRESSION
TECHNIQUES IV. TESTIMAGES
To produce test images for our objective and subjective The fundamental difficulty in testing image compression
picture quality assessments we used three different image system is how to decide which test images to use for the
compression systems: JPEG [9], [IO], JPEG2000 [II],[I21 evaluations. The image content being viewed influences the
and SPIHT [13]. JPEG (Joint Photographic Experts Group) perception of quality irrespective of technical parameters of
corresponds to the ISOilEC international standard 10928-1 the system [19]. We have selected three test images
for digital compression and coding of continuous-tone (512x512, 8 bitdpixel) with different spatial and
(multilevel) still images. Image compression scheme in JPEG frequency characteristics: Baboon, Goldhill and Lena, Figure
is based on Discrete Cosine Transform (DCT) [14]. I . The spatial frequency measure (SFM) indicates Ihe overall
JPEG2000 should provide low bit-rate operation (below activity level in an image [20]. SFMis defined as follows:
0.25 bitsipixel) with subjective picture quality performance
superior to existing standards. Image compression scheme in S F M = m ,
JPEGZOOO Part I is based on discrete wavelet transform
(DWT) [ I S ] , [16]. In our experiment 152000 implementation
of JPEG2000 codec is used [ 171.
Set Partitioning in Hierarchical Trees (SPIHT) coding
-
aleorithm introduced by Said and Pearlman is a vew efficient
technique for wavelet image compression. SPIHT is
improved and extended version of Embedded Zerotree where R is IOW frequency, C iS column frequency and x,:~
Wavelet (EZW)
~, - _
codine aleorithm introduced bv J.M. Shaoiro denotes the samples of image; M a n d N are numbers of pixels
in horizontal and vertical directions.
[I81 and it is one ofthe best wavelet coder today.

(a) Baboon

(b) Goldhill

(C) Lena

Fig. 2. PSNR (in dB), PQS and MOS mulls for test images (a) Baboon, (b)Goldhill. (c) Lena.
and compression sysremr denoted as (+ - JPEG2000: 0 - SPIHT; -
JPEG)

23 5

Authorized licensed use limited to: Maharaja Surajmal Institute of Technology. Downloaded on March 25, 2009 at 00:34 from IEEE Xplore. Restrictions apply.
EUROCON 2003 Ljubljana, Slovenia

Spectral activity measure (SAM) is a measure of image where x and x' are two series between which correlation has
prcdictability and it is evaluated in frequency domain [ I 1: to be found. The possible values o f r are between - I and + I ;
the cluser r is to -1 or + I , the better the correlation is. The
last row in the Table I I contains average absolute values of
correlation coefficients for each objective measure. The
values of correlation coefficients indicate that commonly
used measures of visual quality PSNR and MSE cannot be
where Ffj,k) is fj,k)-th DFT coefficient of image. SAM has a reliably used with all techniques, because they have poor
dynamic range of [ I , a).Higher values of SAM imply higher correlation with MOS. PQS incorporates model of H V S and
predictability. Active imagcs (SAM close to 1) are in gcneral leads to the best correlation with MOS for all three
difficult to code. coinpression systems and all test images but it needs too
Test image Baboon has large SFM and small SAM. Large much time to be evaluated (approximately 15 sec per image
value of SFM means that image contain components in high in our test). Beside PQS, measures with good correlation with
f q u c n c y area and small value of SAM mcaiis low MOS are MD, LMSE, NAE and NK (scc nvcragc absolutc
predictability. For typical natural image largest value of SFM values of r in Table 11). MSE, PSNR and SC cannot be
implies smaller value of SAM. Images Goldhill and Lena are reliably used with all techniques, because they have poor
images with less detail (smaller SFM) than Baboon. correlation with MOS for some of them. The poorest
correlation has AD.
V. RESULTS To evaluate usefulness of each quality measure in tested
compression systems we found average absolute values of
Test images Baboon, Lena and Goldhill are coded using correlation coefficients for each compression system. Results
JPEG, JPEG2000 and SPIHT. For each test image and are shown in Table 111. Table 111 indicates that PQS is
compression method, nine different bit rates are selected: 0.1; excellent measure of picture quality for all compression
0.2; 0.3; 0.4; 0.5; 0.75; I ; 1.5 and 3 bits per pixel (bpp). systems. In PEG2000 and SPIHT compression systems MSE
Objective and subjective picture quality measures are
should be used instead of PSNR because of its bctter
calculated for all images. Results for PSNR. PQS and MOS correlation with MOS. For JPEG2000 and SPIHT
are presented in Figure 2. For some very low quality images comprcssion systems MD, MSE, LMSE and NAE measurcs
PQS is out o f range.
demonstrate very good results. For JPEG compression system
PQS and MOS use the same quality scale so direct good results are achieved using PSNR, MD and LMSE.
comparison between these two measures is possible. On the In some image coding application, it is not appropriate to
other hand, PSNR values depend very much on image compute PQS because o f its time expensiveness. Maximum
content. For example, PSNR of image Lena is through all difference (MD)has a good correlation with MOS for all
compression ratios for about 8-1 I dB higher than PSNR for tested compression techniques so w e propose this very simple
image Baboon. PSNR can not be used for quality comparison measure as a reference for measuring compressed picture
of different images. If we consider only PSNR values (Figure quality in different compression systems. LMSE has also
2) we can conclude that JPEGZOOO and SPlHT provides good correlation with MOS but this measure is not so simple
better picture quality than JPEG. If we take into account as MD (see equations in Table I for MD and LMSE).
visual picture quality quantified by MOS, the conclusions are
quite different. At high and moderate bitrates (above 0.75 VI. CONCLUSION
bpp) for all test images JPEG produces better visual picture
quality than wavelet-based techniques (JPEG2000 and The results of an evaluation conceming the usefulness of a
SPIHT). At low bitrates (below 0.5 bpp) JPEG picture number of objective quality measures in image compression
quality degrades below SPIHT and JPEGZOOO picture systems have been presented, In addition, picture quality is
quality, because of the artefacts introduced by block-based measured subjectively using perceived picture quality. The
DCT scheme. It is clear example that PSNR can not be used correlation between each objective measure and subjective
as definitive picture quality measure. PQS grades follow the measure is found. We demonstrated that for a given
trend of MOS'grades but MOS results show that human compression system a group of numerical objective measures
observers have more tolerance for moderately distorted could reliably be used to specify the magnitude of
images than PQS. degradation in reconstructed images. We also demonstrated
that this group of objective measures is different for different
Table II shows the correlation between the numerical
compression systems. We proved that MSE and PSNR, as
objective quality measures introduced in Table I and MOS.
traditionally used objective measures o f picture quality, are
As a measure of the extent of the linear relationship, the
not adequate as perceptually meaningful measures in all
Pearson product-moment (r) was used [ZO]. Correlation
tested compression systems. We found out that PQS is the
coefficient is defined as:
most correlated measure with MOS for all compression
techniques. In some image compression application, i t is not

236

Authorized licensed use limited to: Maharaja Surajmal Institute of Technology. Downloaded on March 25, 2009 at 00:34 from IEEE Xplore. Restrictions apply.
EUROCON 2003 Ljubljana, Slovenia

possible 10 compute PQS because of' its time expensivcness. tested compression techniques. So, we propose use o f M D for
So we considered other objective measures o f picture quality comparisoii of picture quality across different compression
for each compression technique and we found that maximum systems because of its good cnrrelation with MOS and
difference (MD) has a good correlation with MOS for all computing simplicity.
TABLEII
C o K n e i ~ T l o NC O E F F ~ C W T SFOR EACH COMPRESSIONTECllNlQUE ANn TEST IMAGE

~ -
Test
Codcc MSE PShR AD SC NK ,MD LMSE NAE PQS
Image
JPEG2000 -0.95506 0.94071 0.84244 -0.92320 0.942M -0.97625 -0.98814 -0.98912 0.99054

Baboon SPlHT -0.961 17 0.93149 ~0.50867 -0.95273 0.95803 -0.98784 -0.98947 -0.98905 0.98951

JPEC -0.89973 0.88491 0.38663 -0.94490 0.92001 -0.91574 -0.90406 -0.93453 0.97878

JPEGZOOO -0.97097 0.83227 0.69434 -0.96565 0.97000 -0.96746 -0.93256 -0.95399 0.97033

Goldhill SPlHT -0.96723 0.86626 -0.80212 -0.97618 0.97247 -0.96992 -0.96155 -0.96573 0.94765

JPEG -0.74839 0.89574 0.50969 0.42746 0,80153 -0.94067 -0.90946 -0.84936 0.97073

JPEG2000 -0.98327 0.88481 -0.71231 -0.95673 0.97326 -0.97585 -0.95612 -0.97600 0.991 I 1

Lena SPlHT -0.97636 0.88636 0.85532 -0.97525 0.97609 -0.97765 -0.95429 -0.97363 0.98234

JPEG -0.68045 0 91077 -0.43969 0.56470 0.80798 -0.85259 -0.90469 -0.78024 0.98867
Avcrage absolute
0.90 0.89 0.64 0.85 0.92 0.95 0.94 0.93 0.98
YnlUCs "fr

TABLEIII
AVERAGE AUSOLUTE VALUES OFCORRELATION COEFFICIENTS FOR EAClt COMPKlPlON SYSTEM

Codec
-MSE PSNR AD
~

SC NK MD
- LMSE NAE PQS

lPEG2WO 0.97 0.89 0.75 0.95 0.96 0.97 0.96 0.97 0.98

SPlHT 0.97 0.89 0.72 0.97 0.97 0.98 0.97 0.98 0.97

JPEG 0.78 0.9 0.45 0.65 0.84 0.9 0.91 0.85 0.98

REFERENCES ./ I 1 1. ISOilEC FDlS 15444-1. "JPEG2000 Pan I Final DnR lntemational


Standard'. Aug. 2000.
Ill N. Jayan!. P. Noll. Ilipiad Coding of Wovcfornrr: Pririci1>ler orid [I21 N. Skadns. C. A. Chrislopoulos. and T. Ebnhimi. "Jl'l:(i2000: .The
Appiiuu,iors 10 S p e d ? wid Video. Prcnliee Hall: Washington. 1984. Upcoming Still Image Compression Standard'. Procccdiug.Y of ~ h cIll13
121 N.Jaynn~J. Johnrron. R. Sofnnek, "Signal Compression Bvicd on M d d r Portuguese C o n ~ r m zan Porierri Recogni!ion, Ponu, Panugal. pp.
O ~ H U M ~riceptlon",~m oriEEE.vo~.
. tii.pp. 1385.1422,oct. 1993. 359-366. May 2000.
131 S . Bauer. B. Zavko-Cihlar, and M . Grgic. "The Influence of [I31 A. Said and W.A . Pearlman. "A New. Fast. and BffiCiEnf Image Codcc
Impairments from Digital Compression af Video Signal on Perccivcd Based an Sei Panitioning in Hierarchical Trees". iEEE Trmsocrions on
Piemre Quality". Proceedings e/ :he 3rd lniernoiional Workhop on Circuits ondSys,ems/or Video Jechndo&y,vol. 6, no. 3, pp. 243-249.
Image ond Signgnu/ Procasing, IWISP96. Manchester, pp. 245-248, ,.,"*
,"./. .,".
,001
Nov. 1996. [I41 K. R. Rao, P. Yip, Dircrele Cosine J r m s f i m : Alg~rirhm+,Ah.~rrrogerge~
141 S. Baucr. B. Zavka-Cihlar. M. Grgic. "Objective and Subjcctive andApplicorions, Academic Press: San Dicgo, 1990.
Evaluations of Piclure Quality in Digital Video Systems", hireniaiional [ I S ] M . Antonini, M. Barland, P. Mxhieu, and I. Daubechies, "Image
Canfermce 00 Muitimediu Technolow and Digird Jdecommu,iication Coding Using the Wavelel Transform", IEEE Trans. on lnioge'
Services, lCOMT'96. Budapest, Hungary, pp. 145.150, Oct. 1996. Proeeiiing, vol. I, pp. 205-220, 1992.
I51 S. Grgic. M. Grgic. B. Zovka-Cihlar, "Piccure Quality iincasuremenli in 1161 S. Grgic. K. Kerr. and M. Grgic, "Image Compression Using
Wavelet Compression Sysrem". Procccdingr of the lnrernorional Wavelets", Proccedimgr of rhe IEEE lnlernolionul Symposium U I I
Broodcarring Conveniion, IBC99, Amsttidam. The Netherlands, pp. bdurrrioi E i e c l r o ~ i clSIe99,
~, Bled, Slovenia, pp. 99-104, 1999.
554-559. sep. 1999. 1171 JJ2000 project by EPFL. Ericrson and CRF, World Wide Web
M. Miyahara. K. Katani. and V. R. Algari, "Objective Picture Quality http:il~2000.epfl.ehi
Scale (PQS) for Image Coding". 1181 J. M. Shapiro. "Embedded Image Coding Using ZeroWc~of Wavelet
h[rp:iiinfo.cipic.ucdavir.eduhcriptsirFpnP~g~?96-I 2 Coefficients", IEEE Trorrsocrions on Signal Processing, vol. 41, pp.
ITU, "Methodology for the Subjective Assesimcm of the Quality of 3445.3462, Dec. 1993.
Tclevirion Pictures",lTU-RRcc. EL 500-9, 1998. [I91 S.Grgic, M. M n k and M. Grgie, "Comparison of JPEG Image Caden",
S. Grgic. M. Grgic. B. Zovko-Cihlar, "Perfarmancc Analysis of Image Proceedings of rlic 3rd Inlernailonal Symposium on Video Processing
Compression Using Wavelets". /€E€ Tronsoctionr on Indumial ond Muirimedio Communicoriorir. VPmmCom-2001. Zadai, Croatia.
Elecironicr, vol. 48. ~ $ 1 3,~ 8pp. 682-695. June 2001. pp. 79-85. June 2001
ISOilEC IS 10918. "Digital Compression and Coding of Continuous (201 A . M . Eskicioglu and P. S. Fishcr. "Image Quality Measures and Their
Tone Still Imagcs", 1991 Performance", IEEE Tronr. *on Communieorions, vol. 43. no. 12, pp.
G. K. Wallace. '"The JPEG Still Picture Compression Slandard". 2959-2965, Dee. 1995.
Communicmion "/,he ACM, vnl 34, no. 4, pp. 30-44, 1991

231

Authorized licensed use limited to: Maharaja Surajmal Institute of Technology. Downloaded on March 25, 2009 at 00:34 from IEEE Xplore. Restrictions apply.

You might also like