Professional Documents
Culture Documents
Gaussian blurring does not “respect” the natural bound- not permit space variant blurring. Clearly this is not es-
aries of objects. Suppose we have the picture of a treetop sential to the issue, but it simplifies the analysis greatly.”
with the sky as background. The Gaussian blurring pro- We will show how a suitable choice of c ( x , y, t ) will
cess would result in the green of the leaves getting enable us to satisfy the second and third criteria listed in
“mixed” with the blue of the sky, long before the treetop the previous section. Furthermore this can be done with-
emerges as a feature (after the leaves have been blurred out sacrificing the causality criterion.
together). Fig. 3 shows a sequence of coarser images ob- Consider the anisotropic diffusion equation
tained by Gaussian blurring which illustrates this phe-
nomenon. It may also be noted that the region boundaries I, = div ( c ( x ,y, t ) V l ) = c ( x , y. r ) A l + Vc VI (3)
are generally quite diffuse instead of being sharp. where we indicate with div the divergence operator, and
With this as motivation, we enunciate [18] the criteria with V and A respectively the gradient and Laplacian op-
which we believe any candidate paradigm for generating erators, with respect to the space variables. It reduces
multiscale “semantically meaningful” descriptions of to the isotropic heat diffusion equation I, = cAZ if
images must satisfy. c ( x , y, t ) is a constant. Suppose that at the time (scale)
I ) Causaliry: As pointed out by Witkin and Koender- t , we knew the locations of the region boundaries appro-
ink. a scale-space representation should have the property priate for that scale. We would want to encourage
that no spurious detail should be generated passing from smoothing within a region in preference to smoothing
finer to coarser scales. across the boundaries. This could be achieved by setting
2) Immediate Localization: At each resolution, the re- the conduction coefficient to be 1 in the interior of each
gion boundaries should be sharp and coincide with the region and 0 at the boundaries. The blurring would then
semantically meaningful boundaries at that resolution. take place separately in each region with no interaction
3 ) Piecewise Smoothing: At all scales, intraregion between regions. The region boundaries would remain
smoothing should occur preferentially over interregion sharp.
smoothing. In the tree example mentioned earlier, the leaf Of course. we do not know in advance the region
regions should be collapsed to a treetop before being boundaries at each scale (if we did the problem would
merged with the sky background. already have been solved!). What can be computed is a
PERONA A N D MALIK: SCALE-SPACE A N D EDGE DETECTION 63 1
K S
V
negative. This may be a source of concern since it is
known that constant-coefficient diffusions running back-
Fig. 5 . (TOP
-T-
to bottom) A mollified step edge and its 1st. 2nd, and 3rd
derivatives.
wards are unstable and amplify noise generating ripples.
In our case this concern is unwarranted: the maximum
principle guarantees that ripples are not produced. Exper-
imentally one observes that the areas where 4r(1,) < O
quickly shrink, and the process keeps stable.
V . EXPERIMENTAL RESULTS
We choose c to be a function of the gradient of I : c ( x ,
Our anisotropic diffusion, scale-space, and edge detec-
y, t ) = g(z,(x, y, t ) ) as in (4). Let 4(z,) g ( z x ) . I,
denote the flux c I,. tion ideas were tested using a simple numerical scheme
Then the 1-D version of the diffusion equation (3) be- that is described in this section.
comes Equation (3) can be discretized on a square lattice, with
brightness values associated to the vertices, and conduc-
a tion coefficients to the arcs (see Fig. 7). A 4-nearest-
4 =- ax
4(Ix) = 4 r ( 4 ) 4,. ( 5 ) neighbors discretization of the Laplacian operator can be
used:
We are interested in looking at the variation in time of
the slope of the edge: a/at(Z,). If c( ) > 0 the function I!:,;' = r:,j h[CN +
V N I + cs VsI
I ( ) is smooth, and the order of differentiation may be
inverted: C E . VEI + cw+ oWl]:,j (7)
where 0 IX I 1 / 4 for the numerical scheme to be sta-
ble, N , S, E , Ware the mnemonic subscripts for North,
= 4rr . South, East, West, the superscript and subscripts on the
I:, + 4' * L,. ( 6 ) square bracket are applied to all the terms it encloses, and
Suppose the edge is oriented in such a way that I, > 0. the symbol V (not to be confused with V, which we use
At the point of inflection I,, = 0, and I,,, << 0 since the for the gradient operator) indicates nearest-neighbor dif-
point of inflection corresponds to the point with maximum ferences:
slope (see Fig. 5). Then in a neighborhood of the point
V N I ; , j E Ii- 1.j - 1j.j
of inflection a/at(Z,) has sign opposite to 4r(Z,). If
4r(I,) > 0 the slope of the edge will decrease with time;
V S 1 i . j E I l + ~ , j- ';,I
if, on the contrary 4' (I,) < 0 the slope will increase with
time. V E Z j , j = Z i , j + I - z;,j
Notice that this increase in slope cannot be caused by a
scaling of the edge, because this would violate the max- vU/z;,j = I;,j-l - z;,j. (8)
imum principle. The edge becomes sharper.
The conduction coefficients are updated at every itera-
There are several possible choices of 4 ( . ), for exam-
tion as a function of the brightness gradient (4):
ple, g(Z,) = C / ( 1 + ( I , / K ) ' + " ) with a > 0 (see Fig.
6 ) . Then there exists a certain threshold value related to
-
K , and a , below which 4 ( ) is monotonically increasing,
and beyond which 4 ( ) is monotonically decreasing, giv-
ing the desirable result of blurring small discontinuities
and sharpening edges. Notice also that in a neighborhood
of the steepest region of an edge the diffusion may be
thought of as running "backwards" since + ' ( I , ) in (5) is (9)
PERONA A N D MALIK: SCALE-SPACE A N D EDGE DETECTION 633
5 IM:J(l - X ( c N + CS + CE + cW):,J)
Fig. 9. Effect of anisotropic ditfusion ( b ) on the Canaletto image ( a ) [3]. Notice that the edges remain sharp until their disappearance
Banner.000.30
A . Fixed Neighborhood Detectors
5v1 Lw
W
U
/ L I
Fig. 15. Scale-space using linear convolution. The edges are distorted and Fig. 16. Edges detected by thresholding the gradient in Fig. 14. Linking
the junctions disappear. Images generated using the Canny detector and is not necessary. Thinning is only for the finer scales. Compare to Fig.
smoothing Gaussian kernels of variance (top left to bottom right) 1/ 2 , 17 where thinning and linking have been used.
1, 2, 4, 8, 16 pixels. Compare to Fig. 17 where anisotropic diffusion
preserves edge junctions, shape, and position.
JEN(I)
PERONA AND MALIK: SCALE-SPACE AND EDGE DETECTION 637
compromise between speed and accuracy. entiable on D U aTD, then it obeys the maximum prin-
ciple, i.e., the maximum o f f i n D is reached on the bot-
VII. CONCLUSION tom-side boundary asBD of D:
We have introduced a tool, anisotropic diffusion, that maxf = maxf
we believe will prove useful in many tasks of early vision. D asBD
~
638 IEEE TRANSACTIONS ON PATTERN ANALYSIS A N D MACHINE INTELLIGENCE. VOL. 12, NO. 7. JULY 1990
Corollary: Consider a function f satisfying the hy- Notice that the maximum principle also guarantees that
potheses of the previous theorem, and such thatfis twice there are no local maxima o f f in D U d,D. The same
differentiable on asD, and V,f = 0 (where V, indicates technique used in the proof restricting D to be a cylinder
the gradient operator along the x direction). Then contained in the neighborhood where the local maximum
is a strict maximum may be used to see that the existence
m_axf = maxf.
D aBD of one at p E D U dTD would violate the differential in-
equality.
The following proof is adapted from John [ 101. The corollary may be proven along the same lines: since
Pro08 First consider f satisfying the stricter condi- f i s , by hypothesis, differentiable on 8,Done can use (21),
tion and (22) for any p E d,D, with o in an appropriate hemi-
sphere so that p + E Z I E D .
C ( X ,t)J - C(X, t ) A f - VC * Vf < 0. (21) If a function f satisfies the differential equation
By hypothesis f i s continuous on E,a compact set, hence C ( X ,t ) J - C(X, t ) Af - VC Vf = 0 (23)
it has a maximum in it. Call p = ( y , T ) this maximum.
Suppose that p E D . Since f is twice continuously dif- with the hypotheses already stated on the functions C ( )
and c ( ), the arguments above can be run for f and h =
ferentiable in D we can write the first three terms of the
-fproving that both a maximum and minimum principle
Taylor expansion off about p :
have to be satisfied.
f(p + Eo) = f ( p ) + €of%+ E 2 V T X f V The diffusion equation (3) is a special case of (23) (set
C(x, t ) = 1, and f = I), hence the scale-space brightness
+ W3)% f ( P ) (22) function Z(x, y , t ) obeys the maximum principle provided
that the conduction coefficient c never takes negative value
+ ‘
where U E R” , E E some neighborhood of zero, and Xf (in fact the condition that c does not take negative value
indicates the n + 1 X n + 1 Hessian matrix off. For the wherefhas a maximum is sufficient) and is differentiable.
sake of compactness, unlike in the rest of the paper, Vf If adiabatic (V,f = 0 ) boundary conditions are used then
in (22) indicates the gradient offwith respect to the space the hypotheses of the corollary are satisfied too, and the
coordinates and the time coordinate. Since p is a point maxima may only belong to the initial condition.
where f has a maximum, the gradient Vf in the first order Solutionsfof (3) have an additional property if the con-
term of the expansion (22) is equal to zero therefore the duction coefficient is constant along the space axes: c =
second term cannot be positive, V u E Rn I : o T X f o I +
161 S. Geman and D. Gernan. "Stochastic relaxation. Gibbs distribu- 12 I ] A. Witkin. .'Scale-space filtering." in In/. J o r t ~ rCotfj. Artificiul I n -
tions, and the Bayesian restoration of images." I€€€ Trtrris. Purrrrrr relligeric.c,, Karlsiuhe. West Germany. 1983. pp. 1019-1021,
A n d . Machine Inre//.. vol. PAMl-6. pp. 721-741. Nov. 1984. (221 A. Yuille and T . Poggio. "Scaling theorems for zero crossings."
(71 A. Huminel. "Representations based on zero-crossings in scale- IEEE Trapis, Ptrrterri A M I / .Machir~eIritell.. vol. PAMI-8. Jan. 1986.
space ," in Proc. I € € E Compurer Vision and Ptitrern Rrco,ynirion
Conf.. June 1986. pp. 204-209: reproduced in: Readings i r i Cornpurer
Vision: I.~we.s.Prohlenis. Principles tirid Parcrdrgtm. M. Fischler and
0. Firschein. Eds. Los Altos, CA: Morgan Kaufmann, 1987. Pietro Perona w a s born in Padua. ltaly. on Sep-
181 -, "The scale-space formulation of pyramid data wucturcs." in temher 3. 1961. He received the Doctor degree in
Parallel Computer ViAion, L. Uhr. Ed. New York: Academic. 1987. electrical engineering cum laude Irom the Univer-
pp. 187-223. sit) of Padua in 1985 with a thesis on dynamical
[9] A. Hummel. B. Kimia. and S. Zucker. "Deblurring Gaussian blur.'' systems theor).
Compur. Vision, Gruphics. Image Processing. vol. 38, pp. 66-80. He received the Ph.D. degree from the De-
1987. partment of Electrical Engineering and Computer
[IO] F. John. Parrial Diferrntial Equurions. New York: Springer-Ver- Science of the University of California at Berke-
lag. 1982. ley in 1990. His research interests are in compu-
[ I I ] J. Koenderink. "The stmcture of images." Biol. Cybrrri.. vol. 50. tational and biological vision.
pp. 363-370, 1984.
[I21 J . Malik, "Interpreting line drawings of curved object.." I N . J.
Compur. Vision. vol. I , no. I , pp. 73-103. 1987.
1131 D . Marr, Vision. San Francisco, CA: Freeman, 1982.
1141 J. Marroquin, "Probabilistic solution of inverse problems." Ph.D. Jitendra Malik (A'88) was born in Mathura, In-
dissertation, Massachusetts Inst. Technol.. 1985. dia, on October 1 I . 1960. He received the B.Tech
[I51 -, "Probabilistic solution of inverse problems," Artificial Intell. degree from Indian Institute of Technology. Kan-
Lab., Massachusetts Inst. Technol., Tech. Rep. AI-TR 860, 1985. pur. in 1980 where he was awarded the gold medal
1161 D. Mumford and J . Shah. "Optimal approximation of piecewise for the best graduating student in electrical engi-
smooth functions and associated variational problems." Commun. neering. He received the Ph.D. degree in com-
Pure Appl. Math., vol. 42, pp. 577-685, 1989. puter science from Stanford University, Stanford.
[IT] L . Nirenharg. "A strong maximum principle for parabolic equa- CA. in 1986.
tions," Commuri. Purr A p p l . Mtrrh., vol. VI. pp. 167-177. 1953. Since January 1986. he has been an Assistant
[I81 P. Perona and J . Malik, "Scale space and edge detection using an- Professor in the Computer Science Division. De-
isotropic diffusion." in Proc. IEEE Compur. Soc. Workshop Com- partment of EECS, University of California at
puter Vision. Miami. FL. 1987, pp. 16-27. Berkeley. Since October 1988 he has also been a member of the group in
[I91 -, "A network for edge detection and scale space," in Proc. I€€E Physiological Optics at UC Berkeley. His research interests are in machine
Inr. Symp. Circuits and S y s r m i s , Helsinki. June 1988. pp. 2565-2568. vision and computational modeling of early human vision. These include
1201 A. Rosenfeld and M . Thurston. "Edge and curve detection for v~sual work on edge detection. texture segmentation, line drawing interpretation,
scene analysis.'' / € E € Trans. Compur., vol. C-20, pp. 562-569. May and 3-D object recognition.
1971. Dr. Malik received a Presidential Young Investigator Award in 1989.