Professional Documents
Culture Documents
'Inst. Info. Sci. and Elec. University of Tsukuba, Tsukuba, Ibaraki, 305 JAPAN
2Doctoral Program in Engineering, University of Tsukuba, Tsukuba, Ibaraki, 305 JAPAN
656
0-8186-7128-9/95$4.00 0 1995 IEEE
alternation of the sequence of boundary points. So in In this process, the set of boundary points between
the third stage, redundant feature points are removed. any adjacent feature points is approximated with a
Thus, a method of extracting feature points by piecewise polynomial.
multi-stage algorithm is defined as a construction de- Parametric expression is one of the methods for
sign of extracting feature points. The details of this easily showing a curve on two-dimensional space.
algorithm are presented later. It is used in the present algorithm. Each interval
{(zj3,yj3)}y:=l between any adjacent feature points is
2.2.1 Feature Points Extraction Based on approximated by ( s , ( t ) , s y ( t ) )with parameter t . Here,
Digit a1 Curvature n3 is the number of points in the sequence, and s O ( t )
and s y ( t ) is piecewise polynomials of degree 2, ap-
A method of extracting plain feature points based on
digital curvature [4] is presented. Each character of proximating points {(tj3, zj3)}y:=1 and {(tj3,uj3)}7:=i
the source ima e is N x N [pixel/font] of a binary respectively, when tj, = i3 - 1.
image. A set f ( ~ i ~ , g j ~ ) } y ; =
of~ boundary points is Let n be the dimension of the piecewise golyno-
extracted out of the binary image by tracking the edge mial. When n 2 3, the parameters { c ~ , , } ; , ; - ~and
according to 8-neighbor-connection. If the number of {cv,,}y;f2 of s Z ( t ) and s y ( t )are determined to min-
boundary points n1 is less than 8, they are regarded imize the square error Q:
as noise, and removed.
First, a method of extracting right an led corners
is presented. At every point { ( t i l , gil)}$=l of each
set, digital curvature is evaluated as
for
1 = -2, -1, ..., n - 3 (3)
respectively.
where In order to judge if least squares approximation
converges,
ailK1 = ( Z i l + K l - Zil 1 W1+K1 -
bilKl = ( Z i l - K 1 - Zil,Y i l - K 1 - yil)-
Here the symbol '-'denotes the inner product of vec-
tors. If the point satisfying the condition P j l ~ =l 0 is found, and n is increased until e < 0.90. Thus t e (h4)
exists, it becomes a feature point at a right angled set of boundary points between any adjacent feature
corner. The constant is set as K 1 = 3. points is approximated by a piecewise polynomial in
Next, a method of extracting feature points which the first process.
are both ends of a straight line is presented. If Additional Feature Points
more than I f 2 points which satisfy the condition of In this process, connections between different
P~,K = ~-1, beginning and ending points of this se- classes of the contour such as straight lines, arcs, and
quence become feature points. We call the beginning free curved lines are extracted as feature points from
point "the feature point of a straight line". The con- the curvature of the approximated contour determined
stant is set as li'2 = 25. in the first process.
When these processes are done for all il(i1 = Next property [5] shows equations for analytically
1,2, ...,nl), right angled corners and both ends of finding the curvature from the piecewise polynomial
straight lines on the contour {(zj,, yi,)}~~=, are found to express the contour.
as feature points {(xi,( 4,yi2
( 4)};+. n2 is the number P r o p e r t y 1 A piecewise polynomial of degree ( m - 1)
of feature points extracted in this stage. is represented in the
f o r m s,(t) = C~~_",+l.cZ,Nk,m(t). Then, the j - t h
2.2.2 Feature Points Extraction Based on derivative of the piecewzse polynomial of order m is
Analog C u r v a t u r e determined as, for Cm-s 5 t 5 &,+l,
A method of extracting additional feature points
based on curvature of the approximated contour is
presented.
The second stage consists of two processes. First,
each set of boundary points between any adjacent fea- with
ture points is approxima.ted by a piecewise polynomial.
Then, connections between different classes of the con-
tour such as a straight line, arc, and free curved line
are also extracted as feature point,s based on curvature
of the approximated contour.
Function Approximation
657
From this property, the curvature IKI is determined feature points may exist between continuous straight
as, for t E [ e h ,& + I ] , lines. If the font is not inferior in quality without these
feature points, they can be removed as redundant fea-
sL(t)s”(t) - s;(t)s&(t) ture points. The detail is as follows.
n(t) = Each feature point (zia ( J ) );,=,
( J ) ,yja is estimated
{sL(t)2 + sI/(t)2}3/2 *
where i 2 = 1 , 2 , . .. If there are more than two fea-
Then, additional feature points are also extracted by ture points of a straight line, let the set of these feature
using this curvature as follows. points be {(zi;), ~ i f ) ) } T = ~ Let~ .the distance between
First, both ends of straight lines are extracted as
feature points. The curvature I ~ ( t j , ) is l evaluated for
(zKil, y g i l ) and a straight line, which is connected
.
i 3 = 1 , 2 , 3 , . .. Where the condition Itc(ti,)l < IC3
between (&?,Y%’) and (zn6+2,~ns+2)r (4 (4 be Ln,+l. If
continues in IC4 points, that interval is estimated as Ln,+1 is less than 1C7, (t,,+l,ynd+l)(4 (4 are removed as
E
a straight line. Let [tal,t,, be this interval, and let
t:l and t:, be both ends o this interval rounded t o
the decimal place. Then the beginning point ( x ~ ; ~ +
redundant feature points.
After this process is repeated for all feature points
+ + + between (&), yi;)) and (x!~) I4 ’ y!$)),
$4 where i4 = ns+
1 , ~ t ; ~ 1) and the ending point (ztZ1 1,ytZl 1)
are also extracted as feature points. We mark the
2,ns + +
3,-..,ne 1. Let the distance between
( 4)}~~;~,+,and the straight line be { L i s } ,
{ ( x i( sJ ) ,yis
beginning point ’feature point of straight line’. The
constants are set as 1<3 = 1/200 and IC4 = 30. and let the maximum of {Li,} be L,,. If L,, 2 IC7,
Next, both ends of arcs are extracted as feature ( z $ f ~ l , y ~ fis~ notl ) removed, and the estimation of
points. The curvature I ~ ( t i , ) lis evaluated for is = that interval is stopped. The constant is set as Ki. = 2 .
1 , 2 , 3 , .. .. Let k be the mean value of an already After this estimation is done for all feature points,
evaluated curvature in the interval. It is renewed ev- then redundant feature points of straight line are re-
ery time the curvature is evaluated, until k is not in moved.
+
[k - IC51 and [k Kg]. Then the interval, the points in Next , feature points between continuous arcs are
which had been evaluated in this process, is estimated removed. Some feature points may exists between
as a candidate for an arc. If the central angle of it is continuous arcs. If the font is not inferior in qual-
more than K G ,the interval is estimated as an arc. Let ity without these feature points, they are removed as
[t8,,t,,] be this interval, and let t:z and t;, be both redundant feature points. The detail is as follows.
ends of this interval, rounded to the decimal place.
+
Then the beginning point (ztz2 1,ytZ2 1) and the + Each feature point (z!:), yi2?) is estimated where
i 2 = 1 , 2 , . . .. If there are more than two feature
ending point (zt:, + +
1, yt:, 1) are also extracted as points in the arc, let the set of these feature points
feature points. We mark the beginning point “feature be { ( x i( ,J ) , yi4
( 4)}?&,. Let, each radius of the arc be-
point of arc”. The constants are set as K5 = 3/400
and ICs = a / 2 so as to not estimate straight lines or ginning from (xi:), yi{)) and ( (4 + ~ , be rns and
(4
x ~ ~ynJ+!)
free curved lines as arcs. r,, 1 respectively, and let each central point of the arc
Thus, the processes is done for all i3(i3 = be r 2 n s l jjns) and (2ns+l,jjnnJ+1)
respectively. If
1,2, ...,n 3 ) , and feature points of straight lines
and arcs are extracted in the sequence of points
{ ( x i 3 ,~ i ~ ) } r ; = ~These
. processes are repeated for all
and
intervals between feature points which are extracted
in the first stage and additional feature points are ex-
tracted as { ( z s ) , y ~ ~ ’ ) } ~ ~ = = nHere
2 + l 1.24 is the total
J(k’nb+l- ~ 8 ) + (jjns+l-
’ jjns)’ <1~9
number of feature points which are extracted in the then they are estimated as a single arc. So
two stages. ( 4 is removed as a redundant feature point
( 4 ynns+l)
(z,,+~,
of the arc.
After this process is repeated for all feature points
2.2.3 Removal Redundant Feature Points
between (&?, y${’) and (zi?, yi()), where i4 = ns +
In Subsubsection 2.2.1 and 2.2.2, feature points are ex-
tracted based on the local variance of the set of bound- + + (4 , (4 is removed if
2, ns 3 , . , n e 1, then (zi4-l
ary points. There may be some redundant feature it is a redundant feature point. The constants are set
points which have been ext,racted because of noise. as IC8 = 1 and K g = 2.
These feature points between continuous straight lines After this estimation is done for all feature points,
and between continuous arcs are removed in this stage. then the redundant feature points of arc are removed.
First, feature points between continuous straight In this subsection, a method of extracting feature
lines are removed. Preparing for this process, fea- points has been presented.
ture points {(q, ( 4, yi2 ( J ) )}y:=l, which have been found 3 Experimental results
in first two stages are sorted in the order of the bound- For verification, handwritten characters, printed
ary points { ( x i , , yil)}y:=l of the source image. Some characters and figures are selected. They are observed
658
Table 1: Data quantity
659