Professional Documents
Culture Documents
Available at http://www.ijcsonline.com/
Department of Computer Science and Engineering, CMRCET, Kandlakoya, Hyderabad, Telangana, India
Department of Computer Science and Engineering, VNRVJIET, Bachupall, Hyderabad, Telangana, India
Department of Computer Science and Engineering, JNTUCEH, JNTUH, Hyderabad, Telangana, India
Abstract
Handwriting interfaces simplifies the communication with the hand-held devices using regional languages. These devices
need to recognize the characters while writing, referred to as online. The online recognizers for regional languages are
becoming popular and yields better recognition performances compared to offline. Offline recognition considers scanned
document images as input instead of ordered sequences of coordinates as online. We attempted to recover the writing
order i.e. the ordered sequences of coordinates from character images and thus an online recognizer can be used to
identify the correct character class. In order to recover the writing order, skeleton graph is constructed from thinned
representation of character image. The endpoints, junction points along with spurious edges formed as a result of
thinning are considered as nodes and the strokes connecting the nodes form edges of the graph. We identified and
proposed rules to recover writing order from eight graph structures possible in Telugu language: S-vertex, S-loop, Stouching, T-vertex, T-loop, D-line, M-loop and L-loop. The strokes are ordered from top to bottom using bounding boxes
and recovered writing order of about 97% of character images with the proposed start and end point identification rules.
Keywords: Writing order recovery, Telugu Character, Graph based.
I.
INTRODUCTION
LITERATURE
347 | International Journal of Computer Systems, ISSN-(2394-1065), Vol. 03, Issue 04, April, 2016
computation based on SLALOM. The doubletraced/terminal segments which are identified by odd
degree nodes in the skeleton graph are resolved using
probability framework based on angles measured using
Principal Component Analysis (PCA). All the even degree
nodes are analyzed by the node traversing rule (NTR). The
authors claimed that the proposed method is better than the
method proposed by Jager, Huang et al. and Kato et al.
(Yoshiharu & Makoto, 2000). The proposed method for
identification of the double-traced/terminal segment is
more robust and smoothness optimization is
computationally efficient.
Qiao et al. (Qiao, et al., 2005) recovered the writing
order by finding the smoothest Euler path from the skeleton
graph within the framework of Edge Contiguous Relation
(ECR). The authors obtained the possible ECRs at each of
the nodes by analyzing locally followed by a global trace to
find candidate Euler paths and finally the smoothest one is
selected. The two main contributions are: obtaining
possible ECRs at even nodes based on two simple
assumptions and a method to identify double traced lines
by using weighted matching of general graph. They
claimed that their method achieved 95.2% correct recovery
rate.
Rousseau et al. (Rousseau, et al., 2005) improved the
Kato and Yasuhara (Yoshiharu & Makoto, 2000)
algorithm. The authors proposed a methodology for
searching start and end points, identifying drawing
direction and another algorithm for reconstruction of online
information. They proposed criterion for selecting the best
path from the possible paths after identifying the start and
end point pairs. This method also restricted to single stroke,
but relaxed the other two constraints imposed by Kato and
Yasuhara. The authors reported 99.6% of correct
identification of start and end points, 97% of correctly
identified start and end points reconstructed that contains
the good path and finally 93% of times the good path
selected from the paths reconstructed. The recognition
results of the online data and the original online data are
roughly the same.
Qiao et al. (Qiao & Yasuhara, 2006) proposed recovery
of writing trajectory from multi stroke static handwritten
images by searching the best matching writing paths with
that of template strokes. To reduce the search space, the
matching cost function defined as the summation of
positional distortion cost and directional difference cost
combined with bidirectional search algorithm based on
dynamic programming is used to find the best matching
path. The authors claimed 94.5% of strokes recovered
correctly while holding the start/end vertex constraint and
is increased to 98% if the start and end points are supplied
explicitly.
Cordella et al. (Cordella, et al., 2010) proposed a
method to extract the dynamic information from skeleton
graphs. They transformed the complete skeleton graph all
except when the start and end points each have an even
degree. The Eulerian trail in the transformed graph found
using the Fleury's algorithm (Taylor, ). The experimental
results show that the writing order produced even in
handwriting with retracing, crossings and pen-ups.
Nagoya et al. (Nagoya & Hiroyuki, 2011) solved two
major problems of writing order recovery from single
348 | International Journal of Computer Systems, ISSN-(2394-1065), Vol. 03, Issue 04, April, 2016
III.
349 | International Journal of Computer Systems, ISSN-(2394-1065), Vol. 03, Issue 04, April, 2016
ii.
(a)
(b)
(c)
(d)
350 | International Journal of Computer Systems, ISSN-(2394-1065), Vol. 03, Issue 04, April, 2016
351 | International Journal of Computer Systems, ISSN-(2394-1065), Vol. 03, Issue 04, April, 2016
(a)
(b)
(a)
(b)
352 | International Journal of Computer Systems, ISSN-(2394-1065), Vol. 03, Issue 04, April, 2016
In Figure 10(a), the edge labelled 1 connected to the Dline labelled 2 which in turn connected to the edge labelled
3 to recover the black constituent edge in Figure 10(b). The
edge labelled 4 connected to the D-line labelled 2 which in
turn connected to the edge labelled 5 to recover the blue
constituent edge in Figure 10(b).
(a)
(b)
(
b)
(b)
(a)
(b)
353 | International Journal of Computer Systems, ISSN-(2394-1065), Vol. 03, Issue 04, April, 2016
(a)
(b)
Figure 14: (a) The thinned image (b) The order of strokes
recovered (numbered 1-4)
RS / TS
265/267
261/271
230/263
262/271
262/265
265/265
235/268
260/262
264/269
270/279
277/278
268/278
CID
001
004
008
011
014
017
020
023
026
029
032
035
RS / TS
267/279
251/260
230/244
277/281
262/272
270/276
269/269
278/278
265/274
255/264
260/272
277/277
CID
002
005
009
012
015
018
021
024
027
030
033
036
RS / TS
282/282
252/258
250/266
267/272
261/276
278/279
267/274
253/274
270/280
272/272
270/282
251/271
037
040
043
046
049
052
055
058
061
064
067
070
073
076
081
084
087
090
093
096
099
102
105
108
111
114
117
120
123
126
129
132
135
138
160
259/270
260/270
276/280
222/266
277/277
272/273
279/279
277/277
277/277
268/268
257/257
262/270
275/276
263/270
260/262
275/275
287/287
257/270
235/257
266/271
263/265
256/264
266/272
278/278
260/265
269/282
269/273
274/283
262/277
268/280
282/292
242/262
278/278
295/307
275/275
038
041
044
047
050
053
056
059
062
065
068
071
074
077
082
085
088
091
094
097
100
103
106
109
112
115
118
121
124
127
130
133
136
141
163
305/314
241/267
270/282
278/278
253/270
273/276
278/279
277/278
272/272
241/266
250/268
290/291
244/260
273/273
260/261
278/279
235/270
265/269
242/277
255/261
261/269
260/276
265/269
269/272
256/264
271/273
261/265
264/277
266/280
254/272
263/269
274/274
246/274
229/239
265/265
039
042
045
048
051
054
057
060
063
066
069
072
075
079
083
086
089
092
095
098
101
104
107
110
113
116
119
122
125
128
131
134
137
142
164
274/278
270/276
271/271
265/278
257/274
272/277
280/280
269/270
270/270
281/283
266/272
253/257
286/296
261/272
273/274
256/265
276/276
247/262
275/287
266/273
258/266
256/268
275/277
264/270
254/260
275/276
255/274
258/268
260/269
237/260
232/268
271/274
254/267
274/274
266/267
[2]
354 | International Journal of Computer Systems, ISSN-(2394-1065), Vol. 03, Issue 04, April, 2016
[5]
[6]
[7]
[8]
[9]
[10]
[11]
[12]
[13]
[14]
[15]
[16]
[17]
[18]
[19]
355 | International Journal of Computer Systems, ISSN-(2394-1065), Vol. 03, Issue 04, April, 2016