Professional Documents
Culture Documents
Recognition
Giorgos Vamvakas
gbam@iit.demokritos.gr
Outline
• Pre-processing
• Segmentation
• Feature Extraction
• Classification
• Post-processing
Feature Extraction Methods
• Statistical
• Structural
Statistical Features
• Zoning
[2] G. Vamvakas, B. Gatos, S. Petridis and N. Stamatopoulos, ''An Efficient Feature Extraction and
Dimensionality Reduction Scheme for Isolated Greek Handwritten Character Recognition'', Proceedings of
the 9th International Conference on Document Analysis and Recognition, Curitiba, Brazil, 2007, pp. 1073-1077.
Experimental Results – Greek OCR
CIL Database
Step 4: Each pattern of the test set is then fed to the initial
classifier with features extracted at level L0. If the classifier
decides that this pattern belongs to one of the single classes
then the unknown pattern is assumed to be classified. Else, if
it is classified to one of the groups of classes then the new
classifier decides the recognition result
Feature Extraction
Hierarchical Classification (v.2)
[5] G. Vamvakas, B. Gatos, S. J. Perantonis, “A Novel Feature Extraction and Classification Methodology for the
Recognition of Historical Documents”, 10th International Conference on Document Analysis and Recognition
(ICDAR’09), pp 491-495, Barcelona, Spain, July 2009
Feature Extraction
Hierarchical Classification (v.3)
Level = 2
Level = 2 Level = 4
[6] G. Vamvakas, B. Gatos, S. J. Perantonis,” Handwritten Character Recognition through Two-Stage Foreground
Sub-Sampling”, Pattern Recognition, accepted for publication
Experimental Results
Databases
CIL
CEDAR
• 52 characters (classes) of isolated and labeled English
handwritten characters
• 19145 characters for training
MNIST
• 10 classes of isolated and labeled handwritten digits
• 60000 digits for training
• 10000 digits for testing
Experimental Results
CIL
CIL Database
[1] - zones & projections 91.61%
[3] - Kavalieratou et.al 88.62%
[2] - dimensionality reduction 92.05%
[4] – DP (no sub-pixel) 93.21%
[5] – DP (sub-pixel) 93.65%
[6] – DP (sub-pixel and iteration of step 1 95.63%
of the classification procedure)
Experimental Results
CEDAR
CEDAR Character Database (52 Classes)
Uppercase Lowercase Overall Recognition
Characters Characters Rate
YAM[7] NA NA 75.70%
KIM [8] NA NA 73.25%
GAD[9] 79.23% 70.31% 74.77%
DP[6] 86.17% 84.05% 85.11%
[7] H. Yamada and Y. Nakano, "Cursive Handwritten Word Recognition Using Multiple Segmentation Determined
by Contour Analysis", IECIE Transactions on Information and System, Vol. E79-D. pp. 464-470, 1996.
[8] F. Kimura. N. Kayahara. Y. Miyake and M. Shridhar, "Machine and Human Recognition of Segmented
Characters from Handwritten Words", International Conference on Document Analysis and Recognition
(ICDAR '97), Ulm, Germany, 1997, pp. 866-869.
[9] P. D. Gader, M. Mohamed and J-H. Chiang. "Handwritten Word Recognition with Character and
Inter-Character Neural Networks", IEEE Transactions on System, Man. and Cybernetics-Part B: Cybernetics,
Vol. 27, 1997, pp. 158-164.
Experimental Results
CEDAR
CEDAR
Uppercase Characters Lowercase Characters
(26 Classes) (26 Classes)
# Train # Test Recogni- # Train # Test Recogni-
Patterns Patterns tion Rate Patterns Patterns tion Rate
[10] M. Blumenstein, X.Y. Liu, B. Verma, "A modified direction feature for cursive character recognition",
IEEE International Joint Conference on Neural Networks, Vol.4, pp. 2983 – 2987, 2007.
Experimental Results
[11] S. Singh and M. Hewitt, "Cursive Digit and Character Recognition on Cedar Database". International
Conference on Pattern Recognition, (ICPR 2000), Barcelona, Spain. 2000, pp. 569-572.
[12] F. Camastra and A. Vinciarelli. "Combining Neural Gas and Learning Vector Quantization for Cursive
Character Recognition", Neurocomputing. vol. 51. 2003, pp. 147-159.
Experimental Results
MNIST
POLYTIMO
Handwritten Database (HW)
• 51 characters (classes) of Greek historical
handwritten characters
• 5407 characters for training
• 1351 characters for testing
POLYTIMO
TW-Database HW-Database
[1] - zones & projections 95.44% 94.62%
[5] – DP (sub-pixel) 97.71% 94.51%
[6] – DP (sub-pixel and
iteration of step 1 of the 98.24% 95.21%
classification procedure)
Application to Historical Character
Recognition
IMPACT
Typewritten Database (TW-1)
• 53 characters (classes) of German
historical typerwritten characters
• 13181 characters for training
• 3246 characters for testing
[14] G. Vamvakas, B. Gatos and S.J.Perantonis, “Efficient Character/Word Recognition based on a Hierarchical
Classification Scheme” , International Journal on Document Analysis and Recognition (IJDAR) , under review.
Experimental Results
IAM
Handwritten Database
• 147 classes of English handwritten words
• 23171 words for training
• 3799 words for testing
IAM Database
GAT [15] 87.68%
Proposed Methodology 89.19%
(no slant correction)
Proposed Methodology 90.56%
(slant correction)
[15] B. Gatos, I. Pratikakis, A.L. Kesidis and S.J. Perantonis, "Efficient Off-Line Cursive Handwritten Word
Recognition", 10th International Workshop on Frontiers in Handwriting Recognition (IWFHR 2006), La Baule,
France, October 2006, pp. 121-125.
Word Spotting [14]
Merge all five lists in a list Q. Every word the documents’ set is now
represented in Q by five distances from the keyword. For each one we
choose to keep the minimum distance and remove the others,
resulting to Q΄.
Sort Q΄ in ascending order. Choose a threshold thr and remove all
instances above thr. List Q΄ now contains only the thr nearest words
to the word we want to be matched, which is the result of the
matching algorithm.
Experimental Results
[16] G. Vamvakas, B. Gatos, N. Stamatopoulos, S.J. Perantonis, "A Complete Optical Character Recognition
Methodology for Historical Documents" (DAS’08), 8th IAPR International Workshop on Document Analysis
Systems, pp.525-532, Nara, Japan, September 2008.
Historical Document Recognition
Methodology
k – Means clustering
Given a parameter set S, in order to evaluate the performance of
the clustering algorithm for every k between k1 and k2, the mean
squared distances from the centroids (within clusters sum of squares)
is calculated as follows:
The value of W (k) is low when the partition is good thus resulting
to compact clusters.
A quality of the segmentation result that
corresponds to a parameter set S is given as:
Journals
[1] G. Vamvakas, B. Gatos, S. J. Perantonis, “Handwritten Character Recognition
through Two-Stage Foreground Sub-Sampling”, Pattern Recognition, accepted
for publication
[2] G. Vamvakas, B. Gatos, S. J. Perantonis, “Efficient Character/ Word Recognition
based on a Hierarchical Classification Scheme” , International Journal on
Document Analysis and Recognition (IJDAR) , under review
Conferences
[1] G.Vamvakas, N. Stamatopoulos, B.Gatos, S.J.Perantonis, “Automatic
Unsupervised Parameter Selection for Character Segmentation” , accepted to
appear in the 9th IAPR International Workshop on Document Analysis Systems
(DAS 2010)
[2] G. Vamvakas, B. Gatos, S. J. Perantonis, “A Novel Feature Extraction and
Classification Methodology for the Recognition of Historical Documents”, 10th
International Conference on Document Analysis and Recognition (ICDAR’09), pp
491-495, Barcelona, Spain, July 2009
Publications
Conferences
[3] G. Vamvakas, B. Gatos, N. Stamatopoulos, S.J. Perantonis, "A Complete Optical
Character Recognition Methodology for Historical Documents," 8th IAPR
International Workshop on Document Analysis Systems (DAS’08), pp.525-532,
Nara, Japan, September 2008
[4] G. Vamvakas, B. Gatos, S. J. Perantonis, "Hierarchical Classification of
Handwritten Characters based on Novel Structural Features", 11th International
Conference on Frontiers in Handwriting Recognition (ICFHR'08), Montreal, Canada,
August 2008.
[5] G. Vamvakas, B. Gatos, S. Petridis, N. Stamatopoulos, "An Efficient Feature
Extraction and Dimensionality Reduction Scheme for Isolated Greek Handwritten
Character Recognition ", 9th International Conference on Document Analysis and
Recognition(ICDAR’07), vol.2, pp.1073-1077, 23-26 September 2007
[6] G. Vamvakas, N. Stamatopoulos, B Gatos, I. Pratikakis, S. J. Perantonis, "Greek
Handwritten Character Recognition", 11th Panhellenic Conference on Informatics
(PCI’07), pp. 343-352, Patras, Greece, May 2007
[7] G. Vamvakas, B. Gatos, I. Pratikakis, N. Stamatopoulos, A. Roniotis, S.J.
Perantonis, "Hybrid Off-Line OCR for Isolated Handwritten Greek Characters", The
Fourth IASTED International Conference on Signal Processing, Pattern Recognition,
and Applications (SPPRA’07), pp. 197-202, Innsbruck, Austria, February 2007