Login (DCU Staff Only)
Login (DCU Staff Only)

DORAS | DCU Research Repository

Explore open access research and scholarly works from DCU

Advanced Search

Word matching using single closed contours for indexing handwritten historical documents

Adamek, Tomasz, O'Connor, Noel E. orcid logoORCID: 0000-0002-4033-9135, Murphy, Noel and Smeaton, Alan F. orcid logoORCID: 0000-0003-1028-8389 (2007) Word matching using single closed contours for indexing handwritten historical documents. International Journal on Document Analysis and Recognition, 9 (2-4). pp. 153-165. ISSN 1433-2825

Abstract
Effective indexing is crucial for providing convenient access to scanned versions of large collections of historically valuable handwritten manuscripts. Since traditional handwriting recognizers based on optical character recognition (OCR) do not perform well on historical documents, recently a holistic word recognition approach has gained in popularity as an attractive and more straightforward solution (Lavrenko et al. in proc. document Image Analysis for Libraries (DIAL’04), pp. 278–287, 2004). Such techniques attempt to recognize words based on scalar and profile-based features extracted from whole word images. In this paper, we propose a new approach to holistic word recognition for historical handwritten manuscripts based on matching word contours instead of whole images or word profiles. The new method consists of robust extraction of closed word contours and the application of an elastic contour matching technique proposed originally for general shapes (Adamek and O’Connor in IEEE Trans Circuits Syst Video Technol 5:2004). We demonstrate that multiscale contour-based descriptors can effectively capture intrinsic word features avoiding any segmentation of words into smaller subunits. Our experiments show a recognition accuracy of 83%, which considerably exceeds the performance of other systems reported in the literature.
Metadata
Item Type:Article (Published)
Refereed:Yes
Additional Information:The original publication is available at www.springerlink.com
Uncontrolled Keywords:Historical manuscripts; Holistic word recognition; Contour matching; Annotation; Indexing;
Subjects:Computer Science > Information retrieval
DCU Faculties and Centres:Research Institutes and Centres > Centre for Digital Video Processing (CDVP)
Publisher:Springer Berlin / Heidelberg
Official URL:http://dx.doi.org/10.1007/s10032-006-0024-y
Use License:This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:450
Deposited On:21 May 2008 by DORAS Administrator . Last Modified 08 Nov 2018 12:29
Documents

Full text available as:

[thumbnail of ijdar_2007.pdf]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
1MB
Downloads

Downloads

Downloads per month over past year

Archive Staff Only: edit this record