Efficient multiscale and multifont optical character recognition system based on robust feature description - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

Efficient multiscale and multifont optical character recognition system based on robust feature description

Résumé

Optical Character Recognition (OCR) is the process of translating images of text into a comprehensible machine format. Generally, an OCR system is composed of binariza-tion, segmentation and recognition stages. Given an extracted binary character, the recognition stage ensures its description and decides its corresponding ASCII code. In this paper, we propose a new OCR system that aims to high speed, Multiscale and Multifont character recognition. Our proposal is based essentially on robust description using a new Unified Character Descriptor (UCD). In addition, a character type-face and font-size recognition is performed to choose the adequate template for faster matching process. Obtained OCR Accuracy of our proposed System is 1.5x higher then that reached by Tesseract on the LRDE dataset.
Fichier principal
Vignette du fichier
IPTA15_OCR(accepté).pdf (1.48 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01309987 , version 1 (03-05-2016)

Identifiants

Citer

Mahmoud Soua, Rostom Kachouri, Mohamed Akil. Efficient multiscale and multifont optical character recognition system based on robust feature description. 5th International Conference on Image Processing Theory, Tools and Applications , Nov 2015, Orléans, France. ⟨10.1109/IPTA.2015.7367214⟩. ⟨hal-01309987⟩
119 Consultations
713 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More