Conference paper, Year: 2017

A comparison study between MLP and Convolutional Neural Network models for character recognition

Abstract

Optical character recognition (OCR) systems are designed to operate on text contained in scanned documents and images. They comprise text detection and character recognition, in which characters are first described and then classified. In the classification step, characters are identified from their feature or template descriptions by a given classifier. In this context, we have proposed the unified character descriptor (UCD) to represent characters by their features, with classification performed by matching. This recognition scheme achieves good OCR accuracy on homogeneous scanned documents; however, it cannot discriminate characters with high font variation and distortion. To improve recognition, classifiers based on neural networks can be used. The multilayer perceptron (MLP) achieves high recognition accuracy when trained robustly. Moreover, the convolutional neural network (CNN) is currently gaining popularity for its high performance. However, both CNN and MLP may suffer from a large computational cost in the training phase. In this paper, we compare MLP and CNN. We provide the MLP with the UCD descriptor and an appropriate network configuration. For the CNN, we employ the convolutional network designed for handwritten and machine-printed character recognition (LeNet-5) and adapt it to support 62 classes, including both digits and characters. In addition, GPU parallelization is studied to speed up both the MLP and CNN classifiers. Based on our experiments, we demonstrate that the real-time CNN used is 2x more relevant than the MLP when classifying characters.
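To make the LeNet-5 adaptation mentioned in the abstract more concrete, the following is a minimal sketch of the layer shapes and parameter counts of a LeNet-5-style network with a 62-way output. It is not the authors' implementation. It assumes the common modern simplification of LeNet-5 (32x32 grayscale input, fully connected C3 layer, parameter-free pooling, plain dense output layer), and it assumes the 62 classes are the 10 digits plus 26 uppercase and 26 lowercase letters. The names `conv_out`, `lenet5_summary`, `total_params`, and `CLASSES` are hypothetical.

```python
import string

# Assumed class set: 10 digits + 26 uppercase + 26 lowercase letters = 62.
CLASSES = string.digits + string.ascii_uppercase + string.ascii_lowercase


def conv_out(size, kernel, stride=1):
    """Spatial size after an unpadded convolution or pooling window."""
    return (size - kernel) // stride + 1


def lenet5_summary(num_classes=len(CLASSES), input_size=32, in_channels=1):
    """Return a list of (layer name, output shape, trainable parameters).

    Assumed layout (simplified LeNet-5, not taken from the paper):
    C1 conv 6@5x5, S2 pool 2x2, C3 conv 16@5x5, S4 pool 2x2,
    C5 conv 120@5x5, F6 dense 84, OUT dense num_classes.
    """
    layers = []
    size, channels = input_size, in_channels
    feature_layers = [
        ("C1", "conv", 6, 5, 1),
        ("S2", "pool", None, 2, 2),
        ("C3", "conv", 16, 5, 1),
        ("S4", "pool", None, 2, 2),
        ("C5", "conv", 120, 5, 1),
    ]
    for name, kind, out_channels, kernel, stride in feature_layers:
        size = conv_out(size, kernel, stride)
        if kind == "conv":
            # One kernel per (input channel, output channel) pair, plus one bias per output channel.
            params = out_channels * (kernel * kernel * channels + 1)
            channels = out_channels
        else:
            params = 0  # pooling is treated as parameter-free in this sketch
        layers.append((name, (channels, size, size), params))

    features = channels * size * size
    for name, units in [("F6", 84), ("OUT", num_classes)]:
        # Dense layer: one weight per input feature plus one bias, per unit.
        layers.append((name, (units,), units * (features + 1)))
        features = units
    return layers


def total_params(layers):
    """Sum the trainable parameters over all layers."""
    return sum(params for _, _, params in layers)


if __name__ == "__main__":
    for name, shape, params in lenet5_summary():
        print(f"{name:>3}  shape={shape}  params={params}")
    print("total:", total_params(lenet5_summary()))
```

Under these assumptions, moving from 10 to 62 classes only changes the final dense layer, so the convolutional feature extractor keeps the same shapes and weights layout.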
Main file
cnn2.pdf (881.33 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01525504, version 1 (21-05-2017)

Identifiers

HAL Id: hal-01525504
DOI: 10.1117/12.2262589

Cite

Syrine Ben Driss, Mahmoud Soua, Rostom Kachouri, Mohamed Akil. A comparison study between MLP and Convolutional Neural Network models for character recognition. SPIE Conference on Real-Time Image and Video Processing, Apr 2017, Anaheim, CA, United States. ⟨10.1117/12.2262589⟩. ⟨hal-01525504⟩
737 views
12957 downloads