A comparison study between MLP and Convolutional Neural Network models for character recognition

Abstract : Optical Character Recognition (OCR) systems have been designed to operate on text contained in scanned documents and images. They include text detection and character recognition in which characters are described then classified. In the classification step, characters are identified according to their features or template descriptions. Then, a given classifier is employed to identify characters. In this context, we have proposed the unified character descriptor (UCD) to represent characters based on their features. Then, matching was employed to ensure the classification. This recognition scheme performs a good OCR Accuracy on homogeneous scanned documents, however it cannot discriminate characters with high font variation and distortion. 3 To improve recognition, classifiers based on neural networks can be used. The multilayer perceptron (MLP) ensures high recognition accuracy when performing a robust training. Moreover, the convolutional neural network (CNN), is gaining nowadays a lot of popularity for its high performance. Furthermore, both CNN and MLP may suffer from the large amount of computation in the training phase. In this paper, we establish a comparison between MLP and CNN. We provide MLP with the UCD descriptor and the appropriate network configuration. For CNN, we employ the convolutional network designed for handwritten and machine-printed character recognition (Lenet-5) and we adapt it to support 62 classes, including both digits and characters. In addition, GPU parallelization is studied to speed up both of MLP and CNN classifiers. Based on our experimentations, we demonstrate that the used real-time CNN is 2x more relevant than MLP when classifying characters.
Type de document :
Communication dans un congrès
SPIE Conference on Real-Time Image and Video Processing, Apr 2017, Anaheim, CA, United States. SPIE Proceedings 10223, Real-Time Image and Video Processing 2017. 〈10.1117/12.2262589〉
Liste complète des métadonnées

Littérature citée [25 références]  Voir  Masquer  Télécharger

https://hal-upec-upem.archives-ouvertes.fr/hal-01525504
Contributeur : Rostom Kachouri <>
Soumis le : dimanche 21 mai 2017 - 15:05:16
Dernière modification le : jeudi 5 juillet 2018 - 14:29:07
Document(s) archivé(s) le : mercredi 23 août 2017 - 10:51:09

Fichier

cnn2.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Citation

Syrine Ben Driss, M Soua, Rostom Kachouri, Mohamed Akil. A comparison study between MLP and Convolutional Neural Network models for character recognition. SPIE Conference on Real-Time Image and Video Processing, Apr 2017, Anaheim, CA, United States. SPIE Proceedings 10223, Real-Time Image and Video Processing 2017. 〈10.1117/12.2262589〉. 〈hal-01525504〉

Partager

Métriques

Consultations de la notice

513

Téléchargements de fichiers

5284