Handwritten Sindhi Character Recognition Using Neural Networks

  • Shafique Ahmed Awan Department of Information Technology, Quaid-e-Awam University of Engineering, Sciences and Technology, Nawabshah.
  • Zahid Hussain Abro Department of Information Technology, Quaid-e-Awam University of Engineering, Sciences and Technology, Nawabshah.
  • Akhtar Hussain Jalbani Department of Information Technology, Quaid-e-Awam University of Engineering, Sciences and Technology, Nawabshah.
  • Dil Nawaz Hakro Institute of Information and Communication Technology, University of Sindh, Jamshoro.
  • Maryam Hameed Department of Information Technology, Quaid-e-Awam University of Engineering, Sciences and Technology, Nawabshah

Abstract

OCR (OpticalCharacter Recognition) is a technology in which text image is used to understand and write text by machines. The work on languages containing isolated characters such as German, English, French and others is at its peak. The OCR and ICR (Intelligent Character Recognition) research in Sindhi script is currently at in starting stages and not sufficient work have been cited in this area even though Sindhi language is rich in culture and history. This paper presents one of the initial steps in recognizing Sindhi handwritten characters. The isolated characters of Sindhi script written by thesubjects have been recognized. The various subjects were asked to write Sindhi characters in unconstrained form and then the written samples were collected and scanned through a flatbed scanner. The scanned documents were preprocessedwith the help of binary conversion, removing noise by pepper noise and the lines were segmented with the help of horizontal profile technique. The segmented lines were used to extract characters from scanned pages.This character segmentation was done by vertical projection. The extracted characters have been used to extract features so that the characters can be classified easily. Zoning was used for the feature extraction technique. For the classification, neural network has been used. The recognized characters converted into editable text with an average accuracy of 85%.

Published
Jan 1, 2018
How to Cite
AWAN, Shafique Ahmed et al. Handwritten Sindhi Character Recognition Using Neural Networks. Mehran University Research Journal of Engineering and Technology, [S.l.], v. 37, n. 1, p. 6, jan. 2018. ISSN 2413-7219. Available at: <http://publications.muet.edu.pk/index.php/muetrj/article/view/122>. Date accessed: 17 jan. 2018.