Artificial Urdu Text Detection and Localization from Individual Video Frames

  • Salahuddin Unar School of Computer Science and Technology, Faculty of Electronic Information and Electrical Engineering, Dalian University of Technology, Dalian 116024, China.
  • Akhtar Hussain Jalbani Department of Information Technology, Quaid-e-Awam University of Engineering, Science and Technology, Nawabshah
  • Muhammad Moazzam Jawaid Department of Computer System Engineering, Mehran University of Engineering and Technology, Jamshoro
  • Mohsin Shaikh Quaid-e-Awam University College of Engineering, Science and Technology, Larkano.
  • Asghar Ali Chandio School of Engineering and Information Technology, University of New South Wales, Canberra, Australia


In current era of technology, information acquisition from images and videos become most important task due to the rapid development of data mining and machine learning.The information can be either textual, visual, or combination of these. Text appearing in images or videos is a significant source of information and plays a vital role to perceive it. Developing a unified method to detect the text is hard, as textual properties (i.e. font, size, color, illumination, orientation, etc.) may vary with the complex background. So far, multimedia and computer vision community unable yet to standardize any ideal approach to extract the text smoothly. In this paper, a novel method is proposed to detect and localize artificial Urdu text in individual video frames. Firstly, Sobel and Canny edge detection operators are applied to input frame and are merged with MSER (Maximally Stable Extremal Region) detected regions. Next, geometric constraints are applied to eliminate obvious non-text regions with large and small variations. Further refining of non-text regions is achieved by stroke width transform. SVM (Support Vector Machine) classifier is trained to classify text and non-text objects. Finally, bounding boxes are used to localize the text.Experimental results show that the proposed method is robust and efficient than state-of-the-art methods.

Apr 1, 2018
How to Cite
UNAR, Salahuddin et al. Artificial Urdu Text Detection and Localization from Individual Video Frames. Mehran University Research Journal of Engineering and Technology, [S.l.], v. 37, n. 2, p. 429-438, apr. 2018. ISSN 2413-7219. Available at: <>. Date accessed: 15 july 2018. doi:
This is an open Access Article published by Mehran University of Engineering and Technolgy, Jamshoro under CCBY 4.0 International License