Bringing Shape to Textual Data – A Feasible Demonstration

  • Anoud Shaikh, Institute of Information and Communication Technologies, Mehran University of Engineering and Technology, Jamshoro, Pakistan
  • Naeem Ahmed Mahoto, Institute of Information and Communication Technologies, Mehran University of Engineering and Technology, Jamshoro, Pakistan
  • Mukhtiar Ali Unar, Institute of Information and Communication Technologies, Mehran University of Engineering and Technology, Jamshoro, Pakistan

Abstract

The Internet has revolutionized the communication paradigm. This has led to an immense amount of unstructured data (i.e. textual data), which is a major source of useful knowledge about people in several application domains. TM (Text Mining) extracts high-quality information and discovers knowledge by identifying patterns and relationships in textual data. This field has attracted great attention from the research community. As a result, several attempts have been made to propose, introduce and refine techniques for uncovering knowledge from text data. This study aims at: (1) presenting existing TM techniques in the scientific literature, (2) reporting challenges/issues and gaps that still need attention, and (3) proposing a framework to bring shape to textual data. A prototype has been developed to demonstrate the effectiveness and potential worth of the proposed approach by showing how unstructured data (i.e. news articles in this study) is brought into a shape that represents interesting knowledge. The proposed framework implements basic NLP (Natural Language Processing) functions in combination with AYLIEN API (Application Programming Interface) functions. The results reveal how events, celebrities and popular news items have been covered in the electronic media, and also represent the subjectivity of topical news events. The news coverage trends highlight the significance of daily news events, which may help in gaining insight into the media groups.

Published
Oct 1, 2019
How to Cite
SHAIKH, Anoud; MAHOTO, Naeem Ahmed; UNAR, Mukhtiar Ali. Bringing Shape to Textual Data – A Feasible Demonstration. Mehran University Research Journal of Engineering and Technology, [S.l.], v. 38, n. 4, p. 901-914, oct. 2019. ISSN 2413-7219. Available at: <https://publications.muet.edu.pk/index.php/muetrj/article/view/1237>. Date accessed: 17 nov. 2019.
This is an Open Access article published by Mehran University of Engineering and Technology, Jamshoro, under the CC BY 4.0 International License.