Enhancing Knowledge and Bounce Rate in SERPs Using MicroData

Internet has revolutionized the human life. SEs (Search Engines) are one of the major tools being used for finding information over the Internet. SEs enlist the information into links as per relevance to the searched query. A searcher usually visits the top web links retrieved on SERPs (Search Engine Results Pages) in response to a search query. With the evolving nature of Internet and the increasing number of competitors; it is hard to maintain high ranking in SERPs even for professional correspondents. However, correspondents can apply the techniques of web micro-data to achieve high CTR (Click through Rate) in SERPs. Ranking in major SEs is still a critical factor, although in certain cases such as movies, books, recipes rich snippets proved profitable for webmasters. This study aims to address the gap in micro-data moving from top category such as Animals to their limited scope. Animals with information such as name, price, category will have high CTR and hence more user satisfaction for specified result will lead to high ranking in SERPs.


INTRODUCTION
Internet searching has now become a routine activity of many people around the globe.Searches are executed due to customary interest, originated through a live event, news, social report or new scientific discovery [1,2].To extract related data set from the Internet, SE queries are applied.Searched query results have revealed that SE is a primary element in extracting the true information.For example people search for facts concerned to health on SE for numerous reasons.Queries linked to recognition of infectious diseases are requested through SE that provides valuable health-related information, might also yield useful health statistics [3].
Lali et.al. [3] show that epidemics trend can be detected with information available on Web.They used SE queries and twitter tweets to detect influenza epidemics in a particular geographical area.Early detection of disease activity, when followed by a rapid response, can reduce the impact of both seasonal and pandemic influenza.
Eysenbach [4] aimed to determine the prevalence of health-related searches on the web by analyzing search terms entered by people into popular SEs.They also made some preliminary attempts in qualitatively describing and classifying these searches.Occasional difficulties in determining what constitutes a health-related search led us to propose and validate a simple method to automatically classify a search string as health-related.
Information can be extracted from the different SEs in many ways, either by typing a topic name, phrase, keyword or an appropriate question.Most popular SEs are Google, Yahoo, Bing and Yandex which fulfill the searching requirements [5].Since the last few decades, SE developers are working to provide with more accurate, reliable and relevant information.
Keeping this in mind, web masters and bloggers should acquire the skills to boost the designing and learn marketing tactics to attract the visitors to their website.Most Internet searching is performed after office timing or majorly on weekends [6].According to iprospect more than 60% users chosen to hit the top results appeared on the initial page of SERPs.Only 5% searchers go beyond 2 nd page and just on 2% visited beyond 3 rd Page [7].
Hence, it is important to maintain top position in SERPs for continuous flow of websites visitors to run a business through it.In modern era, where business grooming techniques like online trade, marketing and advertisement have a great impact to increase the revenue from the business [8].It has been recognized as a better way to generate revenue by advertising via websites.Many professionals are working to develop number of websites for advertising businesses.
Each website is developed in a particular language and style influenced by its developer/user's geographical location and civilization.However, it develops a sense of competition among online businesses that attracts the most visitors to the website.

SE optimization is the technique of optimizing a website
to gain a distinguished ranking in SERPS.Initially its idea was proposed by Sullivan and is progressing since then.

SEM (Search Engine Marketing) provides techniques to
increase the ranking of a website [9].Therefore, different SEs use various searching algorithms.
Every SE has adopted its own ranking criteria to present the specific result higher in the SERPs.With the passage of time, the IT industry has become used to these techniques.
Meanwhile, the developers have acquired the knowledge to improve the algorithms.Hence, these algorithms improved the effectiveness of the results regarding the relevant websites ranking.
However, the continuous improvements of these algorithms were not properly communicated to the industry for verifying their practical implementation.Therefore, one can not rely on these but can focus on their basic context, available material and the way it contains information to address his search queries.
SEs intend to give more importance to those pages where relevant information is present.For this, SEs such as Google, Yahoo, Microsoft, Pinterest, Yandex has introduced structured data on the Internet [10,11].
( This article focused to find out the gap in the existing microdata, to give more organic results to users.These rich snippets are not only used for ranking signals, but they will be used in future for ranking in SERPs [12].Section 2 presents the related work and important approaches and techniques.
Section 3 discusses the methodology and section 4 concludes the article.While entering a search query mostly SEs help the users with the topic prediction to assist him [15].

RELATED WORK
SEs are mainly comprised of four components; crawler, indexer, database and GUI (Graphical User Interface).
An exceptional SE should have four features; quick, correct, easy and strong.SE optimization is a broader term which covers a huge area on the Web.When a user inputs a query in the form of keywords on the SE, the results listed are organic in nature.Excluding those results having the adverts sponsored links (pay-perclick ads) [16].
Sergey Brine and Larry Page designed a link analysis algorithm known as "PageRank", which is used by SEs.The PageRank values are pre-calculated and stored for all pages acknowledged to IR system.Hence, every web page has its own rank free from the query terms.The page rank score shows the importance of different web pages having the same query terms [17,18].The maximum value which is given by the SE e.g.Google toolbar is 10.The method to calculate the page rank is shown in Equation ( 1) [19].
A website could only be visible and optimized when it is ranked accordingly and positioned among the first page of the search results.SEO (Search Engine Optimization) is a method and set of concepts, tools and procedures applied to rank websites and place them at first page in the organic search results.The various SEO techniques that can be used, are discussed in the following sections along with their effects.

On-Site Optimization
This is also referred as on-page optimization, performed directly on the page with the code of that website [20].It is further divided into a number of techniques such as code, content and link/directory structure.

Page Title
The most important and favorable factor for the SE results is the relevancy of title of web pages with the title tags.The position of the title tag in the code section is at the <head> tag section of your web coding document and that is a key way of utilizing meta information for increasing relevancy and rank of the website [21].The topic and other major keywords are also shown with the help of this tag.That is the result of combining together the string and the words, including in the title of HTML (Hyper Text Markup Language) document.

Description Tag
This tag contains HTML attributes that provide a useful explanation of the contents in a web page.The SE results in the (SERP) contain this information right after the title of the web page.In some cases, without opening the entire website, it can be a great way of providing information about the contents of the website for increasing its traffic and marketing [22].

Meta Keywords Tag
The words which are the most relevant and directly related to the content of website title are included in the Keywords Tag along with the body contents.Embedding the most important and relevant keywords first can positively improve the SERP [23].

Using Sitemaps
All the files and URL's used in a website are saved in a list with the help of sitemaps like ISTIMAGE.JPEG, FILE.PDF.
The technique of how SE prioritizes the web pages and how they crawl the website is accomplished with the help of sitemaps.It also offers robots to efficiently crawl the website [24].

Directory/URL Structure
This structure is simple, logically linked with the contents of the website and also easily understandable by the user for efficiency purposes.For example a URL like http:// www.animalshleterinpakistan.com/freeservices; helps you to decide but a URL like http://www.yhissite.com/index.phpetc. leads towards confusion.Most importantly use punctuation in URLs [25].
All the relative techniques which have been discussed for the SEO focused and enlightened the ways of getting the web page higher position in the SE results.However, they did not emphasis on the approach; how the related content which best matches the query has to be shown in the proper position.SEs retain limited knowledge about the web page and its contents but there may be other inherent meanings inside every page which user understands when they read it.
Fig. 1 shows the results of a searched query phrase "Jaguar" in the SE Google.The search results, in response to the keyword "Jagur" were animal's pictures and its relevant information.In actual this was a "Jaguar USA Car company", so there were inherent meanings inside the website that could not be understood by the SE.
Therefore, it is important to address this problem by enhancing the web visibility using its internal structure and information stored in it.This article is focused on how microdata can solve this problem and motivated to find out the gap in the existing microdata to optimize the performance.By identifying the gap, we have participated in the international community as a standard platform.
The key objective is to attain the knowledge to get the maximum visibility in SERPs with the help of internal structure and information already present inside of the

METHODOLOGY
We use analytic approach for our study by using data along with the information that is already available to perform a thorough quantitative analysis.This aims at finding an immediate problem faced by the society or industrial/ business organization.We have proposed some new microdata which does not exist in the schema.orgdictionary [26].
The collaborative effort of the three widely used SEs, Google, Bing and Yahoo has created online data repository.The repository has all the major entities and their lot of subcategories.Whenever the web developers and other online business developers need to develop their websites can directly tag all the data using this dictionary.This online combined dictionary is available in the form of Schema.org.
More than 10 million websites use this schema.orgfor marking up their web pages [26].This online vocabulary provides a shared common platform through which the various web developers and entrepreneurs can markup their web pages.These markups will lead to a better understanding of the web contents inside the web page for the major SEs during their search for a particular query or phrase [26].
The information in the web page is used to create the Rich Snippets by major SEs which produce information in the search results for the user.For example lasagna recipe has 4.8 rating, and 9,542 reviews.Which is clearly indicator that the information present on this page is liked by users as shown in Fig. 2.

FIG. 2. SEARCH RESULTS OF LASAGNA RECIPE
These search results help the user to get the most relevant results so the CTR will be higher.Higher CTR is directly proportional to the revenue and user interests on the page [27].There are three techniques for implementing this technique for getting the underlying meanings.The syntax for using the major tags and their distribution within their subtypes is listed in Fig. 3. Starting from the tags having information about the item is enclosed inside Fig. 3.By implementing the "itemtype" for a recipe, it is linked up with the schema.orgvocabulary.Furthermore, in every inside tag of "itemtprop" the property of the item type is included.This results in directly linking that specified item type with its specific category shown in Fig. 3.
There are lot of specific fields and the global scope of the different tags.They could be used by directly joining their major scope and then moving down towards the limited scope, hence enclosing it within their markups [22].Starting from the most common type and moving towards the local business, the most limited scope of the types [26].For example, the first search result listed in Fig. 4 contains both a star rating and a publication date.
Both of these can be added using schema micro-data which is coded in Fig. 5.
Adding Schema markup to HTML tags, improves the way your page displays in SERPs by enhancing the rich snippets which are displayed beneath the page title.The schema.org is an online vocabulary available for marking up the tags but it is still under developed.The vocabulary of schema.org is enhanced by the contributions from the open community who are discussing the new entities that have not been included yet.We have contributed in it, by signing up as a member at the date of 21 st May, 2015 and then performed a detail study on all the available vocabularies to find the gaps.
We discovered that there are many parts still missing in available online dictionary for the webmasters.It restrain the benefits, due the maximum unavailability of the item, their types and their properties.We observed that there was a major category missing relating the "Animal terminology" despite of the fact that there are thousands of search queries against this phrase and keywords appear daily.Some of queries searched in year 2015 are listed in Table 1.We have collected this information using Google Adwords Keyword Planner [28].
There should be a better way to handle this "Animal terminology" through micro-data.The number of searches on the web master tool and the frequency of animal related queries have increased, therefore, it is important to address this with the help of micro-data format especially recommended by Google.
Proposal for the category of animals recommended on the date 24 th August, 2015 on GitHub shown in Fig. 6, which is an open community discussion group for the acceptance and releasing of new vocabularies in existing vocabularies.
Our proposal was considered initially for discussion by the different active members.These members include from different interest groups and fields of sub emerging types of web engineering throughout the out world listed in Fig. 7.  Our identified "Animal Terminology" proposal has been accepted.Proposed micro-data tags hierarchy is shown in Fig. 9.
Animals have been classified in to their respective categories.List of supposed micro-data for animals is listed in Table 2.

FIG. 1 .
FIG. 1. QUERY SEARCHED IN GOOGLE FOR INFORMATION RELATED TO JAGUAR The vocabularies are maintained and managed by open community groups under the platform of schema.org.The different communities discuss this emerging technique of enhancing the visibility of web pages.Discussions lead towards the addition of new vocabularies in the already published list.These discussions occur by two methods.(a) Using the mailing list public-schema@w3.org(b) Through github

FIG. 3 .
FIG. 3. EXAMPLE OF MICRO-DATA FOR RECIPE AND MOVIE detailed about this open-ended community could be acquired with the live web address at: https:// github.com/schemaorg/schemaorg/issues/746.Complete information and opinions received by different people is available at this link.The detailed discussion with the different members of the Git Hub community on the proposed topic of "Animal Terminology" also guided us to cover identified gap.The resulting acceptance of the proposal is shown in Fig.8.

TABLE 1 . ANIMALS RELATED QUERIES SEARCHED IN GOOGLE FROM JANUARY 2015 TO DECEMBER 2015 FIG
66. ISSUE# 746 FOR CONSIDERING ANIMAL TERMINOLOGY

TABLE 2 . SUPPOSED MICRODATA PROPERTIES FOR ANIMALS
FIG. 7. PARTICIPATION OF OTHER CONTRIBUTORS FIG. 8. ACCEPTANCE OF PROPOSAL #746 FIG. 9. CLASSIFICATIONS OF ANIMALS AND PROPOSED MICRO-DATA ACCORDING TO THEIR CLASS

Enhancing Knowledge and Bounce Rate in SERPs Using Micro-Data Mehran University Research Journal of Engineering & Technology, Volume 36, No. 4, October, 2017 [p-ISSN: 0254-7821, e-ISSN: 2413-7219] 973 4. CONCLUSION
In recent years, the SERP's have been augmented with new markup elements that introduce seamlessly additional semantic information Nowadays, good content is king and also leads to more likes and shares, tweets and as a result returns visitors to your website.Therefore, you have to do some extra things to maximize the SEO friendliness of your post, but most important is that user likes your content.The user must get what he is looking for.Ranking is still an important factor, but introducing micro-data from top to down regarding all necessary information a casual user is looking, will enhance CTR and definitely ranking.In the future we are interested to show some region oriented techniques to rank local in business in local SERPs.