<?xml version="1.0" encoding="UTF-8"?>
<article article-type="research-article" xml:lang="en" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher">london-journal-of-engineering-research</journal-id>
<journal-title-group>
<journal-title>London Journal of Engineering Research</journal-title>
</journal-title-group>
<issn publication-format="print">2631-8474</issn>
<issn publication-format="electronic">2631-8482</issn>
<publisher><publisher-name>JournalsPress</publisher-name></publisher>
<self-uri xlink:href="https://journalspress.com/journal-seo-export/jats/83780.xml" />
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">83780</article-id>
<title-group>
<article-title>An Automated Web Structure-based Method for Predicting the Importance of a Webpage</article-title>
</title-group>
<volume>22</volume>
<issue>6</issue>
<fpage>19</fpage>
<lpage>46</lpage>
<abstract><p>The aim of this article is to develop a method to find the importance of web pages without using web browser data or invading the privacy of users. Rather, it works on the structure of a website. To achieve this goal, we propose a novel method that can take webpage content as input and produce a score for each page automatically. Initially, we extract content from a web page in real-time. Subsequently, we consider two important factors based on the website structure: (1) “What is the minimum number of clicks needed to access web pages in a website?” and (2) “How a web page is linked with other web pages in a website?” We use a learning method to train our model by using the “web page views” results generated by “Google Analytics” and “Similar Web”. Experiments and Case studies on the world’s most popular websites show that our method can produce very effective results in real-time.</p></abstract>
<self-uri content-type="pdf" xlink:href="http://journalspress.com/LJER_Volume22/An-Automated-Web-Structure-Based-Method-for-Predicting-the-Importance-of-a-Webpage.pdf" />
<self-uri content-type="html" xlink:href="https://journalspress.com/an-automated-web-structure-based-method-for-predicting-the-importance-of-a-webpage/" />
</article-meta>
</front>
<body>
<sec>
<title>Full Text</title>
<p>The aim of this article is to develop a method to find the importance of web pages without using web browser data or invading the privacy of users. Rather, it works on the structure of a website. To achieve this goal, we propose a novel method that can take webpage content as input and produce a score for each page automatically. Initially, we extract content from a web page in real-time. Subsequently, we consider two important factors based on the website structure: (1) “What is the minimum number of clicks needed to access web pages in a website?” and (2) “How a web page is linked with other web pages in a website?” We use a learning method to train our model by using the “web page views” results generated by “Google Analytics” and “Similar Web”. Experiments and Case studies on the world’s most popular websites show that our method can produce very effective results in real-time.</p>
</sec>
</body>
</article>