An Automated Web Structure-based Method for Predicting the Importance of a Webpage

Dr. Syed Tauhid Zuhori

Peer Review: Double Blind
Published online 21 September 2022

− Abstract

This article develops a method for estimating the importance of web pages without using web browser data or invading users’ privacy; instead, it relies on the structure of the website. To this end, we propose a novel method that takes webpage content as input and automatically produces a score for each page. First, we extract content from a web page in real time. We then consider two factors based on the website structure: (1) “What is the minimum number of clicks needed to reach a web page within a website?” and (2) “How is a web page linked with other web pages in the website?” We train our model with a learning method, using the “web page views” results generated by “Google Analytics” and “Similar Web”. Experiments and case studies on the world’s most popular websites show that our method can produce very effective results in real time.
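The abstract does not give the exact formulation, so the following is only a minimal sketch of the two structural factors, assuming the website is represented as a directed graph of internal links. The function names `min_clicks` and `eigenvector_centrality`, the toy `site` dictionary, and the choice of a shifted power iteration are illustrative assumptions; eigenvector centrality itself is taken from the article's keywords. The learning step that maps these features to page views is not shown.

```python
from collections import deque

# Hypothetical toy site: each page maps to the pages it links to.
site = {
    "home": ["about", "products"],
    "about": ["home"],
    "products": ["home", "item"],
    "item": ["home", "products"],
}

def min_clicks(links, home):
    """Factor 1: fewest clicks from `home` to each reachable page (breadth-first search)."""
    depth = {home: 0}
    queue = deque([home])
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in depth:
                depth[target] = depth[page] + 1
                queue.append(target)
    return depth

def eigenvector_centrality(links, iterations=200, tol=1e-9):
    """Factor 2: a page scores highly when highly scored pages link to it.

    Power iteration over in-links. Each step keeps the previous score
    (equivalent to adding the identity to the adjacency matrix), an assumed
    stabilisation that helps the iteration settle on directed graphs.
    """
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    score = {page: 1.0 / len(pages) for page in pages}
    for _ in range(iterations):
        new = dict(score)  # identity shift: start from the previous scores
        for source, targets in links.items():
            for target in targets:
                new[target] += score[source]
        norm = sum(value * value for value in new.values()) ** 0.5
        new = {page: value / norm for page, value in new.items()}
        if max(abs(new[page] - score[page]) for page in pages) < tol:
            return new
        score = new
    return score

clicks = min_clicks(site, "home")
centrality = eigenvector_centrality(site)
for page in sorted(site):
    print(page, clicks[page], round(centrality[page], 3))
```

In this toy graph the home page is both the shallowest page and the most central one, since every other page links back to it. A trained model would presumably combine features like these with observed page views, but the abstract does not say how.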

− Conflict of Interest

The authors declare no conflict of interest.

− Ethical Approval

Not applicable.

− Data Availability

The datasets used in this study are openly available at [repository link] and the source code is available on GitHub at [GitHub link].

− Funding

This work did not receive any external funding.

Classification

DDC Code: 005.2762
LCC Code: QA76.73.J39

Version of record: v1.0

Issue date: 21 September 2022

Language: en

Open Access

Research Article

CC-BY-NC 4.0

LJER Volume 22 Issue 6, Pg. 19-46

Keywords

Eigen Vector Centrality; learning; Page Views; Website Structure
