Abstract/Details

Integrating clustering with page ranking


2005 2005

Other formats: Order a copy

Abstract (summary)

The enormous growth in the number of documents in the World Wide Web increases the need for improved link navigation and analysis. One of the important techniques of analyzing the web link structure is page rank computation. The focus of this thesis is on studying the conventional page ranking algorithm and designing a new page ranking algorithm by incorporating clustering technique into it.

The eye tracking study conducted by search marketing firm Enquiro and Did-it and eye tracking firm Eyetool has shown that the vast majority of eye tracking activity during a search happens in a triangle at the top of a search results page indicating that the areas of maximum interest create a "Golden Triangle". The results from this study has been adopted and used to define a clustering technique which is incorporated into the conventional page ranking algorithm. This new page ranking algorithm has been implemented and results are analyzed which reveal that the number of iterations required in the conventional method can be reduced thus reducing the computation time. The experimental results are based on the SUNYIT.EDU web graph which consists of 1000 nodes and approximately 24,300 links. This web graph has been crawled using an optimized crawler designed for this particular system implementation.

The comparison of search results for a search query based on the page ranks computed by both page ranking techniques shows that the relative position of web pages are the same. This means that if we incorporate clustering into page ranking, the number of iterations can be reduced thus saving cost of computation and time. All this is achieved without compromising with the quality of search results and thus the over all user experience with the search engine.

Indexing (details)


Subject
Computer science
Classification
0984: Computer science
Identifier / keyword
Applied sciences
Title
Integrating clustering with page ranking
Author
Mahich, Rajendra
Number of pages
53
Publication year
2005
Degree date
2005
School code
1026
Source
MAI 44/03M, Masters Abstracts International
Place of publication
Ann Arbor
Country of publication
United States
ISBN
9780542397455, 0542397455
Advisor
Sengupta, Sam
University/institution
State University of New York Institute of Technology
University location
United States -- New York
Degree
M.S.
Source type
Dissertations & Theses
Language
English
Document type
Dissertation/Thesis
Dissertation/thesis number
1429754
ProQuest document ID
305379602
Copyright
Database copyright ProQuest LLC; ProQuest does not claim copyright in the individual underlying works.
Document URL
http://search.proquest.com/docview/305379602
Access the complete full text

You can get the full text of this document if it is part of your institution's ProQuest subscription.

Try one of the following:

  • Connect to ProQuest through your library network and search for the document from there.
  • Request the document from your library.
  • Go to the ProQuest login page and enter a ProQuest or My Research username / password.