University of Wolverhampton
Browse
Collection All
bullet
bullet
bullet
bullet
Listed communities
bullet
bullet
bullet
bullet
bullet
bullet
bullet
bullet
bullet
bullet
bullet
bullet
bullet

Wolverhampton Intellectual Repository and E-Theses > School of Technology > School of Computing and IT > Statistical Cybermetrics Research Group  > Can Google's PageRank be used to find the most important academic Web pages?

Please use this identifier to cite or link to this item: http://hdl.handle.net/2436/3139
    Del.icio.us     LinkedIn     Citeulike     Connotea     Facebook     Stumble it!



Title: Can Google's PageRank be used to find the most important academic Web pages?
Authors: Thelwall, Mike
Citation: Journal of Documentation, 59(2): 205-217
Publisher: MCB UP Ltd
Issue Date: 2003
URI: http://hdl.handle.net/2436/3139
DOI: 10.1108/00220410310463491
Additional Links: http://www.emeraldinsight.com/10.1108/00220410310463491
Abstract: Google's PageRank is an influential algorithm that uses a model of Web use that is dominated by its link structure in order to rank pages by their estimated value to the Web community. This paper reports on the outcome of applying the algorithm to the Web sites of three national university systems in order to test whether it is capable of identifying the most important Web pages. The results are also compared with simple inlink counts. It was discovered that the highest inlinked pages do not always have the highest PageRank, indicating that the two metrics are genuinely different, even for the top pages. More significantly, however, internal links dominated external links for the high ranks in either method and superficial reasons accounted for high scores in both cases. It is concluded that PageRank is not useful for identifying the top pages in a site and that it must be combined with a powerful text matching techniques in order to get the quality of information retrieval results provided by Google.
Type: Article
Language: en
Description: Main article
Keywords: Algorithms
Effectiveness
Information retrieval
Universities
Internet
ISSN: 00220418,00000000
Appears in Collections: Statistical Cybermetrics Research Group
Statistical Cybermetrics Research Group

Files in This Item:
File Description Size Format View/Open
2003 JDOC Google PageRank preprint.pdf268KbAdobe PDFThumbnail
View/Open

All Items in WIRE are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Fairtrade - Guarantees a better deal for Third World Producers

University of Wolverhampton, Wulfruna Street, Wolverhampton, WV1 1LY

Course enquiries: 0800 953 3222, General enquiries: 01902 321000,
Email: enquiries@wlv.ac.uk | Freedom of Information | Disclaimer and copyright | Website feedback | The University as a charity

OR Logo Powered by Open Repository | Cookies