Show simple item record

dc.contributor.authorThelwall, Mike
dc.contributor.authorVaughan, Liwen
dc.date.accessioned2006-08-23T14:38:24Z
dc.date.available2006-08-23T14:38:24Z
dc.date.issued2004
dc.identifier.citationThelwall, M. and Vaughan, L. (2004), "New versions of PageRank employing alternative Web document models", Aslib Proceedings, Vol. 56 No. 1, pp. 24-33. https://doi.org/10.1108/00012530410516840
dc.identifier.issn0001-253X
dc.identifier.doi10.1108/00012530410516840
dc.identifier.urihttp://hdl.handle.net/2436/4008
dc.descriptionThis is an accepted manuscript of an article published by Emerald Group Publishing Limited in Aslib Proceedings on 01/01/2004, available online: https://doi.org/10.1108/00012530410516840 The accepted version of the publication may differ from the final published version.
dc.description.abstractIntroduces several new versions of PageRank (the link based Web page ranking algorithm), based on an information science perspective on the concept of the Web document. Although the Web page is the typical indivisible unit of information in search engine results and most Web information retrieval algorithms, other research has suggested that aggregating pages based on directories and domains gives promising alternatives, particularly when Web links are the object of study. The new algorithms introduced based on these alternatives were used to rank four sets of Web pages. The ranking results were compared with human subjects’ rankings. The results of the tests were somewhat inconclusive: the new approach worked well for the set that includes pages from different Web sites; however, it does not work well in ranking pages that are from the same site. It seems that the new algorithms may be effective for some tasks but not for others, especially when only low numbers of links are involved or the pages to be ranked are from the same site or directory.
dc.formatapplication/pdf
dc.format.extent155829 bytes
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.publisherEmerald Group Publishing Limited
dc.relation.urlhttps://www.emerald.com/insight/content/doi/10.1108/00012530410516840/full/html
dc.subjectAlgorithmic languages
dc.subjectHypertext transfer protocol
dc.subjectInformation retrieval
dc.subjectPage description languages
dc.subjectSearch engines
dc.subjectWorld Wide Web
dc.titleNew versions of PageRank employing alternative Web document models
dc.typeJournal article
dc.identifier.journalAslib Proceedings
dc.format.digYES
rioxxterms.versionAM
dc.source.volume56
dc.source.issue1
dc.source.beginpage24
dc.source.endpage33
refterms.dateFCD2020-06-09T12:27:56Z
refterms.versionFCDAM
refterms.dateFOA2018-08-21T11:55:42Z
html.description.abstractIntroduces several new versions of PageRank (the link based Web page ranking algorithm), based on an information science perspective on the concept of the Web document. Although the Web page is the typical indivisible unit of information in search engine results and most Web information retrieval algorithms, other research has suggested that aggregating pages based on directories and domains gives promising alternatives, particularly when Web links are the object of study. The new algorithms introduced based on these alternatives were used to rank four sets of Web pages. The ranking results were compared with human subjects’ rankings. The results of the tests were somewhat inconclusive: the new approach worked well for the set that includes pages from different Web sites; however, it does not work well in ranking pages that are from the same site. It seems that the new algorithms may be effective for some tasks but not for others, especially when only low numbers of links are involved or the pages to be ranked are from the same site or directory.


Files in this item

Thumbnail
Name:
2004_new_pagerank_preprint.pdf
Size:
152.1Kb
Format:
PDF

This item appears in the following Collection(s)

Show simple item record