• Admin Login
    View Item 
    •   Home
    • Research Institute in Information and Language Processing
    • Research Institute in Information and Language Processing
    • View Item
    •   Home
    • Research Institute in Information and Language Processing
    • Research Institute in Information and Language Processing
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Browse

    All of WIRECommunitiesTitleAuthorsIssue DateSubmit DateSubjectsTypesJournalDepartmentPublisherThis CollectionTitleAuthorsIssue DateSubmit DateSubjectsTypesJournalDepartmentPublisher

    Administrators

    Admin Login

    Local Links

    AboutThe University LibraryOpen Access Publications PolicyDeposit LicenceCOREWIRE Copyright and Reuse Information

    Statistics

    Display statistics

    A layered approach for investigating the topological structure of communities in the Web.

    • CSV
    • RefMan
    • EndNote
    • BibTex
    • RefWorks
    Thumbnail
    Name:
    2003 A layered approach prepri ...
    Size:
    329.4Kb
    Format:
    PDF
    Download
    Authors
    Thelwall, Mike cc
    Issue Date
    2003
    
    Metadata
    Show full item record
    Abstract
    A layered approach for identifying communities in the Web is presented and explored by applying the flake exact community identification algorithm to the UK academic Web. Although community or topic identification is a common task in information retrieval, a new perspective is developed by: the application of alternative document models, shifting the focus from individual pages to aggregated collections based upon Web directories, domains and entire sites; the removal of internal site links; and the adaptation of a new fast algorithm to allow fully-automated community identification using all possible single starting points. The overall topology of the graphs in the three least-aggregated layers was first investigated and found to include a large number of isolated points but, surprisingly, with most of the remainder being in one huge connected component, exact proportions varying by layer. The community identification process then found that the number of communities far exceeded the number of topological components, indicating that community identification is a potentially useful technique, even with random starting points. Both the number and size of communities identified was dependent on the parameter of the algorithm, with very different results being obtained in each case. In conclusion, the UK academic Web is embedded with layers of non-trivial communities and, if it is not unique in this, then there is the promise of improved results for information retrieval algorithms that can exploit this additional structure, and the application of the technique directly to partially automate Web metrics tasks such as that of finding all pages related to a given subject hosted by a single country's universities.
    Citation
    Thelwall, M. (2003), "A layered approach for investigating the topological structure of communities in the Web", Journal of Documentation, Vol. 59 No. 4, pp. 410-429. https://doi.org/10.1108/00220410310485703
    Publisher
    MCB UP Ltd
    Journal
    Journal of Documentation
    URI
    http://hdl.handle.net/2436/4009
    DOI
    10.1108/00220410310485703
    Additional Links
    http://www.emeraldinsight.com/10.1108/00220410310485703
    Type
    Journal article
    Language
    en
    Description
    This is an accepted manuscript of an article published by MCB UP Ltd in Journal of Documentation on 01/08/2003, available online: https://doi.org/10.1108/00220410310485703 The accepted version of the publication may differ from the final published version.
    ISSN
    0022-0418
    ae974a485f413a2113503eed53cd6c53
    10.1108/00220410310485703
    Scopus Count
    Collections
    Research Institute in Information and Language Processing

    entitlement

     
    DSpace software (copyright © 2002 - 2023)  DuraSpace
    Quick Guide | Contact Us
    Open Repository is a service operated by 
    Atmire NV
     

    Export search results

    The export option will allow you to export the current search results of the entered query to a file. Different formats are available for download. To export the items, click on the button corresponding with the preferred download format.

    By default, clicking on the export buttons will result in a download of the allowed maximum amount of items.

    To select a subset of the search results, click "Selective Export" button and make a selection of the items you want to export. The amount of items that can be exported at once is similarly restricted as the full export.

    After making a selection, click one of the export format buttons. The amount of items that will be exported is indicated in the bubble next to export format.