University of Wolverhampton


Please use this identifier to cite or link to this item: http://hdl.handle.net/2436/15862



Title: Are raw RSS feeds suitable for broad issue scanning? A science concern case study
Authors: Thelwall, Mike; Prabowo, Rudy; Fairclough, Ruth
Citation: Journal of the American Society for Information Science and Technology, 57(12): 1644-1654
Publisher: Wiley InterScience
Issue Date: 2006
URI: http://hdl.handle.net/2436/15862
DOI: 10.1002/asi.20334
Additional Links: http://dx.doi.org/10.1002/asi.20334
Abstract: Broad issue scanning is the task of identifying important public debates arising in a given broad issue; really simple syndication (RSS) feeds are a natural information source for investigating broad issues. RSS, as originally conceived, is a method for publishing timely and concise information on the Internet, for example, about the main stories in a news site or the latest postings in a blog. RSS feeds are potentially a nonintrusive source of high-quality data about public opinion: Monitoring a large number may allow quantitative methods to extract information relevant to a given need. In this article we describe an RSS feed-based coword frequency method to identify bursts of discussion relevant to a given broad issue. A case study of public science concerns is used to demonstrate the method and assess the suitability of raw RSS feeds for broad issue scanning (i.e., without data cleansing). An attempt to identify genuine science concern debates from the corpus through investigating the top 1,000 burst words found only two genuine debates, however. The low success rate was mainly caused by a few pathological feeds that dominated the results and obscured any significant debates. The results point to the need to develop effective data cleansing procedures for RSS feeds, particularly if there is not a large quantity of discussion about the broad issue, and a range of potential techniques is suggested. Finally, the analysis confirmed that the time series information generated by real-time monitoring of RSS feeds could usefully illustrate the evolution of new debates relevant to a broad issue.
Type: Article
Language: en
Description: Metadata only
Keywords: Broad issue scanning; RSS feeds; Really simple syndication
ISSN: 1532-2882; 1532-2890
Appears in Collections: Statistical Cybermetrics Research Group

Files in This Item:

There are no files associated with this item.



All Items in WIRE are protected by copyright, with all rights reserved, unless otherwise indicated.

 
