Tweet coupling: a social media methodology for clustering scientific publications
Authors
Hassan, SUAljohani, Naif Radi
Shabbir, M
Ali, U
Iqbal, S
Sarwar, R
Martínez-Cámara, E
Ventura, S
Herrera, F
Issue Date
2020-05-18
Metadata
Show full item recordAbstract
© 2020, Akadémiai Kiadó, Budapest, Hungary. We argue that classic citation-based scientific document clustering approaches, like co-citation or Bibliographic Coupling, lack to leverage the social-usage of the scientific literature originate through online information dissemination platforms, such as Twitter. In this paper, we present the methodology Tweet Coupling, which measures the similarity between two or more scientific documents if one or more Twitter users mention them in the tweet(s). We evaluate our proposal on an altmetric dataset, which consists of 3081 scientific documents and 8299 unique Twitter users. By employing the clustering approaches of Bibliographic Coupling and Tweet Coupling, we find the relationship between the bibliographic and tweet coupled scientific documents. Further, using VOSviewer, we empirically show that Tweet Coupling appears to be a better clustering methodology to generate cohesive clusters since it groups similar documents from the subfields of the selected field, in contrast to the Bibliographic Coupling approach that groups cross-disciplinary documents in the same cluster.Citation
Hassan, S., Aljohani, N.R., Shabbir, M., Ali, U., Iqbal, S., Sarwar, R., Martínez-Cámara, E., Ventura, S. and Herrera, F. (2020) Tweet coupling: a social media methodology for clustering scientific publications, Scientometrics, 124, pp. 973–991.Publisher
Springer Science and Business Media LLCJournal
ScientometricsAdditional Links
https://link.springer.com/article/10.1007/s11192-020-03499-1Type
Journal articleLanguage
enDescription
This is an accepted manuscript of an article published by Springer in Scientometrics on 18/05/2020, available online: https://doi.org/10.1007/s11192-020-03499-1 The accepted version of the publication may differ from the final published version.ISSN
0138-9130EISSN
1588-2861Sponsors
The authors (Saeed-Ul Hassan & Mudassir Shabbir) were funded by the CIPL (National Center in Big Data and Cloud Computing (NCBC) grant, received from the Planning Commission of Pakistan, through Higher Education Commission (HEC) of Pakistan. This work was partially supported by the Spanish Ministry of Science and Technology under the projects TIN2017-89517-P and TIN2017-83445-P. Eugenio Martínez Cámara was supported by the Spanish Government Programme Juan de la Cierva Incorporación (IJC2018-036092-I).ae974a485f413a2113503eed53cd6c53
10.1007/s11192-020-03499-1
Scopus Count
Except where otherwise noted, this item's license is described as https://creativecommons.org/licenses/by-nc-nd/4.0/