Sarwar, RaheemYu, ChenyunNutanong, SaranaUrailertprasert, NorawitVannaboot, NattapolRakthanmanon, ThanawinPei, JianManolopoulos, YannisSadiq, Shazia WLi, Jianxin2020-10-122020-10-122018-05-13Sarwar R., Yu C., Nutanong S., Urailertprasert N., Vannaboot N., Rakthanmanon T. (2018) A Scalable Framework for Stylometric Analysis of Multi-author Documents. In: Pei J., Manolopoulos Y., Sadiq S., Li J. (eds) Database Systems for Advanced Applications. DASFAA 2018. Lecture Notes in Computer Science, vol 10827. Springer, Cham. https://doi.org/10.1007/978-3-319-91452-7_5297833199145100302-974310.1007/978-3-319-91452-7_52http://hdl.handle.net/2436/623703This is an accepted manuscript of a chapter published by Springer in Database Systems for Advanced Applications. DASFAA 2018. Lecture Notes in Computer Science, vol 10827 on 13/05/2018, available online: https://doi.org/10.1007/978-3-319-91452-7_52 The accepted version of the publication may differ from the final published version.Stylometry is a statistical technique used to analyze the variations in the author’s writing styles and is typically applied to authorship attribution problems. In this investigation, we apply stylometry to authorship identification of multi-author documents (AIMD) task. We propose an AIMD technique called Co-Authorship Graph (CAG) which can be used to collaboratively attribute different portions of documents to different authors belonging to the same community. Based on CAG, we propose a novel AIMD solution which (i) significantly outperforms the existing state-of-the-art solution; (ii) can effectively handle a larger number of co-authors; and (iii) is capable of handling the case when some of the listed co-authors have not contributed to the document as a writer. We conducted an extensive experimental study to compare the proposed solution and the best existing AIMD method using real and synthetic datasets. We show that the proposed solution significantly outperforms existing state-of-the-art method.application/pdfenauthorship identificationco-authorship graphmulti-author documentsstylometryA scalable framework for stylometric analysis of multi-author documentsConference contributionDatabase Systems for Advanced Applications - 23rd International Conference, DASFAA 2018, Gold Coast, QLD, Australia, May 21-24, 2018, Proceedings, Part I2020-10-07