Ngrams and Engrams: the use of structural and conceptual features to discriminate between English translations of religious texts

Handle:
http://hdl.handle.net/2436/620451
Title:
Ngrams and Engrams: the use of structural and conceptual features to discriminate between English translations of religious texts
Authors:
Franklin, Emma; Oakes, Michael
Abstract:
In this paper, we present experiments using the Linguistic Inquiry and Word Count (LIWC) program, a ‘closed-class keyword’ (CCK) analysis and a ‘correspondence analysis’ (CA) to examine whether the Scientology texts of L. Ron Hubbard are linguistically and conceptually like those of other religions. A Kruskal–Wallis test comparing the frequencies of LIWC category words in the Scientology texts and the English translations of the texts of five other religions showed that there were eighteen categories for which the Scientology texts differed from the others, and between one and seventeen for the other religions. In the CCK experiment, keywords typical of each religion were found, both by comparing the religious texts with one another and with the Brown corpus of general English. The most typical keywords were looked up in a concordancer and were manually coded with conceptual tags. The set of categories found for the Scientology texts showed little overlap with those found for the others. Our CA experiments produced fairly clear clusters of texts by religion. Scientology texts were seen at one pole on the first factor, with Christian and Islamic texts at the other. It appears that, in several ways, the Scientology texts are dissimilar to the texts of some of the world's major religions.
Citation:
Ngrams and Engrams: the use of structural and conceptual features to discriminate between English translations of religious texts 2016, 11 (3):299 Corpora
Publisher:
Edinburgh University Press
Journal:
Corpora
Issue Date:
Nov-2016
URI:
http://hdl.handle.net/2436/620451
DOI:
10.3366/cor.2016.0098
Additional Links:
http://www.euppublishing.com/doi/10.3366/cor.2016.0098
Type:
Article
Language:
en
ISSN:
1749-5032; 1755-1676
Appears in Collections:
FOSS
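
The Kruskal–Wallis comparison mentioned in the abstract can be illustrated with a minimal sketch. It computes the tie-corrected H statistic for one LIWC category across several groups of texts, using only the Python standard library. The function name `kruskal_wallis_h` and the frequency values are hypothetical and chosen for illustration; they are not the authors' code or data, and the record does not say which software the authors used for the test.

```python
from collections import Counter


def kruskal_wallis_h(groups):
    """Return the tie-corrected Kruskal-Wallis H statistic.

    `groups` is a list of samples, one list of numbers per group
    (here: one list of per-text category frequencies per religion).
    """
    if len(groups) < 2 or any(len(g) == 0 for g in groups):
        raise ValueError("need at least two non-empty groups")

    pooled = [value for group in groups for value in group]
    n_total = len(pooled)

    # Assign each distinct value the average of the ranks it occupies.
    tie_counts = Counter(pooled)
    rank_of = {}
    position = 0  # number of observations ranked so far
    for value in sorted(tie_counts):
        t = tie_counts[value]
        rank_of[value] = position + (t + 1) / 2
        position += t

    # H = 12 / (N (N + 1)) * sum(R_i^2 / n_i) - 3 (N + 1)
    rank_term = sum(
        sum(rank_of[v] for v in group) ** 2 / len(group) for group in groups
    )
    h = 12.0 / (n_total * (n_total + 1)) * rank_term - 3 * (n_total + 1)

    # Tie correction: divide by 1 - sum(t^3 - t) / (N^3 - N).
    tie_sum = sum(t ** 3 - t for t in tie_counts.values())
    correction = 1 - tie_sum / (n_total ** 3 - n_total)
    if correction == 0:
        raise ValueError("all observations are identical")
    return h / correction


# Hypothetical per-text relative frequencies (%) of one LIWC category.
# These numbers are invented for illustration only.
example_groups = [
    [1.2, 1.5, 1.1, 1.4],  # texts from corpus A
    [2.3, 2.8, 2.5, 2.6],  # texts from corpus B
    [1.9, 2.0, 1.7, 2.2],  # texts from corpus C
]
print(kruskal_wallis_h(example_groups))
```

For reasonably large samples, H is compared against a chi-squared distribution with k - 1 degrees of freedom, where k is the number of groups. Repeating the test for each LIWC category gives, per religion, a count of categories in which its texts differ from the rest, which is the kind of figure the abstract reports.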

Full metadata record

DC Field | Value | Language
dc.contributor.author | Franklin, Emma | en
dc.contributor.author | Oakes, Michael | en
dc.date.accessioned | 2017-04-13T13:52:12Z | -
dc.date.available | 2017-04-13T13:52:12Z | -
dc.date.issued | 2016-11 | -
dc.identifier.citation | Ngrams and Engrams: the use of structural and conceptual features to discriminate between English translations of religious texts 2016, 11 (3):299 Corpora | en
dc.identifier.issn | 1749-5032 | -
dc.identifier.issn | 1755-1676 | -
dc.identifier.doi | 10.3366/cor.2016.0098 | -
dc.identifier.uri | http://hdl.handle.net/2436/620451 | -
dc.description.abstract | In this paper, we present experiments using the Linguistic Inquiry and Word Count (LIWC) program, a ‘closed-class keyword’ (CCK) analysis and a ‘correspondence analysis’ (CA) to examine whether the Scientology texts of L. Ron Hubbard are linguistically and conceptually like those of other religions. A Kruskal–Wallis test comparing the frequencies of LIWC category words in the Scientology texts and the English translations of the texts of five other religions showed that there were eighteen categories for which the Scientology texts differed from the others, and between one and seventeen for the other religions. In the CCK experiment, keywords typical of each religion were found, both by comparing the religious texts with one another and with the Brown corpus of general English. The most typical keywords were looked up in a concordancer and were manually coded with conceptual tags. The set of categories found for the Scientology texts showed little overlap with those found for the others. Our CA experiments produced fairly clear clusters of texts by religion. Scientology texts were seen at one pole on the first factor, with Christian and Islamic texts at the other. It appears that, in several ways, the Scientology texts are dissimilar to the texts of some of the world's major religions. | en
dc.language.iso | en | en
dc.publisher | Edinburgh University Press | en
dc.relation.url | http://www.euppublishing.com/doi/10.3366/cor.2016.0098 | en
dc.rights | Archived with thanks to Corpora | en
dc.subject | Closed-class keywords analysis | en
dc.subject | Corpora | en
dc.subject | Correspondence analysis | en
dc.subject | LIWC | en
dc.subject | Religion | en
dc.subject | Religious texts | en
dc.subject | Scientology | en
dc.subject | Stylometry | en
dc.title | Ngrams and Engrams: the use of structural and conceptual features to discriminate between English translations of religious texts | en
dc.type | Article | en
dc.identifier.journal | Corpora | en
All Items in WIRE are protected by copyright, with all rights reserved, unless otherwise indicated.