Machine Learning Methods for Stylometry

Machine Learning Methods for Stylometry
Author :
Publisher : Springer Nature
Total Pages : 286
Release :
ISBN-10 : 9783030533601
ISBN-13 : 3030533603
Rating : 4/5 (603 Downloads)

Book Synopsis Machine Learning Methods for Stylometry by : Jacques Savoy

Download or read book Machine Learning Methods for Stylometry written by Jacques Savoy and published by Springer Nature. This book was released on 2020-09-28 with total page 286 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book presents methods and approaches used to identify the true author of a doubtful document or text excerpt. It provides a broad introduction to all text categorization problems (like authorship attribution, psychological traits of the author, detecting fake news, etc.) grounded in stylistic features. Specifically, machine learning models as valuable tools for verifying hypotheses or revealing significant patterns hidden in datasets are presented in detail. Stylometry is a multi-disciplinary field combining linguistics with both statistics and computer science. The content is divided into three parts. The first, which consists of the first three chapters, offers a general introduction to stylometry, its potential applications and limitations. Further, it introduces the ongoing example used to illustrate the concepts discussed throughout the remainder of the book. The four chapters of the second part are more devoted to computer science with a focus on machine learning models. Their main aim is to explain machine learning models for solving stylometric problems. Several general strategies used to identify, extract, select, and represent stylistic markers are explained. As deep learning represents an active field of research, information on neural network models and word embeddings applied to stylometry is provided, as well as a general introduction to the deep learning approach to solving stylometric questions. In turn, the third part illustrates the application of the previously discussed approaches in real cases: an authorship attribution problem, seeking to discover the secret hand behind the nom de plume Elena Ferrante, an Italian writer known worldwide for her My Brilliant Friend’s saga; author profiling in order to identify whether a set of tweets were generated by a bot or a human being and in this second case, whether it is a man or a woman; and an exploration of stylistic variations over time using US political speeches covering a period of ca. 230 years. A solutions-based approach is adopted throughout the book, and explanations are supported by examples written in R. To complement the main content and discussions on stylometric models and techniques, examples and datasets are freely available at the author’s Github website.


Machine Learning Methods for Stylometry Related Books

Machine Learning Methods for Stylometry
Language: en
Pages: 286
Authors: Jacques Savoy
Categories: Computers
Type: BOOK - Published: 2020-09-28 - Publisher: Springer Nature

GET EBOOK

This book presents methods and approaches used to identify the true author of a doubtful document or text excerpt. It provides a broad introduction to all text
Authorship Attribution
Language: en
Pages: 116
Authors: Patrick Juola
Categories: Authorship, Disputed
Type: BOOK - Published: 2008 - Publisher: Now Publishers Inc

GET EBOOK

Authorship Attribution surveys the history and present state of the discipline, presenting some comparative results where available. It also provides a theoreti
Versification and Authorship Attribution
Language: en
Pages: 96
Authors: Petr Plecháč
Categories: Literary Criticism
Type: BOOK - Published: 2021-07-01 - Publisher: Charles University in Prague, Karolinum Press

GET EBOOK

The technique known as contemporary stylometry uses different methods, including machine learning, to discover a poem’s author based on features like the freq
Quantitative Methods in Corpus-based Translation Studies
Language: en
Pages: 372
Authors: Michael P. Oakes
Categories: Language Arts & Disciplines
Type: BOOK - Published: 2012 - Publisher: John Benjamins Publishing

GET EBOOK

This is a comprehensive guidebook to the quantitative methods needed for Corpus-Based Translation Studies (CBTS). It provides a systematic description of the va
Intelligent Systems Technologies and Applications
Language: en
Pages: 442
Authors: Sabu M. Thampi
Categories: Technology & Engineering
Type: BOOK - Published: 2017-10-20 - Publisher: Springer

GET EBOOK

This book constitutes the thoroughly refereed post-conference proceedings of the third International Symposium on Intelligent Systems Technologies and Applicati