Author name disambiguation (AND) is the problem of clustering author mentions, extracted from publication and related records in digital libraries and other sources, so that each cluster corresponds to a unique author. Pairwise classification is an essential part of AND: it estimates the probability that two author mentions refer to the same author. Previous studies trained classifiers on features manually extracted from each attribute of the data; more recently, others trained models to learn a vector representation from the text without considering any structural information. Both approaches have advantages: the former exploits the structure of the data, while the latter captures textual similarity across attributes. Here, we introduce a hybrid method that combines the two by extracting both structure-aware features and global features. In addition, we introduce a novel way to train a global model using a large number of negative samples. Results on AMiner and PubMed data show a relative improvement in mean average precision (MAP) of more than 7.45% over previous state-of-the-art methods.
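The hybrid idea above can be illustrated with a small sketch: per-attribute similarities act as structure-aware features, a similarity over the concatenated record acts as the global feature, and a hand-weighted sum stands in for the learned pairwise classifier. All mention fields, weights, and the use of Jaccard similarity here are illustrative assumptions, not the paper's method.

```python
# Hypothetical sketch: hybrid pairwise scoring for author name
# disambiguation, combining structure-aware per-attribute features
# with a global feature over the whole record.

def jaccard(a, b):
    """Jaccard similarity between two token collections."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if a | b else 0.0

def pairwise_features(mention_a, mention_b):
    """Structure-aware features: one similarity per shared attribute."""
    attrs = ("coauthors", "venue", "title")
    return [jaccard(mention_a[attr].split(), mention_b[attr].split())
            for attr in attrs]

def global_feature(mention_a, mention_b):
    """Global feature: similarity over the concatenated record text,
    ignoring attribute boundaries."""
    text_a = " ".join(mention_a.values()).split()
    text_b = " ".join(mention_b.values()).split()
    return jaccard(text_a, text_b)

def same_author_score(mention_a, mention_b, weights=(0.4, 0.2, 0.2, 0.2)):
    """Weighted combination standing in for a learned classifier."""
    feats = pairwise_features(mention_a, mention_b)
    feats.append(global_feature(mention_a, mention_b))
    return sum(w * f for w, f in zip(weights, feats))

m1 = {"coauthors": "j smith k lee", "venue": "kdd",
      "title": "entity resolution at scale"}
m2 = {"coauthors": "k lee m chen", "venue": "kdd",
      "title": "scalable entity resolution"}
print(round(same_author_score(m1, m2), 3))  # → 0.497
```

In practice the weights would be learned (e.g., by logistic regression or a neural model) rather than fixed, and the score would feed into a downstream clustering step.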
A Stylometric Application of Large Language Models
We show that large language models (LLMs) can be used to distinguish the writings of different authors. Specifically, an individual GPT-2 model, trained from scratch on the works of one author, will predict held-out text from that author more accurately than held-out text from other authors. We suggest that, in this way, a model trained on one author's works embodies the unique writing style of that author. We first demonstrate our approach on books written by eight different (known) authors. We also use this approach to confirm R. P. Thompson's authorship of the well-studied 15th book of the Oz series, originally attributed to L. F. Baum.
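The attribution logic above can be sketched with a toy stand-in: an add-one-smoothed unigram model replaces the per-author GPT-2 models, and held-out text is attributed to the author whose model predicts it best (lowest cross-entropy). The models, texts, and author names here are illustrative assumptions; the paper trains full GPT-2 models.

```python
# Hypothetical sketch: per-author language models for stylometric
# attribution. A unigram model stands in for GPT-2.
import math
from collections import Counter

class UnigramLM:
    def __init__(self, text):
        self.counts = Counter(text.lower().split())
        self.total = sum(self.counts.values())
        self.vocab = len(self.counts) + 1  # +1 slot for unseen words

    def cross_entropy(self, text):
        tokens = text.lower().split()
        # Add-one smoothing so unseen words get nonzero probability.
        return -sum(math.log((self.counts[t] + 1) / (self.total + self.vocab))
                    for t in tokens) / len(tokens)

def attribute(heldout, models):
    """Return the author whose model predicts the held-out text best."""
    return min(models, key=lambda author: models[author].cross_entropy(heldout))

models = {
    "austen": UnigramLM("it is a truth universally acknowledged that a single man"),
    "melville": UnigramLM("call me ishmael some years ago never mind how long"),
}
print(attribute("a single man in possession of a good fortune", models))  # → austen
```

With real per-author GPT-2 models, `cross_entropy` would be replaced by the model's held-out loss, but the argmin over authors is the same.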
- Award ID(s):
- 2145172
- PAR ID:
- 10662802
- Publisher / Repository:
- arXiv
- Date Published:
- Journal Name:
- arXiv.org
- ISSN:
- 2331-8422
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
Two interlocking research questions of growing interest and importance in privacy research are Authorship Attribution (AA) and Authorship Obfuscation (AO). Given an artifact, especially a text t in question, an AA solution aims to accurately attribute t to its true author out of many candidate authors, while an AO solution aims to modify t to hide its true authorship. Traditionally, the notion of authorship and its accompanying privacy concerns applied only to human authors. In recent years, however, explosive advancements in Neural Text Generation (NTG) techniques in NLP, capable of synthesizing human-quality open-ended texts (so-called neural texts), mean that one must now consider authorship by humans, machines, or their combination. Given the implications and potential threats of neural texts when used maliciously, it has become critical to understand the limitations of traditional AA/AO solutions and to develop novel AA/AO solutions for neural texts. In this survey, therefore, we comprehensively review the recent literature on the attribution and obfuscation of neural-text authorship from a data-mining perspective, and share our views on its limitations and promising research directions.
-
The rampant proliferation of large language models, fluent enough to generate text indistinguishable from human-written language, gives unprecedented importance to the detection of machine-generated text. This work is motivated by an important research question: how will detectors of machine-generated text perform on the outputs of a new generator that they were not trained on? We begin by collecting generation data from a wide range of LLMs, train neural detectors on data from each generator, and test their performance on held-out generators. While none of the detectors generalizes to all generators, we observe a consistent and interesting pattern: detectors trained on data from a medium-sized LLM can zero-shot generalize to its larger version. As a concrete application, we demonstrate that robust detectors can be built on an ensemble of training data from medium-sized models.
-
One commonly recognized feature of the Ancient Greek corpus is that later texts frequently imitate and allude to model texts from earlier periods, but analysis of this phenomenon has mostly been limited to specific author pairs, based on close reading and highly visible instances of imitation. In this work, we use computational techniques to examine the similarity of a wide range of Ancient Greek authors, with a focus on similarity between authors writing many centuries apart. We represent texts and authors by their usage of high-frequency words, capturing author signatures rather than document topics, and measure similarity using Jensen-Shannon divergence. We then analyze author similarity across centuries, finding high similarity between specific authors and across the corpus that is not common to all languages.
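The representation described above can be sketched directly: each author becomes a probability distribution over a shared list of high-frequency words, and pairs of authors are compared with Jensen-Shannon divergence. The word list and sample texts here are illustrative assumptions, not the paper's corpus.

```python
# Hypothetical sketch: author signatures as high-frequency-word
# distributions, compared with Jensen-Shannon divergence (base 2,
# so values fall in [0, 1]).
import math
from collections import Counter

def word_distribution(text, vocab):
    """Normalized counts of the shared high-frequency words in a text."""
    counts = Counter(text.lower().split())
    total = sum(counts[w] for w in vocab) or 1
    return [counts[w] / total for w in vocab]

def kl(p, q):
    """Kullback-Leibler divergence in bits; 0 * log(0) terms are skipped."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def jensen_shannon(p, q):
    """Symmetric, bounded divergence between two distributions."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)

# Stand-in for the corpus-wide high-frequency vocabulary.
vocab = ["the", "and", "of", "to", "in"]
a = word_distribution("the war of the gods and the fate of men in the city", vocab)
b = word_distribution("of the sea and of the ships sailing to the islands", vocab)
print(round(jensen_shannon(a, b), 4))
```

Using function words rather than content words keeps the signature about style instead of topic, which is why high-frequency words are the natural feature set here.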
-
Key points: Text recycling is the reuse of material from an author's own prior work in a new document. While the ethical aspects of text recycling have received considerable attention, the legal aspects have been largely ignored or inaccurately portrayed. Copyright laws and publisher contracts are difficult to interpret and highly variable, making it difficult for authors or editors to know when text recycling in research writing is legal or illegal. We argue that publishers should revise their author contracts to make text recycling explicitly legal as long as authors follow ethics-based guidelines.