Feature Vector Difference based Neural Network and Logistic Regression Models for Authorship Verification

Weerasinghe, Janith; Greenstadt, Rachel

Citation Details

This paper describes the approach we took to create a machine learning model for the PAN 2020 Authorship Verification Task. For each document pair, we extracted stylometric features from the documents and used the absolute difference between the feature vectors as input to our classifier. We created two models: a Logistic Regression Model trained on a small dataset, and a Neural Network based model trained on the large dataset. These models achieved AUCs of 0.939 and 0.953 on the small and large datasets, making them the second-best models on both datasets submitted to the shared task. more »

Award ID(s):: 1931005

PAR ID:: 10249189

Author(s) / Creator(s):: Weerasinghe, Janith; Greenstadt, Rachel

Editor(s):: Cappellato, Linda; Eickhoff, Carsten; Ferro, Nicola; Névéol, Aurélie

Date Published:: 2020-01-01

Journal Name:: CEUR workshop proceedings

Volume:: 2695

ISSN:: 1613-0073

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this