LexDivPara: A Measure of Paraphrase Quality with Integrated Sentential Lexical Complexity

Tanh, T.; Do, Ha; Tanh, D.; Shi, Pu; Sathyanarayanan, A.; Khan, S.; Kohei, A.

doi:10.1007/978-3-030-82199-9_1

Citation Details

LexDivPara: A Measure of Paraphrase Quality with Integrated Sentential Lexical Complexity

We present a novel method that automatically measures quality of sentential paraphrasing. Our method balances two conflicting criteria: semantic similarity and lexical diversity. Using a diverse annotated corpus, we built learning to rank models on edit distance, BLEU, ROUGE, and cosine similarity features. Extrinsic evaluation on STS Benchmark and ParaBank Evaluation datasets resulted in a model ensemble with moderate to high quality. We applied our method on both small benchmarking and large-scale datasets as resources for the community. more »

Award ID(s):: 1838808 1849213

PAR ID:: 10293412

Author(s) / Creator(s):: Tanh, T.; Do, Ha; Tanh, D.; Shi, Pu; Sathyanarayanan, A.; Khan, S.; Kohei, A.

Date Published:: 2021-08-07

Journal Name:: International Workshop on Intelligent Systems and Applications

ISSN:: 2159-1539

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1007/978-3-030-82199-9_1

More Like this