Similarity Based Label Smoothing For Dialogue Generation

Sougata Saha; Souvik Das; Rohini K. Srihari

Citation Details

Generative neural conversational systems are typically trained by minimizing the entropy loss between the training “hard” targets and the predicted logits. Performance gains and improved generalization are often achieved by employing regularization techniques like label smoothing, which converts the training “hard” targets to soft targets. However, label smoothing enforces a data independent uniform distribution on the incorrect training targets, leading to a false assumption of equiprobability. In this paper, we propose and experiment with incorporating data-dependent word similarity-based weighing methods to transform the uniform distribution of the incorrect target probabilities in label smoothing to a more realistic distribution based on semantics. We introduce hyperparameters to control the incorrect target distribution and report significant performance gains over networks trained using standard label smoothing-based loss on two standard open-domain dialogue corpora. more »

Award ID(s):: 2214070

PAR ID:: 10441742

Author(s) / Creator(s):: Sougata Saha; Souvik Das; Rohini K. Srihari

Date Published:: 2022-12-01

Journal Name:: Proceedings of the 19th International Conference on Natural Language Processing (ICON)

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
The DOI is not currently available.

More Like this