Text Generation to Aid Depression Detection: A Comparative Study of Conditional Sequence Generative Adversarial Networks

Tlachac, ML; Gerych, Walter; Agrawal, Kratika; Litterer, Benjamin; Jurovich, Nicholas; Thatigotla, Saitheeraj; Thadajarassiri, Jidapa; Rundensteiner, Elke A.

doi:10.1109/BigData55660.2022.10020224

Citation Details

Text Generation to Aid Depression Detection: A Comparative Study of Conditional Sequence Generative Adversarial Networks

Corpuses of unstructured textual data, such as text messages between individuals, are often predictive of medical issues such as depression. The text data usually used in healthcare applications has high value and great variety, but is typically small in volume. Generating labeled unstructured text data is important to improve models by augmenting these small datasets, as well as to facilitate anonymization. While methods for labeled data generation exist, not all of them generalize well to small datasets. In this work, we thus perform a much needed systematic comparison of conditional text generation models that are promising for small datasets due to their unified architectures. We identify and implement a family of nine conditional sequence generative adversarial networks for text generation, which we collectively refer to as cSeqGAN models. These models are characterized along two orthogonal design dimensions: weighting strategies and feedback mechanisms. We conduct a comparative study evaluating the generation ability of the nine cSeqGAN models on three diverse text datasets with depression and sentiment labels. To assess the quality and realism of the generated text, we use standard machine learning metrics as well as human assessment via a user study. While the unconditioned models produced predictive text, the cSeqGAN models produced more realistic text. Our comparative study lays a solid foundation and provides important insights for further text generation research, particularly for the small datasets common within the healthcare domain. more »

Award ID(s):: 1852498

PAR ID:: 10399254

Author(s) / Creator(s):: Tlachac, ML; Gerych, Walter; Agrawal, Kratika; Litterer, Benjamin; Jurovich, Nicholas; Thatigotla, Saitheeraj; Thadajarassiri, Jidapa; Rundensteiner, Elke A.

Date Published:: 2022-12-17

Journal Name:: 2022 IEEE International Conference on Big Data (Big Data)

Page Range / eLocation ID:: 2804 to 2813

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1109/BigData55660.2022.10020224

More Like this