Search for: All records

Creators/Authors contains: "Cao, Tianyu"

Note: Clicking a Digital Object Identifier (DOI) takes you to an external site maintained by the publisher. Some full-text articles may not be available free of charge during the publisher's embargo period.

Some links on this page may take you to non-federal websites. Their policies may differ from those of this site.

  1. As large language models (LLMs) expand the power of natural language processing to handle long inputs, rigorous and systematic analyses are necessary to understand their abilities and behavior. A salient application is summarization, due to its ubiquity and controversy (e.g., researchers have declared the death of summarization). In this paper, we use financial report summarization as a case study because financial reports are not only long but also use numbers and tables extensively. We propose a computational framework for characterizing multimodal long-form summarization and investigate the behavior of Claude 2.0/2.1, GPT-4/3.5, and Cohere. We find that GPT-3.5 and Cohere fail to perform this summarization task meaningfully. For Claude 2 and GPT-4, we analyze the extractiveness of the summary and identify a position bias in LLMs. This position bias disappears after shuffling the input for Claude, which suggests that Claude recognizes important information regardless of where it appears. We also conduct a comprehensive investigation into the use of numeric data in LLM-generated summaries and offer a taxonomy of numeric hallucinations. We employ prompt engineering to improve GPT-4's use of numbers, with only limited success. Overall, our analyses highlight the strong capability of Claude 2 in handling long multimodal inputs compared to GPT-4. The generated summaries and evaluation code are available at https://github.com/ChicagoHAI/characterizing-multimodal-long-form-summarization.
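     The diagnostics named in this abstract (extractiveness, position bias, numeric hallucination) can be illustrated with a short sketch. The code below is not the authors' released evaluation code (that lives at the linked repository); it is a minimal, assumed implementation that uses verbatim n-gram overlap as an extractiveness proxy, first-occurrence positions of matched n-grams in the source as a position-bias diagnostic, and a regex check for summary numbers that never appear in the source. All function names and heuristics here are illustrative choices.

```python
import re

def ngrams(tokens, n):
    """Return the set of n-grams (as tuples) in a token list."""
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}

def extractiveness(source: str, summary: str, n: int = 4) -> float:
    """Fraction of summary n-grams that also appear verbatim in the source."""
    src, summ = source.lower().split(), summary.lower().split()
    summ_ngrams = ngrams(summ, n)
    if not summ_ngrams:
        return 0.0
    return len(summ_ngrams & ngrams(src, n)) / len(summ_ngrams)

def source_positions(source: str, summary: str, n: int = 4):
    """Relative positions (0 = start of source, 1 = end) at which summary
    n-grams first occur; a skew toward 0 would indicate a lead/position bias."""
    src, summ = source.lower().split(), summary.lower().split()
    first_seen = {}
    for i in range(len(src) - n + 1):
        first_seen.setdefault(tuple(src[i:i + n]), i)
    return [first_seen[g] / max(len(src) - n, 1)
            for g in ngrams(summ, n) if g in first_seen]

def unsupported_numbers(source: str, summary: str):
    """Numbers in the summary that never appear in the source: a crude proxy
    for numeric hallucination (ignores rounding and values derived by arithmetic)."""
    num = re.compile(r"\d[\d,]*\.?\d*")
    src_nums = {m.group().replace(",", "") for m in num.finditer(source)}
    return [m.group() for m in num.finditer(summary)
            if m.group().replace(",", "") not in src_nums]
```

     A real evaluation of financial reports would also need to handle tables, unit conversions, rounding, and derived figures, which these heuristics deliberately ignore.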
  2. Stable auroral red (SAR) arcs are luminous subauroral emissions produced by the collisional excitation of oxygen atoms during geomagnetically active times. While traditionally attributed to inner magnetospheric electron heating, recent observations and simulations challenge the exclusivity of this mechanism. Here, we resolve the ionospheric origin of SAR arcs using multi‐instrument observations and numerical simulations during the March 2015 geomagnetic storm. Both magnetospheric heat flux and ion‐neutral frictional heating, driven by subauroral plasma flows, independently generate SAR arcs with intensities surpassing background airglow by hundreds of Rayleighs. While thermal electron impact dominates red‐line emissions in both cases, the vertical structures diverge: frictional heating localizes emissions to altitudes of 250–400 km, whereas magnetospheric heating extends emissions above ∼280 km with broader altitudinal coverage. These results redefine SAR arc generation as a product of competing magnetospheric and ionospheric energy pathways, advancing our understanding of cross‐scale interactions in geospace.