FinLoRA: Finetuning Quantized Financial Large Language Models Using Low-Rank Adaptation on GPUs

Wang, D; Kim, D; Jin, B; Zhao, X; Fu, T; Yang, S Y; Liu, X

Citation Details

Finetuned large language models (LLMs) have shown remarkable performance in financial tasks, such as sentiment analysis and information retrieval. Due to privacy concerns, finetuning and deploying financial LLMs (FinLLMs) locally are crucial for institutions and individuals. In this paper, we employ quantized low-rank adaptation (QLoRA) to finetune FinLLMs, which leverage low-rank structure and quantization technique to significantly reduce computational requirements while maintaining model performance. We also employ data and pipeline parallelism to enable local finetuning on commodity GPUs. Experiments on financial datasets validate the efficacy of our approach in yielding notable improvements over the base models. more »

Award ID(s):: 2113906

PAR ID:: 10600801

Author(s) / Creator(s):: Wang, D; Kim, D; Jin, B; Zhao, X; Fu, T; Yang, S Y; Liu, X

Publisher / Repository:: arXiv preprints

Date Published:: 2024-12-16

Format(s):: Medium: X

Institution:: Stevens Institute of Technology

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Posted Content:
The DOI is not currently available.

More Like this