Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications

Zhu, Andrew; Dugan, Liam; Hwang, Alyssa; Callison-Burch, Chris

doi:10.18653/v1/2023.nlposs-1.8

Citation Details

Kani: A Lightweight and Highly Hackable Framework for Building Language Model Applications

Language model applications are becoming increasingly popular and complex, often including features like tool usage and retrieval augmentation. However, existing frameworks for such applications are often opinionated, deciding for developers how their prompts ought to be formatted and imposing limitations on customizability and reproducibility. To solve this we present Kani: a lightweight, flexible, and model-agnostic open-source framework for building language model applications. Kani helps developers implement a variety of complex features by supporting the core building blocks of chat interaction: model interfacing, chat management, and robust function calling. All Kani core functions are easily overridable and well documented to empower developers to customize functionality for their own needs. Kani thus serves as a useful tool for researchers, hobbyists, and industry professionals alike to accelerate their development while retaining interoperability and fine-grained control. more »

Award ID(s):: 1928474

PAR ID:: 10563506

Author(s) / Creator(s):: Zhu, Andrew; Dugan, Liam; Hwang, Alyssa; Callison-Burch, Chris

Publisher / Repository:: Empirical Methods in Natural Language Processing

Date Published:: 2023-01-01

Page Range / eLocation ID:: 65 to 77

Subject(s) / Keyword(s):: LLMs open source python

Format(s):: Medium: X

Location:: Singapore, Singapore

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.18653/v1/2023.nlposs-1.8

More Like this