Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback | NSF Public Access Repository

Citation Details

Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

Award ID(s): 2239570

PAR ID: 10503468

Author(s) / Creator(s): Lai, Viet; Nguyen, Chien; Ngo, Nghia; Nguyen, Thuat; Dernoncourt, Franck; Rossi, Ryan; Nguyen, Thien Huu

Publisher / Repository: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: System Demonstrations

Date Published: 2023-12-06

Page Range / eLocation ID: 318 to 327

Format(s): Medium: X

Location: Singapore

Sponsoring Org: National Science Foundation

Conference Paper:
https://doi.org/10.18653/v1/2023.emnlp-demo.28