Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
- Award ID(s): 2239570
- PAR ID: 10503468
- Publisher / Repository: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
- Date Published:
- Page Range / eLocation ID: 318 to 327
- Format(s): Medium: X
- Location: Singapore
- Sponsoring Org: National Science Foundation
