Requirements Satisfiability with In-Context Learning

Santos, Sarah; Breaux, Travis; Norton, Thomas; Haghighi, Sara; Ghanavati, Sepideh

doi:10.1109/RE59067.2024.00025

Citation Details

Requirements Satisfiability with In-Context Learning

Language models that can learn a task at inference time, called in-context learning (ICL), show increasing promise in natural language inference tasks. In ICL, a model user constructs a prompt to describe a task with a natural language instruction and zero or more examples, called demonstrations. The prompt is then input to the language model to generate a completion. In this paper, we apply ICL to the design and evaluation of satisfaction arguments, which describe how a requirement is satisfied by a system specification and associated domain knowledge. The approach builds on three prompt design patterns, including augmented generation, prompt tuning, and chain-of-thought prompting, and is evaluated on a privacy problem to check whether a mobile app scenario and associated design description satisfies eight consent requirements from the EU General Data Protection Regulation (GDPR). The overall results show that GPT-4 can be used to verify requirements satisfaction with 96.7% accuracy and dissatisfaction with 93.2% accuracy. Inverting the requirement improves verification of dissatisfaction to 97.2%. Chain-of-thought prompting improves overall GPT-3.5 performance by 9.0% accuracy. We discuss the trade-offs among templates, models and prompt strategies and provide a detailed analysis of the generated specifications to inform how the approach can be applied in practice. more »

Award ID(s):: 2007298 2238047 2217572

PAR ID:: 10561323

Author(s) / Creator(s):: Santos, Sarah; Breaux, Travis; Norton, Thomas; Haghighi, Sara; Ghanavati, Sepideh

Publisher / Repository:: IEEE

Date Published:: 2024-06-24

ISBN:: 979-8-3503-9511-2

Page Range / eLocation ID:: 168 to 179

Format(s):: Medium: X

Location:: Reykjavik, Iceland

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript
Conference Paper:
https://doi.org/10.1109/RE59067.2024.00025

More Like this