skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Text-to-Model Transformation: Natural Language-Based Model Generation Framework
System modeling language (SysML) diagrams generated manually by system modelers can sometimes be prone to errors, which are time-consuming and introduce subjectivity. Natural language processing (NLP) techniques and tools to create SysML diagrams can aid in improving software and systems design processes. Though NLP effectively extracts and analyzes raw text data, such as text-based requirement documents, to assist in design specification, natural language, inherent complexity, and variability pose challenges in accurately interpreting the data. In this paper, we explore the integration of NLP with SysML to automate the generation of system models from input textual requirements. We propose a model generation framework leveraging Python and the spaCy NLP library to process text input and generate class/block definition diagrams using PlantUML for visual representation. The intent of this framework is to aid in reducing the manual effort in creating SysML v1.6 diagrams—class/block definition diagrams in this case. We evaluate the effectiveness of the framework using precision and recall measures. The contribution of this paper to the systems modeling domain is two-fold. First, a review and analysis of natural language processing techniques for the automated generation of SysML diagrams are provided. Second, a framework to automatically extract textual relationships tailored for generating a class diagram/block diagram that contains the classes/blocks, their relationships, methods, and attributes is presented.  more » « less
Award ID(s):
2412813
PAR ID:
10561380
Author(s) / Creator(s):
; ;
Publisher / Repository:
MDPI
Date Published:
Journal Name:
Systems
Volume:
12
Issue:
9
ISSN:
2079-8954
Page Range / eLocation ID:
369
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Augmented Reality (AR) devices offer novel capabilities that can be exploited in AR systems to positively impact human-machine interactions in a variety of future-work and education contexts. This paper presents a systems model for a no-code AR systems framework that can be used to create AR applications that present just-in-time informatics to assist and guide users in the completion of complex task sequences while ensuring operator and environment safety. The salient structural and behavioral aspects of the system, and key use cases are modeled using the Systems Modeling Language (SysML). Representative examples of the model are presented using use case, block definition, internal block, activity, and state-machine diagrams. These models offer new insights into how AR capabilities can be integrated with a variety of engineered systems. In the future such SysML models can steer the design of new tools and an ontology to strengthen connections to domain knowledge. 
    more » « less
  2. The goal of text-to-text generation is to make machines express like a human in many applications such as conversation, summarization, and translation. It is one of the most important yet challenging tasks in natural language processing (NLP). Various neural encoder-decoder models have been proposed to achieve the goal by learning to map input text to output text. However, the input text alone often provides limited knowledge to generate the desired output, so the performance of text generation is still far from satisfaction in many real-world scenarios. To address this issue, researchers have considered incorporating (i) internal knowledge embedded in the input text and (ii) external knowledge from outside sources such as knowledge base and knowledge graph into the text generation system. This research topic is known as knowledge-enhanced text generation. In this survey, we present a comprehensive review of the research on this topic over the past five years. The main content includes two parts: (i) general methods and architectures for integrating knowledge into text generation; (ii) specific techniques and applications according to different forms of knowledge data. This survey can have broad audiences, researchers and practitioners, in academia and industry. 
    more » « less
  3. As the complexity of both products and systems increases across a wide range of industry sectors, there has been an influx in demand for methods of system organization and optimization. MBSE enhances the ability to obtain, analyze, communicate, and manage data on a comprehensive architecture of a system. In this study, a military combat surveillance scenario is modeled using SysML generating state machine diagrams and activity diagrams using the Magic Model Analyst execution framework plugin. This study seeks to prove the feasibility of an MBSE-enabled framework using SysML to create and simulate a surveillance system that monitors and reports on the health status and performance of an armored fighting vehicle (combat tank) through an Unmanned Ariel Vehicle (UAV). The Magic System of Systems Architect, which actively promotes system development architectural frameworks, was used to construct SysML-compliant models, allowing the creation of intricate model diagrams. The construction of the UAV surveillance scenario emphasized the capability of modifying a diagram feature and ensuring that the alteration is communicated to all linked model diagrams. This study builds on a previously published MBSE-enabled conceptual framework for creating digital twins. The purpose of this research is to test and validate the framework's procedures. Keywords—MBSE, SysML, MBSE framework, UAV, Surveillance 
    more » « less
  4. Pre-trained language models (PLMs) aim to learn universal language representations by conducting self-supervised training tasks on large-scale corpora. Since PLMs capture word semantics in different contexts, the quality of word representations highly depends on word frequency, which usually follows a heavy-tailed distributions in the pre-training corpus. Therefore, the embeddings of rare words on the tail are usually poorly optimized. In this work, we focus on enhancing language model pre-training by leveraging definitions of the rare words in dictionaries (e.g., Wiktionary). To incorporate a rare word definition as a part of input, we fetch its definition from the dictionary and append it to the end of the input text sequence. In addition to training with the masked language modeling objective, we propose two novel self-supervised pre-training tasks on word and sentence-level alignment between input text sequence and rare word definitions to enhance language modeling representation with dictionary. We evaluate the proposed Dict-BERT model on the language understanding benchmark GLUE and eight specialized domain benchmark datasets. Extensive experiments demonstrate that Dict-BERT can significantly improve the understanding of rare words and boost model performance on various NLP downstream tasks. 
    more » « less
  5. Lu, W.; Anumba, C. (Ed.)
    The digital and integrated representation of the physical and functional characteristics of buildings enabled by building information modeling (BIM) provides a computational environment for automated compliance checking (ACC) of building designs. The integration of natural language processing (NLP) and artificial intelligence (AI) with BIM brings further opportunities for ACC – it can empower BIM with text analytics and AI capabilities, thereby injecting intelligence and automation in the compliance checking processes. This chapter highlights emerging approaches that aim to facilitate and harness the marriage of BIM, NLP, and AI to enable the next generation of automated compliance checking systems (ACC) systems. This chapter (1) reviews different types of BIM-based ACC systems that leverage NLP and AI techniques, (2) discusses how NLP and AI techniques are applied in regulatory text analytics tasks and BIM information analytics tasks in the context of ACC, and (3) discusses the future trends of BIM-based ACC systems. 
    more » « less