skip to main content


Title: Semantic Representation Learning and Information Integration of BIM and Regulations
Automated checking of the compliance of building information modeling (BIM)-based building designs with relevant codes and regulations requires bridging the semantic gap between the Industry Foundation Classes (IFC) schema and the natural language. In most of the existing automated compliance checking (ACC) systems, the integration of the IFC schema and natural language is realized through hardcoding or predefined rules, ontologies, or dictionaries. These methods require intensive manual engineering effort and are often rigid and difficult to generalize. There is, thus, a need for an automated and meanwhile flexible and generalizable information integration method. To address this need, this paper leverages transformer-based language models to learn the semantic representations of concepts in the building information models (BIMs) and regulatory documents. An automated IFC-regulatory information integration approach based on these learned semantic representations is proposed. The preliminary experimental results show that the proposed approach achieved promising performance—an accuracy of 80%—on integrating IFC and regulatory concepts.  more » « less
Award ID(s):
1827733
NSF-PAR ID:
10347906
Author(s) / Creator(s):
;
Editor(s):
Issa, R.
Date Published:
Journal Name:
Computing in Civil Engineering 2021
Page Range / eLocation ID:
466 to 473
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. To allow full automation of building code compliance checking with different building design models and codes/regulations, input building design models need to be automatically validated. Automated architecture, engineering, and construction (AEC) object identification with high accuracy is essential for such validation. For example, in order to check egress requirements, exits of a building (and their presence or absence) need to be identified automatically through object identification. To address that, the authors propose a new AEC object identification algorithm that can identify needed code checking concepts from building design models based on the invariant signatures of AEC objects, which consisted of Cartesian points-based geometry, relative location and orientation, and material mechanical properties. Building design models in industry foundation classes (IFC) format are processed into invariant signatures, which can fully represent the model data and convert them into computable representations to support automated compliance reasoning. A systematic implementation of the above invariant signatures-based object identification algorithm can be used to automatically conduct building design model validation for code compliance checking preparation. An experimental testing on Chapters 4 and 8 of the International Building Code 2015 and a convenience store design model showed the model validation using the proposed identification algorithms successfully validated ceiling and interior door concepts. Comparing to the manual validation used in current practice, this new object identification algorithm is more efficient in supporting model validation for automated building code compliance checking. 
    more » « less
  2. Lu, W. ; Anumba, C. (Ed.)
    The digital and integrated representation of the physical and functional characteristics of buildings enabled by building information modeling (BIM) provides a computational environment for automated compliance checking (ACC) of building designs. The integration of natural language processing (NLP) and artificial intelligence (AI) with BIM brings further opportunities for ACC – it can empower BIM with text analytics and AI capabilities, thereby injecting intelligence and automation in the compliance checking processes. This chapter highlights emerging approaches that aim to facilitate and harness the marriage of BIM, NLP, and AI to enable the next generation of automated compliance checking systems (ACC) systems. This chapter (1) reviews different types of BIM-based ACC systems that leverage NLP and AI techniques, (2) discusses how NLP and AI techniques are applied in regulatory text analytics tasks and BIM information analytics tasks in the context of ACC, and (3) discusses the future trends of BIM-based ACC systems. 
    more » « less
  3. Traditional manual building code compliance checking is costly, time-consuming, and human error-prone. With the adoption of Building Information Modeling (BIM), automation in such a checking process becomes more feasible. However, existing methods still face limited automation when applied to different building codes. To address that, in this paper, the authors proposed a new framework that requires minimal input from users and strives for full automation, namely, the Invariant signature, logic reasoning, and Semantic Natural language processing (NLP)-based Automated building Code compliance Checking (I-SNACC) framework. The authors developed an automated building code compliance checking (ACC) prototype system under this framework and tested it on Chapter 10 of the International Building Codes 2015 (IBC 2015). The system was tested on two real projects and achieved 95.2% precision and 100% recall in non-compliance detection. The experiment showed that the framework is promising in building code compliance checking. Compared to the state-of-the-art methods, the new framework increases the degree of automation and saves manual efforts for finding non-compliance cases. 
    more » « less
  4. Steinmetz, A. (Ed.)
    Manual building code compliance checking is a time-consuming, labor-intensive and error-prone process. Automated logic-based reasoning is an essential step in the automation of this process. There have been previous studies using logic programming languages for automated logic-based reasoning to support automated compliance checking (ACC) of building designs with building codes. As a high-performance implementation of the standard logic programming language, B-Prolog was widely used in these studies. However, due to the support of dynamic predicates and user-defined operators, the predicates’ functions vary according to different user definitions; therefore, B-Prolog is sometimes not reliable for building code reasoning. As a more expressive, scalable, and reliable alterative to B-Prolog, Picat, a logic-based multi-paradigm programming language, provides a new and potentially more powerful platform for automated logic-based reasoning in ACC. To explore the potential value of Picat in ACC, in this study, the authors compared Picat and B-Prolog performance in automatically checking 20 requirement rules in the 2015 International Building Code. The experimental results showed that the automated checking for building codes in the B-Prolog version was faster than that in the Picat version, whereas the Picat version was more reliable than the B-Prolog version. This could be the result of B-Prolog using unifica-tion and Picat using pattern matching for indexing rules. More potential applications of Picat in ACC domain need further research. Furthermore, this schema could be used in the teaching of ACC to graduate construction students, illustrating the need to focus on the reliability, predictability and scalability of the process, in order to provide a practical solution to improving code compliance checking processes. 
    more » « less
  5. Tang, P. ; Grau, D. ; El Asmar, M. (Ed.)
    Existing automated code checking (ACC) systems require the extraction of requirements from regulatory textual documents into computer-processable rule representations. The information extraction processes in those ACC systems are based on either human interpretation, manual annotation, or predefined automated information extraction rules. Despite the high performance they showed, rule-based information extraction approaches, by nature, lack sufficient scalability—the rules typically need some level of adaptation if the characteristics of the text change. Machine learning-based methods, instead of relying on hand-crafted rules, automatically capture the underlying patterns of the existing training text and have a great capability of generalizing to a variety of texts. A more scalable, machine learning-based approach is thus needed to achieve a more robust performance across different types of codes/documents for automatically generating semantically-enriched building-code sentences for the purpose of ACC. To address this need, this paper proposes a machine learning-based approach for generating semantically-enriched building-code sentences, which are annotated syntactically and semantically, for supporting IE. For improved robustness and scalability, the proposed approach uses transfer learning strategies to train deep neural network models on both general-domain and domain-specific data. The proposed approach consists of four steps: (1) data preparation and preprocessing; (2) development of a base deep neural network model for generating semantically-enriched building-code sentences; (3) model training using transfer learning strategies; and (4) model evaluation. The proposed approach was evaluated on a corpus of sentences from the 2009 International Building Code (IBC) and the Champaign 2015 IBC Amendments. The preliminary results show that the proposed approach achieved an optimal precision of 88%, recall of 86%, and F1-measure of 87%, indicating good performance. 
    more » « less