
Title: Deep Learning of Cross-Modal Tasks for Conceptual Design of Engineered Products: A Review

Conceptual design is the foundational stage of a design process, translating ill-defined design problems into low-fidelity design concepts and prototypes. While deep learning approaches are widely applied in later design stages for design automation, we see fewer attempts in conceptual design for three reasons: 1) the data in this stage exhibit multiple modalities (natural language, sketches, and 3D shapes), and these modalities are challenging to represent in deep learning methods; 2) conceptual design requires knowledge from a broader range of inspiration sources instead of focusing on a single design task; and 3) it requires translating designers’ intent and feedback, and hence needs more interaction with designers and/or users. With recent advances in deep learning of cross-modal tasks (DLCMT) and the availability of large cross-modal datasets, we see opportunities to apply these learning methods to the conceptual design of product shapes. In this paper, we review 30 recent journal articles and conference papers across computer graphics, computer vision, and engineering design fields that involve DLCMT of three modalities: natural language, sketches, and 3D shapes. Based on the review, we identify the challenges and opportunities of utilizing DLCMT in 3D shape concept generation, from which we propose a list of research questions pointing to future research directions.

Publisher / Repository:
American Society of Mechanical Engineers
Medium: X
St. Louis, Missouri, USA
Sponsoring Org:
National Science Foundation
More Like this
  1. Conceptual design is the foundational stage of a design process that translates ill-defined design problems into low-fidelity design concepts and prototypes through design search, creation, and integration. In this stage, product shape design is one of the most important aspects. When applying deep learning-based methods to product shape design, two major challenges exist: (1) design data exhibit multiple modalities and (2) there is an increasing demand for creativity. With recent advances in deep learning of cross-modal tasks (DLCMTs), which can transfer one design modality to another, we see opportunities to develop artificial intelligence (AI) to assist the design of product shapes in a new paradigm. In this paper, we conduct a systematic review of the retrieval, generation, and manipulation methods for DLCMT that involve three cross-modal types: text-to-3D shape, text-to-sketch, and sketch-to-3D shape. The review identifies 50 articles from a pool of 1341 papers in the fields of computer graphics, computer vision, and engineering design. We (1) review state-of-the-art DLCMT methods that can be applied to product shape design and (2) identify the key challenges, such as the lack of consideration of engineering performance in the early design phase, that need to be addressed when applying DLCMT methods. In the end, we discuss the potential solutions to these challenges and propose a list of research questions that point to future directions of data-driven conceptual design. 
  2. A solid understanding of electromagnetic (E&M) theory is key to the education of electrical engineering students. However, these concepts are notoriously challenging for students to learn, due to the difficulty in grasping abstract concepts such as the electric force as an invisible force that acts at a distance, or how electromagnetic radiation permeates and propagates in space. Building physical intuition to manipulate these abstractions requires means to visualize them in a three-dimensional space. This project involves the development of 3D visualizations of abstract E&M concepts in Virtual Reality (VR), in an immersive, exploratory, and engaging environment. VR provides the means of exploration, to construct visuals and manipulable objects to represent knowledge. This leads to a constructivist way of learning, in the sense that students are allowed to build their own knowledge from meaningful experiences. In addition, the VR labs offset the cost of hands-on labs by recreating the experiments and experiences on Virtual Reality platforms. The development of the VR labs for E&M courses involves four distinct phases: (I) Lab Design, (II) Experience Design, (III) Software Development, and (IV) User Testing. During phase I, the learning goals and possible outcomes are clearly defined, to provide context for the VR laboratory experience, and to identify possible technical constraints pertaining to the specific laboratory exercise. During phase II, the environment (the world) the player (user) will experience is designed, along with the foundational elements, such as ways of navigation, key actions, and immersion elements. During phase III, the software is generated as part of the course projects for the Virtual Reality course taught in the Computer Science Department at the same university, or as part of independent research projects involving engineering students. This reflects the strong educational impact of this project, as it allows students to contribute to the educational experiences of their peers. During phase IV, the VR experiences are played by different types of audiences that fit the player type. The team collects feedback and, if needed, implements changes. The pilot VR Lab, introduced as an additional instructional tool for the E&M course during Fall 2019, engaged over 100 students in the program, where in addition to the regular lectures, students attended one hour per week in the E&M VR lab. Student competencies around conceptual understanding of electromagnetism topics are measured via formative and summative assessments. To evaluate the effectiveness of VR learning, each lab is followed by a 10-minute multiple-choice test, designed to measure conceptual understanding of the various topics, rather than the ability to simply manipulate equations. This paper discusses the implementation and the pedagogy of the Virtual Reality laboratory experiences to visualize concepts in E&M, with examples for specific labs, as well as challenges and student feedback on the new approach. We will also discuss the integration of the 3D visualizations into lab exercises, and the design of the student assessment tools used to assess the knowledge gain when the VR technology is employed. 
  3. Inspirational stimuli are known to be effective in supporting ideation during early-stage design. However, prior work has predominantly constrained designers to using text-only queries when searching for stimuli, which is not consistent with real-world design behavior where fluidity across modalities (e.g., visual, semantic, etc.) is standard practice. In the current work, we introduce a multi-modal search platform that retrieves inspirational stimuli in the form of 3D-model parts using text, appearance, and function-based search inputs. Computational methods leveraging a deep-learning approach are presented for designing and supporting this platform, which relies on deep-neural networks trained on a large dataset of 3D-model parts. This work further presents the results of a cognitive study (n = 21) where the aforementioned search platform was used to find parts to inspire solutions to a design challenge. Participants engaged with three different search modalities: by keywords, 3D parts, and user-assembled 3D parts in their workspace. When searching by parts that are selected or in their workspace, participants had additional control over the similarity of appearance and function of results relative to the input. The results of this study demonstrate that the modality used impacts search behavior, such as search frequency, how retrieved search results are engaged with, and how broadly the search space is covered. Specific results link interactions with the interface to search strategies participants may have used during the task. Findings suggest that when searching for inspirational stimuli, desired results can be achieved both by direct search inputs (e.g., by keyword) and by more randomly discovered examples, where a specific goal was not defined. Both search processes are found to be important to support when designing search platforms for inspirational stimuli retrieval. 
  4. Conceptual diagrams are used extensively to understand abstract relationships, explain complex ideas, and solve difficult problems. To illustrate concepts effectively, experts find appropriate visual representations and translate concepts into concrete shapes. This translation step is not supported explicitly by current diagramming tools. This paper investigates how domain experts create conceptual diagrams via semi-structured interviews with 18 participants from diverse backgrounds. Our participants create, adapt, and reuse visual representations using both sketches and digital tools. However, they had trouble using current diagramming tools to transition from sketches and to reuse components from earlier diagrams. Our participants also expressed frustration with the slow feedback cycles and barriers to automation of their tools. Based on these results, we suggest four opportunities for diagramming tools (exploration support, representation salience, live engagement, and vocabulary correspondence) that together enable a natural diagramming experience. Finally, we discuss possibilities to leverage recent research advances to develop natural diagramming tools. 
  5. Background/Context: After-school programs that focus on integrating computer programming and mathematics in authentic environments are seldom accessible to students from culturally and linguistically diverse backgrounds, particularly bilingual Latina students in rural contexts. Providing a context that broadens Latina students’ participation in mathematics and computer programming requires educators to carefully examine how verbal and nonverbal language is used to interact and to position students as they learn new concepts in middle school. This is also an important stage for adolescents because they are likely to make decisions about their future careers in STEM. Having access to discourse and teaching practices that invite students to participate in mathematics and computer programming affords them opportunities to engage with these fields. Purpose/Focus of Study: This case study analyzes how small-group interactions mediated the positionings of Cindy, a bilingual Latina, as she learned binary numbers in an after-school program that integrated computer programming and mathematics (CPM). Setting: The Advancing Out-of-School Learning in Mathematics and Engineering (AOLME) program was held in a rural bilingual (Spanish and English) middle school in the Southwest. The after-school program was designed to provide experiences for primarily Latinx students to learn how to integrate mathematics with computer programming using Raspberry Pi and Python as a platform. Our case study explores how Cindy was positioned as she interacted with two undergraduate engineering students who served as facilitators while learning binary numbers with a group of three middle school students. Research Design: This single intrinsic case focused on exploring how small-group interactions among four students mediated Cindy’s positionings as she learned binary numbers through her participation in AOLME. Data sources included twelve 90-minute video sessions and Cindy’s journal and curriculum binder. Video logs were created, and transcripts were coded to describe verbal and nonverbal interactions among the facilitators and Cindy. Analysis of select episodes was conducted using systemic functional linguistics (SFL), specifically language modality, to identify how positioning took place. These episodes and positioning analysis describe how Cindy, with others, navigated the process of learning binary numbers under the stereotype that female students are not as good at mathematics as male students. Findings: From our analysis, three themes that emerged from the data portray Cindy’s experiences learning binary numbers. The major themes are: (1) Cindy’s struggle to reveal her understanding of binary numbers in a competitive context, (2) Cindy’s use of “fake it until you make it” to hide her cognitive dissonance, and (3) the use of Spanish and peers’ support to resolve Cindy’s understanding of binary numbers. The positioning patterns observed help us learn how, when Cindy’s bilingualism was viewed and promoted as an asset, this social context worked as a generative axis that addressed the challenges of learning binary numbers. The contrasting episodes highlight the facilitators’ productive teaching strategies and relations that nurtured Cindy’s social and intellectual participation in CPM. Conclusions/Recommendations: Cindy’s case demonstrates how the facilitator’s teaching and participants’ interactions and discourse practices contributed to her qualitatively different positionings while she learned binary numbers, and how she persevered in this process. Analysis of communication acts supported our understanding of how Cindy’s positionings underpinned the discourse; how the facilitators’ and students’ discourse formed, shaped, or shifted Cindy’s positioning; and how discourse was larger than gender storylines that went beyond classroom interactions. Cindy’s case reveals the danger of placing students in “struggle” instead of a “productive struggle.” The findings illustrated that when Cindy was placed in struggle when confronting responding moves by the facilitator, her “safe” reaction was hiding and avoiding. In contrast, we also learned about the importance of empathetic, nurturing supporting responses that encourage students’ productive struggle to do better. We invite instructors to notice students’ hiding or avoiding and consider Cindy’s case. Furthermore, we recommend that teachers notice their choice of language because this is important in terms of positioning students. We also highlight Cindy’s agency as she chose to take up her friend’s suggestion to “fake it” rather than give up. 