skip to main content


Title: Development of a Pilot Manufacturing Cyberinfrastructure With an Information Rich Mechanical CAD 3D Model Repository
Abstract

Data driven advanced manufacturing research is dependent on access to large datasets made available from across the product lifecycle — from the concept design phase all the way down to end use and disposal. Despite such data being generated at a rapid pace, most product design data is archived in inaccessible silos. This is particularly acute in academic research laboratories and with data generated during product design and manufacturing courses. This project seeks to create an infrastructure that allow users (academia and the general public) to easily upload project data and related meta-data. Current manufacturing research must shift from siloed repositories of product manufacturing data to a federated, decentralized, open and inter-operable approach. In this regard, we build ‘FabWave’ a cyber-infrastructure tool designed to capture manufacturing data. In its first pilot implementation, we focused our attention to gathering information rich 3D Mechanical CAD data and related meta-data associated with them, with the intent to make it easier for users to upload and access product design data. We describe workflows that we have initially tested out within the two academic universities and under two different course structures. We have also developed automated workflows to gather license appropriate CAD assemblies from commercial repositories. Our intent is to create the only known largest available CAD model set within academia for enabling research in data-driven computational research in digital design, fabrication and quality control.

 
more » « less
Award ID(s):
1812687 1812675
NSF-PAR ID:
10173545
Author(s) / Creator(s):
; ; ; ;
Date Published:
Journal Name:
ASME 2019 14th International Manufacturing Science and Engineering Conference
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. null (Ed.)
    The first major goal of this project is to build a state-of-the-art information storage, retrieval, and analysis system that utilizes the latest technology and industry methods. This system is leveraged to accomplish another major goal, supporting modern search and browse capabilities for a large collection of tweets from the Twitter social media platform, web pages, and electronic theses and dissertations (ETDs). The backbone of the information system is a Docker container cluster running with Rancher and Kubernetes. Information retrieval and visualization is accomplished with containers in a pipelined fashion, whether in the cluster or on virtual machines, for Elasticsearch and Kibana, respectively. In addition to traditional searching and browsing, the system supports full-text and metadata searching. Search results include facets as a modern means of browsing among related documents. The system supports text analysis and machine learning to reveal new properties of collection data. These new properties assist in the generation of available facets. Recommendations are also presented with search results based on associations among documents and with logged user activity. The information system is co-designed by five teams of Virginia Tech graduate students, all members of the same computer science class, CS 5604. Although the project is an academic exercise, it is the practice of the teams to work and interact as though they are groups within a company developing a product. The teams on this project include three collection management groups -- Electronic Theses and Dissertations (ETD), Tweets (TWT), and Web-Pages (WP) -- as well as the Front-end (FE) group and the Integration (INT) group to help provide the overarching structure for the application. This submission focuses on the work of the Integration (INT) team, which creates and administers Docker containers for each team in addition to administering the cluster infrastructure. Each container is a customized application environment that is specific to the needs of the corresponding team. Each team will have several of these containers set up in a pipeline formation to allow scaling and extension of the current system. The INT team also contributes to a cross-team effort for exploring the use of Elasticsearch and its internally associated database. The INT team administers the integration of the Ceph data storage system into the CS Department Cloud and provides support for interactions between containers and the Ceph filesystem. During formative stages of development, the INT team also has a role in guiding team evaluations of prospective container components and workflows. The INT team is responsible for the overall project architecture and facilitating the tools and tutorials that assist the other teams in deploying containers in a development environment according to mutual specifications agreed upon with each team. The INT team maintains the status of the Kubernetes cluster, deploying new containers and pods as needed by the collection management teams as they expand their workflows. This team is responsible for utilizing a continuous integration process to update existing containers. During the development stage the INT team collaborates specifically with the collection management teams to create the pipeline for the ingestion and processing of new collection documents, crossing services between those teams as needed. The INT team develops a reasoner engine to construct workflows with information goal as input, which are then programmatically authored, scheduled, and monitored using Apache Airflow. The INT team is responsible for the flow, management, and logging of system performance data and making any adjustments necessary based on the analysis of testing results. The INT team has established a Gitlab repository for archival code related to the entire project and has provided the other groups with the documentation to deposit their code in the repository. This repository will be expanded using Gitlab CI in order to provide continuous integration and testing once it is available. Finally, the INT team will provide a production distribution that includes all embedded Docker containers and sub-embedded Git source code repositories. The INT team will archive this distribution on the Virginia Tech Docker Container Registry and deploy it on the Virginia Tech CS Cloud. The INT-2020 team owes a sincere debt of gratitude to the work of the INT-2019 team. This is a very large undertaking and the wrangling of all of the products and processes would not have been possible without their guidance in both direct and written form. We have relied heavily on the foundation they and their predecessors have provided for us. We continue their work with systematic improvements, but also want to acknowledge their efforts Ibid. Without them, our progress to date would not have been possible. 
    more » « less
  2. Inconsistent and incomplete applications of metadata standards and unsatisfactory approaches to connecting repository holdings across the global research infrastructure inhibit data discovery and reusability. The Realities of Academic Data Sharing (RADS) Initiative has found that institutions and researchers create and have access to the most complete metadata, but that valuable metadata found in these local institutional repositories (IRs) are not making their way into global data infrastructure such as DataCite or Crossref. This panel examines the local to global spectrum of metadata completeness, including the challenges of obtaining quality metadata at a local level, specifically at Cornell University, and the loss of metadata during the transfer processes from IRs into global data infrastructure. The metadata completeness increases over time, as users reuse data and contribute to the metadata. As metadata improves and grows, users find and develop connections within data not previously visible to them. By feeding local IR metadata into the global data infrastructure, the global infrastructure starts giving back in the form of these connections. We believe that this information will be helpful in coordinating metadata better and more effectively across data repositories and creating more robust interoperability and reusability between and among IRs. 
    more » « less
  3. Composable infrastructure holds the promise of accelerating the pace of academic research and discovery by enabling researchers to tailor the resources of a machine (e.g., GPUs, storage, NICs), on-demand, to address application needs. We were first introduced to composable infrastructure in 2018, and at the same time, there was growing demand among our College of Engineering faculty for GPU systems for data science, artificial intelligence / machine learning / deep learning, and visualization. Many purchased their own individual desktop or deskside systems, a few pursued more costly cloud and HPC solutions, and others looked to the College or campus computer center for GPU resources which, at the time, were scarce. After surveying the diverse needs of our faculty and studying product offerings by a few nascent startups in the composable infrastructure sector, we applied for and received a grant from the National Science Foundation in November 2019 to purchase a mid-scale system, configured to our specifications, for use by faculty and students for research and research training. This paper describes our composable infrastructure solution and implementation for our academic community. Given how modern workflows are progressively moving to containers and cloud frameworks (using Kubernetes) and to programming notebooks (primarily Jupyter), both for ease of use and for ensuring reproducible experiments, we initially adapted these tools for our system. We have since made it simpler to use our system, and now provide our users with a public facing JupyterHub server. We also added an expansion chassis to our system to enable composable co-location, which is a shared central architecture in which our researchers can insert and integrate specialized resources (GPUs, accelerators, networking cards, etc.) needed for their research. In February 2020, installation of our system was finalized and made operational and we began providing access to faculty in the College of Engineering. Now, two years later, it is used by over 40 faculty and students plus some external collaborators for research and research training. Their use cases and experiences are briefly described in this paper. Composable infrastructure has proven to be a useful computational system for workload variability, uneven applications, and modern workflows in academic environments. 
    more » « less
  4. After a natural disaster, multiple disciplines need to come together to rebuild the damaged infrastructure using new paradigms. For instance, urgent restoration of services demand to abridge the projects’ schedule and provide innovative solutions, thus making collaboration and integration essential for the project’s success. Commonly, the academic preparation of scholars on infrastructure-related disciplines takes place in isolated professional domains, rarely tackling interdisciplinary problems and/or learn from the systematic research of previous experiences. In Puerto Rico, the aftermath of Hurricanes Irma and Maria has heightened awareness regarding the education on infrastructure-related disciplines to provide transdisciplinary solutions to pertinent complex challenges. This taxing context compels the academia to train a new cadre of professionals properly prepared in those STEM disciplines. Further, current public awareness of the vulnerability of the existing infrastructure creates an opportunity to recruit and prepare students to become those much-needed professionals. The present work offers the conceptual framework of a collaborative effort among Architecture, Engineering, and Construction (AEC) to develop an interdisciplinary program in resilient and sustainable infrastructure. The framework includes the development of transformational pedagogic interventions and changes that will challenge the disciplinary splits among AEC. The framework targets values and skills for inter and transdisciplinary problem solving, as well as helps smooth the transition from academic education to professional practice. To implement the initiative, the project created a collaborative platform among three campuses of the University of Puerto Rico System. Each of these campuses offers a different educational component relevant to the enriching educational initiative. We expect this approach to create a new breed of professionals ready to face the challenges posed for the development of robust infrastructure. The strategy fosters readiness in environmental design in engineering and construction through evidence-based design and inter/transdisciplinary problem solving. Thus, this research contributes to the body of knowledge by presenting a collaborative effort to train future professionals to design and build a robust infrastructure that can overcome the impact of major natural catastrophes. 
    more » « less
  5. After a natural disaster, multiple disciplines need to come together to rebuild the damaged infrastructure using new paradigms. For instance, urgent restoration of services demand to abridge the projects’ schedule and provide innovative solutions, thus making collaboration and integration essential for the project’s success. Commonly, the academic preparation of scholars on infrastructure-related disciplines takes place in isolated professional domains, rarely tackling interdisciplinary problems and/or learn from the systematic research of previous experiences. In Puerto Rico, the aftermath of Hurricanes Irma and Maria has heightened awareness regarding the education on infrastructure-related disciplines to provide transdisciplinary solutions to pertinent complex challenges. This taxing context compels the academia to train a new cadre of professionals properly prepared in those STEM disciplines. Further, current public awareness of the vulnerability of the existing infrastructure creates an opportunity to recruit and prepare students to become those much-needed professionals. The present work offers the conceptual framework of a collaborative effort among Architecture, Engineering, and Construction (AEC) to develop an interdisciplinary program in resilient and sustainable infrastructure. The framework includes the development of transformational pedagogic interventions and changes that will challenge the disciplinary splits among AEC. The framework targets values and skills for inter and transdisciplinary problem solving, as well as helps smooth the transition from academic education to professional practice. To implement the initiative, the project created a collaborative platform among three campuses of the University of Puerto Rico System. Each of these campuses offers a different educational component relevant to the enriching educational initiative. We expect this approach to create a new breed of professionals ready to face the challenges posed for the development of robust infrastructure. The strategy fosters readiness in environmental design in engineering and construction through evidence-based design and inter/transdisciplinary problem solving. Thus, this research contributes to the body of knowledge by presenting a collaborative effort to train future professionals to design and build a robust infrastructure that can overcome the impact of major natural catastrophes. 
    more » « less