Inconsistent and incomplete applications of metadata standards and unsatisfactory approaches to connecting repository holdings across the global research infrastructure inhibit data discovery and reusability. The Realities of Academic Data Sharing (RADS) Initiative has found that institutions and researchers create and have access to the most complete metadata, but that valuable metadata found in these local institutional repositories (IRs) are not making their way into global data infrastructure such as DataCite or Crossref. This panel examines the local to global spectrum of metadata completeness, including the challenges of obtaining quality metadata at a local level, specifically at Cornell University, and the loss of metadata during the transfer processes from IRs into global data infrastructure. The metadata completeness increases over time, as users reuse data and contribute to the metadata. As metadata improves and grows, users find and develop connections within data not previously visible to them. By feeding local IR metadata into the global data infrastructure, the global infrastructure starts giving back in the form of these connections. We believe that this information will be helpful in coordinating metadata better and more effectively across data repositories and creating more robust interoperability and reusability between and among IRs.
more »
« less
Development of a Pilot Manufacturing Cyberinfrastructure With an Information Rich Mechanical CAD 3D Model Repository
Abstract Data driven advanced manufacturing research is dependent on access to large datasets made available from across the product lifecycle — from the concept design phase all the way down to end use and disposal. Despite such data being generated at a rapid pace, most product design data is archived in inaccessible silos. This is particularly acute in academic research laboratories and with data generated during product design and manufacturing courses. This project seeks to create an infrastructure that allow users (academia and the general public) to easily upload project data and related meta-data. Current manufacturing research must shift from siloed repositories of product manufacturing data to a federated, decentralized, open and inter-operable approach. In this regard, we build ‘FabWave’ a cyber-infrastructure tool designed to capture manufacturing data. In its first pilot implementation, we focused our attention to gathering information rich 3D Mechanical CAD data and related meta-data associated with them, with the intent to make it easier for users to upload and access product design data. We describe workflows that we have initially tested out within the two academic universities and under two different course structures. We have also developed automated workflows to gather license appropriate CAD assemblies from commercial repositories. Our intent is to create the only known largest available CAD model set within academia for enabling research in data-driven computational research in digital design, fabrication and quality control.
more »
« less
- PAR ID:
- 10173545
- Date Published:
- Journal Name:
- ASME 2019 14th International Manufacturing Science and Engineering Conference
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
The widespread growth of additive manufacturing, a field with a complex informatic “digital thread”, has helped fuel the creation of design repositories, where multiple users can upload distribute, and download a variety of candidate designs for a variety of situations. Additionally, advancements in additive manufacturing process development, design frameworks, and simulation are increasing what is possible to fabricate with AM, further growing the richness of such repositories. Machine learning offers new opportunities to combine these design repository components’ rich geometric data with their associated process and performance data to train predictive models capable of automatically assessing build metrics related to AM part manufacturability. Although design repositories that can be used to train these machine learning constructs are expanding, our understanding of what makes a particular design repository useful as a machine learning training dataset is minimal. In this study we use a metamodel to predict the extent to which individual design repositories can train accurate convolutional neural networks. To facilitate the creation and refinement of this metamodel, we constructed a large artificial design repository, and subsequently split it into sub-repositories. We then analyzed metadata regarding the size, complexity, and diversity of the sub-repositories for use as independent variables predicting accuracy and the required training computational effort for training convolutional neural networks. The networks each predict one of three additive manufacturing build metrics: (1) part mass, (2) support material mass, and (3) build time. Our results suggest that metamodels predicting the convolutional neural network coefficient of determination, as opposed to computational effort, were most accurate. Moreover, the size of a design repository, the average complexity of its constituent designs, and the average and spread of design spatial diversity were the best predictors of convolutional neural network accuracy.more » « less
-
Physical samples and their associated (meta)data underpin scientific discoveries across disciplines, and can enable new science when appropriately archived. However, there are significant gaps in community practices and infrastructure that currently prevent accurate provenance tracking, reproducibility, and attribution. For the vast majority of samples, descriptive metadata is often sparse, inaccessible, or absent. Samples and associated (meta)data may also be scattered across numerous physical collections, data repositories, laboratories, data files, and papers with no clear linkages or provenance tracking as new information is generated over time. The Physical Samples Curation Cluster has therefore developed ‘A Scientific Author Guide for Publishing Open Research Using Physical Samples.’ This involved synthesizing existing practices, community feedback, and assessing real-world examples to identify community and infrastructure needs. We identified areas of work needed to enable authors to efficiently reference samples and related data, link related samples and data, and track their use. Our goal is to help improve the discoverability, interoperability, use of physical samples and associated (meta)data into the future.more » « less
-
Composable infrastructure holds the promise of accelerating the pace of academic research and discovery by enabling researchers to tailor the resources of a machine (e.g., GPUs, storage, NICs), on-demand, to address application needs. We were first introduced to composable infrastructure in 2018, and at the same time, there was growing demand among our College of Engineering faculty for GPU systems for data science, artificial intelligence / machine learning / deep learning, and visualization. Many purchased their own individual desktop or deskside systems, a few pursued more costly cloud and HPC solutions, and others looked to the College or campus computer center for GPU resources which, at the time, were scarce. After surveying the diverse needs of our faculty and studying product offerings by a few nascent startups in the composable infrastructure sector, we applied for and received a grant from the National Science Foundation in November 2019 to purchase a mid-scale system, configured to our specifications, for use by faculty and students for research and research training. This paper describes our composable infrastructure solution and implementation for our academic community. Given how modern workflows are progressively moving to containers and cloud frameworks (using Kubernetes) and to programming notebooks (primarily Jupyter), both for ease of use and for ensuring reproducible experiments, we initially adapted these tools for our system. We have since made it simpler to use our system, and now provide our users with a public facing JupyterHub server. We also added an expansion chassis to our system to enable composable co-location, which is a shared central architecture in which our researchers can insert and integrate specialized resources (GPUs, accelerators, networking cards, etc.) needed for their research. In February 2020, installation of our system was finalized and made operational and we began providing access to faculty in the College of Engineering. Now, two years later, it is used by over 40 faculty and students plus some external collaborators for research and research training. Their use cases and experiences are briefly described in this paper. Composable infrastructure has proven to be a useful computational system for workload variability, uneven applications, and modern workflows in academic environments.more » « less
-
After a natural disaster, multiple disciplines need to come together to rebuild the damaged infrastructure using new paradigms. For instance, urgent restoration of services demand to abridge the projects’ schedule and provide innovative solutions, thus making collaboration and integration essential for the project’s success. Commonly, the academic preparation of scholars on infrastructure-related disciplines takes place in isolated professional domains, rarely tackling interdisciplinary problems and/or learn from the systematic research of previous experiences. In Puerto Rico, the aftermath of Hurricanes Irma and Maria has heightened awareness regarding the education on infrastructure-related disciplines to provide transdisciplinary solutions to pertinent complex challenges. This taxing context compels the academia to train a new cadre of professionals properly prepared in those STEM disciplines. Further, current public awareness of the vulnerability of the existing infrastructure creates an opportunity to recruit and prepare students to become those much-needed professionals. The present work offers the conceptual framework of a collaborative effort among Architecture, Engineering, and Construction (AEC) to develop an interdisciplinary program in resilient and sustainable infrastructure. The framework includes the development of transformational pedagogic interventions and changes that will challenge the disciplinary splits among AEC. The framework targets values and skills for inter and transdisciplinary problem solving, as well as helps smooth the transition from academic education to professional practice. To implement the initiative, the project created a collaborative platform among three campuses of the University of Puerto Rico System. Each of these campuses offers a different educational component relevant to the enriching educational initiative. We expect this approach to create a new breed of professionals ready to face the challenges posed for the development of robust infrastructure. The strategy fosters readiness in environmental design in engineering and construction through evidence-based design and inter/transdisciplinary problem solving. Thus, this research contributes to the body of knowledge by presenting a collaborative effort to train future professionals to design and build a robust infrastructure that can overcome the impact of major natural catastrophes.more » « less
An official website of the United States government

