skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Title: Transferring Indoor Corrosion Image Assessment Models to Outdoor Images via Domain Adaptation
Corrosion of materials impacts critical economic sectors from infrastructure, transportation, defense, health, to the environment. The development of safe anti-corrosive materials is thus an important area of study in materials science. Corrosion science of preparing materials and then monitoring their corrosion under adverse conditions is labor intensive, time consuming, and extremely costly. While deep learning has become popular in automating various engineering tasks, the development of deep models for corrosion assessment is lacking. We are the first to study deep domain adaptation (DA) models for the automated assessment of the corrosion status of anti-corrosive materials. Corrosion data, i.e., photographic images of treated corroding materials, is abundant when produced in artificially controlled laboratory settings, while corrosion image data sets from rich natural outdoor environments are more challenging to produce and thus much smaller. We leverage the more readily available indoor corrosion data to train a classifier and then transfer it via deep domain adaptation to also perform well on the small yet more realistic outdoor corrosion image data set – without requiring target labels. We empirically compare 5 popular domain adaptation models on real-world corrosion image data sets. Our study finds that DA achieves 27% improvement in test accuracy compared to the performance of the no-DA baseline for classifying real-world outdoor corrosion data.  more » « less
Award ID(s):
2021871
PAR ID:
10430301
Author(s) / Creator(s):
; ; ; ; ; ; ; ;
Date Published:
Journal Name:
2022 21st IEEE International Conference on Machine Learning and Applications (ICMLA)
Page Range / eLocation ID:
1386 - 1391
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. Corrosion is a prevalent issue in numerous industrial fields, causing expenses nearing $3 trillion or 4% of the GDP annually with safety threats and environmental pollution. To timely qualify and validate new corrosion-inhibiting materials on a large scale, accurate and efficient corrosion assessment is crucial. Yet it is hindered by a lack of automatic tools for expert-level corrosion segmentation of material science experimental images. Developing such tools is challenging due to limited domain-valid data, image artifacts visually similar to corrosion, various corrosion morphology, strong class imbalance, and millimeter-precision corrosion boundaries. To help the community address these challenges, we curate the first expert-level segmentation annotations for a real-world image dataset [1] for scientific corrosion segmentation. In addition, we design a deep learning-based model, called DeepSC-Edge that achieves guidance of ground-truth edge learning by adopting a novel loss that avoids over-fitting to edges. It also is enriched by integrating a class-balanced loss that improves segmentation with small area but crucial edges of interest for scientific corrosion assessment. Our dataset and methods pave the way to advanced deep-learning models for corrosion assessment and generation – promoting new research to connect computer vision and material science discovery. Once the appropriate approvals have been cleared, we expect to release the code and data at: https://arl.wpi.edu/ 
    more » « less
  2. Amid corrosion degradation of metallic structures causing expenses nearing 3 trillion or 4% of the GDP annually along with major safety risks, the adoption of AI technologies for accelerating the materials science life-cycle for developing materials with better corrosive properties is paramount. While initial machine learning models for corrosion assessment are being proposed in the literature, their incorporation into end-to-end tools for field experimentation by corrosion scientists remains largely unexplored. To fill this void, our university data science team in collaboration with the materials science unit at the Army Research Lab have jointly developed MOSS, an innovative AI-based digital platform to support material science corrosion research. MOSS features user-friendly iPadOS app for in-field corrosion progression data collection, deep-learning corrosion assessor, robust data repository system for long-term experimental data modeling, and visual analytics web portal for material science research. In this demonstration, we showcase the key innovations of the MOSS platform via use cases supporting the corrosion exploration processes, with the promise of accelerating the discovery of new materials. We open a MOSS video demo at: https://www.youtube.com/watch?v=CzcxMMRsxkE 
    more » « less
  3. Structural integrity can be compromised by the simultaneous presence of mechanical loads and corrosive agents. This study investigates the complex interplay between corrosion and impact loads in steel plates, utilizing discrete Fiber Bragg Grating (FBG) and distributed Optical Frequency Domain Reflectometry (OFDR) sensing technology. Generalized fiber optic-based sensing models are developed to quantify corrosion severity and rate. The experimental study was conducted using twelve epoxy-coated steel plates equipped with FBG and OFDR sensors, covering scenarios of individual exposure to corrosion, impact loads, as well as their combination. Test results reveal that specimens subjected to combined conditioning exhibit more corrosion damage than those subjected to individual corrosion. Both pit depth and its growth rate were exacerbated due to the impact loads. The study demonstrates the potential of fiber optic sensors (FOSs) for real-time monitoring and assessment of structural health under different simultaneous multiple factors in challenging conditions. 
    more » « less
  4. Domain adaptation (DA) addresses the real-world image classification problem of discrepancy between training (source) and testing (target) data distributions. We propose an unsupervised DA method that considers the presence of only unlabelled data in the target do- main. Our approach centers on finding matches between samples of the source and target domains. The matches are obtained by treating the source and target domains as hyper-graphs and carrying out a class-regularized hyper-graph matching using first-, second- and third-order similarities between the graphs. We have also developed a computationally efficient algorithm by initially selecting a subset of the samples to construct a graph and then developing a customized optimization routine for graph-matching based on Conditional Gradient and Alternating Direction Multiplier Method. This allows the proposed method to be used widely. We also performed a set of experiments on standard object recognition datasets to validate the effectiveness of our framework over previous approaches. 
    more » « less
  5. As demand grows for job-ready data science professionals, there is increasing recognition that traditional training often falls short in cultivating the higher-order reasoning and real-world problem-solving skills essential to the field. A foundational step toward addressing this gap is the identification and organization of knowledge components (KCs) that underlie data science problem solving (DSPS). KCs represent conditional knowledge—knowing about appropriate actions given particular contexts or conditions—and correspond to the critical decisions data scientists must make throughout the problem-solving process. While existing taxonomies in data science education support curriculum development, they often lack the granularity and focus needed to support the assessment and development of DSPS skills. In this paper, we present a novel framework that combines the strengths of large language models (LLMs) and human expertise to identify, define, and organize KCs specific to DSPS. We treat LLMs as ``knowledge engineering assistants" capable of generating candidate KCs by drawing on their extensive training data, which includes a vast amount of domain knowledge and diverse sets of real-world DSPS cases. Our process involves prompting multiple LLMs to generate decision points, synthesizing and refining KC definitions across models, and using sentence-embedding models to infer the underlying structure of the resulting taxonomy. Human experts then review and iteratively refine the taxonomy to ensure validity. This human-AI collaborative workflow offers a scalable and efficient proof-of-concept for LLM-assisted knowledge engineering. The resulting KC taxonomy lays the groundwork for developing fine-grained assessment tools and adaptive learning systems that support deliberate practice in DSPS. Furthermore, the framework illustrates the potential of LLMs not just as content generators but as partners in structuring domain knowledge to inform instructional design. Future work will involve extending the framework by generating a directed graph of KCs based on their input-output dependencies and validating the taxonomy through expert consensus and learner studies. This approach contributes to both the practical advancement of DSPS coaching in data science education and the broader methodological toolkit for AI-supported knowledge engineering. 
    more » « less