skip to main content


Title: Integrating process, control-flow, and data resiliency layers using a hybrid Fenix/Kokkos approach
Integrating recent advancements in resilient algorithms and techniques into existing codes is a singular challenge in fault tolerance - in part due to the underlying complexity of implementing resilience in the first place, but also due to the difficulty introduced when integrating the functionality of a standalone new strategy with the preexisting resilience layers of an application. We propose that the answer is not to build integrated solutions for users, but runtimes designed to integrate into a larger comprehensive resilience system and thereby enable the necessary jump to multi-layered recovery. Our work designs, implements, and verifies one such comprehensive system of runtimes. Utilizing Fenix, a process resilience tool with integration into preexisting resilience systems as a design priority, we update Kokkos Resilience and the use pattern of VeloC to support application-level integration of resilience runtimes. Our work shows that designing integrable systems rather than integrated systems allows for user-designed optimization and upgrading of resilience techniques while maintaining the simplicity and performance of all-in-one resilience solutions. More application-specific choice in resilience strategies allows for better long-term flexibility, performance, and - importantly - simplicity.  more » « less
Award ID(s):
1664142
NSF-PAR ID:
10393177
Author(s) / Creator(s):
; ; ; ; ; ; ;
Date Published:
Journal Name:
2022 IEEE International Conference on Cluster Computing (CLUSTER)
Page Range / eLocation ID:
418 to 428
Format(s):
Medium: X
Sponsoring Org:
National Science Foundation
More Like this
  1. The integration of 3D printed sensors into hosting structures has become a growing area of research due to simplified assembly procedures, reduced system complexity, and lower fabrication cost. Embedding 3D printed sensors into structures or bonding the sensors on surfaces are the two techniques for the integration of sensors. This review extensively discusses the fabrication of sensors through different additive manufacturing techniques. Various additive manufacturing techniques dedicated to manufacture sensors as well as their integration techniques during the manufacturing process will be discussed. This review will also discuss the basic sensing mechanisms of integrated sensors and their applications. It has been proven that integrating 3D printed sensors into infrastructures can open new possibilities for research and development in additive manufacturing and sensor materials for smart goods and the Internet of Things.

     
    more » « less
  2. With the ever growing complexity of high performance computing (HPC) systems to satisfy emerging application requirements (e.g., high memory bandwidth requirement for machine learning applications), the performance bottleneck in such systems has moved from being computation-centric to be more communication-centric. Silicon photonic interconnection networks have been proposed to address the aggressive communication requirements in HPC systems, to realize higher bandwidth, lower latency, and better energy efficiency. There have been many successful efforts on developing silicon photonic devices, integrated circuits, and architectures for HPC systems. Moreover, many efforts have been made to address and mitigate the impact of different challenges (e.g., fabrication process and thermal variations) in silicon photonic interconnects. However, most of these efforts have focused only on a single design layer in the system design space (e.g., device, circuit or architecture level). Therefore, there is often a gap between what a design technique can improve in one layer, and what it might impair in another one. In this paper, we discuss the promise of cross-layer design methodologies for HPC systems integrating silicon photonic interconnects. In particular, we discuss how such cross-layer design solutions based on cooperatively designing and exchanging design objectives among different system design layers can help achieve the best possible performance when integrating silicon photonics into HPC systems 
    more » « less
  3. Microfluidics has earned a reputation for providing numerous transformative but disconnected devices and techniques. Active research seeks to address this challenge by integrating microfluidic components, including embedded miniature pumps. However, a significant portion of existing microfluidic integration relies on the time-consuming manual fabrication that introduces device variations. We put forward a framework for solving this disconnect by combining new pumping mechanics and 3D printing to demonstrate several novel, integrated and wirelessly driven microfluidics. First, we characterized the simplicity and performance of printed microfluidics with a minimum feature size of 100 µm. Next, we integrated a microtesla (µTesla) pump to provide non-pulsatile flow with reduced shear stress on beta cells cultured on-chip. Lastly, the integration of radio frequency (RF) device and a hobby-grade brushless motor completed a self-enclosed platform that can be remotely controlled without wires. Our study shows how new physics and 3D printing approaches not only provide better integration but also enable novel cell-based studies to advance microfluidic research. 
    more » « less
  4. Abstract

    Integrating single-cell multi-omics data is a challenging task that has led to new insights into complex cellular systems. Various computational methods have been proposed to effectively integrate these rapidly accumulating datasets, including deep learning. However, despite the proven success of deep learning in integrating multi-omics data and its better performance over classical computational methods, there has been no systematic study of its application to single-cell multi-omics data integration. To fill this gap, we conducted a literature review to explore the use of multimodal deep learning techniques in single-cell multi-omics data integration, taking into account recent studies from multiple perspectives. Specifically, we first summarized different modalities found in single-cell multi-omics data. We then reviewed current deep learning techniques for processing multimodal data and categorized deep learning-based integration methods for single-cell multi-omics data according to data modality, deep learning architecture, fusion strategy, key tasks and downstream analysis. Finally, we provided insights into using these deep learning models to integrate multi-omics data and better understand single-cell biological mechanisms.

     
    more » « less
  5. Given the increasing occurrence of high-impact low-probability (HILP) contingencies in existing power systems, strengthening the resilience of these systems has become of paramount importance. Enhancing the resilience of power systems is not solely a technical issue but also a socio-economic and policy concern. Therefore, improving the performance of power systems greatly relies on the guidance provided by energy policies. While the decarbonization response, supported by these policies to mitigate climate change, influences the adoption of energy technologies, its impact on the resilience of the system remains uncertain. To uncover the interactions between technologies, policies, and economics concerning power systems resilience, this study focuses on constructing resilience-oriented networked microgrid systems. It develops a two-stage stochastic programming model by integrating a method for selecting power outage scenarios identified by users, in the presence of emissions policies. The results confirm the contributions of integrated systems in enhancing resilience, but they also reveal that low-carbon emissions policies play an inhibiting role by increasing the financial costs associated with resilience planning and operations. Nevertheless, a 30% emissions reduction threshold can still be achieved from the integrated network, facilitating the dual benefits of maximizing emissions reduction and minimizing the burden of emissions taxes. The study's contributions are threefold: firstly, it incorporates techno-economic incentives and regulations simultaneously; secondly, it quantifies the unintended consequences of policies on resilience; and thirdly, it provides constructive guidance for future energy policymaking, particularly in maintaining system resilience. 
    more » « less