With the ever growing complexity of high performance computing (HPC) systems to satisfy emerging application requirements (e.g., high memory bandwidth requirement for machine learning applications), the performance bottleneck in such systems has moved from being computation-centric to be more communication-centric. Silicon photonic interconnection networks have been proposed to address the aggressive communication requirements in HPC systems, to realize higher bandwidth, lower latency, and better energy efficiency. There have been many successful efforts on developing silicon photonic devices, integrated circuits, and architectures for HPC systems. Moreover, many efforts have been made to address and mitigate the impact of different challenges (e.g., fabrication process and thermal variations) in silicon photonic interconnects. However, most of these efforts have focused only on a single design layer in the system design space (e.g., device, circuit or architecture level). Therefore, there is often a gap between what a design technique can improve in one layer, and what it might impair in another one. In this paper, we discuss the promise of cross-layer design methodologies for HPC systems integrating silicon photonic interconnects. In particular, we discuss how such cross-layer design solutions based on cooperatively designing and exchanging design objectives among different system design layers can help achieve the best possible performance when integrating silicon photonics into HPC systems
more »
« less
Integrating process, control-flow, and data resiliency layers using a hybrid Fenix/Kokkos approach
Integrating recent advancements in resilient algorithms and techniques into existing codes is a singular challenge in fault tolerance - in part due to the underlying complexity of implementing resilience in the first place, but also due to the difficulty introduced when integrating the functionality of a standalone new strategy with the preexisting resilience layers of an application. We propose that the answer is not to build integrated solutions for users, but runtimes designed to integrate into a larger comprehensive resilience system and thereby enable the necessary jump to multi-layered recovery. Our work designs, implements, and verifies one such comprehensive system of runtimes. Utilizing Fenix, a process resilience tool with integration into preexisting resilience systems as a design priority, we update Kokkos Resilience and the use pattern of VeloC to support application-level integration of resilience runtimes. Our work shows that designing integrable systems rather than integrated systems allows for user-designed optimization and upgrading of resilience techniques while maintaining the simplicity and performance of all-in-one resilience solutions. More application-specific choice in resilience strategies allows for better long-term flexibility, performance, and - importantly - simplicity.
more »
« less
- Award ID(s):
- 1664142
- PAR ID:
- 10393177
- Date Published:
- Journal Name:
- 2022 IEEE International Conference on Cluster Computing (CLUSTER)
- Page Range / eLocation ID:
- 418 to 428
- Format(s):
- Medium: X
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
Recent disruptions in transportation systems resulting from natural disasters, cyber accidents, and other factors clearly show the fragility of the airports and underscore the need for building resilience. This study introduces a comprehensive framework for evaluating the resilience of airport infrastructure, integrating critical functions and performance indicators in the context of specific missions that the airport needs to achieve. By focusing on the Dallas-Fort Worth International Airport (DFW) as a case study, the paper outlines a multi-criteria decision analysis (MCDA) methodology for identifying and assessing the critical functions of airports as well as their ability to recover and adapt under different threat scenarios including threat-agnostic situation. The methodology and its application to the DFW case study offer insights into the resilience of airport operations, highlighting key areas for improvement and the potential for policy intervention. This study provides a robust tool for airport administrators and policymakers to enhance infrastructure resilience through a detailed analysis and visualization of airport performance indicators, thereby contributing to the broader discourse on transportation system sustainability and disaster preparedness.more » « less
-
The integration of 3D printed sensors into hosting structures has become a growing area of research due to simplified assembly procedures, reduced system complexity, and lower fabrication cost. Embedding 3D printed sensors into structures or bonding the sensors on surfaces are the two techniques for the integration of sensors. This review extensively discusses the fabrication of sensors through different additive manufacturing techniques. Various additive manufacturing techniques dedicated to manufacture sensors as well as their integration techniques during the manufacturing process will be discussed. This review will also discuss the basic sensing mechanisms of integrated sensors and their applications. It has been proven that integrating 3D printed sensors into infrastructures can open new possibilities for research and development in additive manufacturing and sensor materials for smart goods and the Internet of Things.more » « less
-
Microfluidics has earned a reputation for providing numerous transformative but disconnected devices and techniques. Active research seeks to address this challenge by integrating microfluidic components, including embedded miniature pumps. However, a significant portion of existing microfluidic integration relies on the time-consuming manual fabrication that introduces device variations. We put forward a framework for solving this disconnect by combining new pumping mechanics and 3D printing to demonstrate several novel, integrated and wirelessly driven microfluidics. First, we characterized the simplicity and performance of printed microfluidics with a minimum feature size of 100 µm. Next, we integrated a microtesla (µTesla) pump to provide non-pulsatile flow with reduced shear stress on beta cells cultured on-chip. Lastly, the integration of radio frequency (RF) device and a hobby-grade brushless motor completed a self-enclosed platform that can be remotely controlled without wires. Our study shows how new physics and 3D printing approaches not only provide better integration but also enable novel cell-based studies to advance microfluidic research.more » « less
-
Abstract Integrating single-cell multi-omics data is a challenging task that has led to new insights into complex cellular systems. Various computational methods have been proposed to effectively integrate these rapidly accumulating datasets, including deep learning. However, despite the proven success of deep learning in integrating multi-omics data and its better performance over classical computational methods, there has been no systematic study of its application to single-cell multi-omics data integration. To fill this gap, we conducted a literature review to explore the use of multimodal deep learning techniques in single-cell multi-omics data integration, taking into account recent studies from multiple perspectives. Specifically, we first summarized different modalities found in single-cell multi-omics data. We then reviewed current deep learning techniques for processing multimodal data and categorized deep learning-based integration methods for single-cell multi-omics data according to data modality, deep learning architecture, fusion strategy, key tasks and downstream analysis. Finally, we provided insights into using these deep learning models to integrate multi-omics data and better understand single-cell biological mechanisms.more » « less
An official website of the United States government

