-
Scientific domains such as genetics, climate modeling, and astronomy face significant obstacles in managing, preprocessing, and training deep learning models on complex data. Although several large-scale solutions offer distributed execution environments, open-source alternatives that integrate scalable runtime tools, deep learning frameworks, and data frameworks on high-performance computing (HPC) platforms remain crucial for accessibility and flexibility. In this paper, we introduce Deep Radical-Cylon (Deep RC), a heterogeneous runtime system that combines data engineering, deep learning frameworks, and workflow engines across several HPC environments, including cloud and supercomputing infrastructures. Deep RC supports heterogeneous systems with accelerators, allows the use of communication libraries such as MPI, GLOO, and NCCL across multi-node setups, and facilitates parallel and distributed deep learning pipelines by using Radical Pilot as a task execution framework. Running an end-to-end pipeline of preprocessing, model training, and postprocessing with 11 neural forecasting models (PyTorch) and hydrology models (TensorFlow) under identical resource conditions, the system reduces pipeline runtime by 3.28 and 75.9 seconds, respectively. The design of Deep RC guarantees the smooth integration of scalable data frameworks, such as Cylon, with deep learning processes, exhibiting strong performance on cloud platforms and scientific HPC systems. By offering a flexible, high-performance solution for resource-intensive applications, this approach closes the gap between data preprocessing, model training, and postprocessing.
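As a rough sketch of the multi-node communication setup described above (not the Deep RC or Radical Pilot API), a PyTorch worker might pick its communication backend at startup; the environment variables and the fallback logic here are assumptions.

```python
# Minimal sketch: choosing a torch.distributed backend (MPI, GLOO, or NCCL)
# per worker, as a stand-in for the multi-node setups described above.
# Illustrative only; it does not use the Deep RC or Radical Pilot interfaces.
import os
import torch
import torch.distributed as dist

def init_worker(backend: str = "gloo") -> None:
    """Join the process group using the requested communication backend."""
    # RANK/WORLD_SIZE/MASTER_ADDR/MASTER_PORT are assumed to be set by the
    # launcher (e.g. torchrun or a pilot-job system).
    dist.init_process_group(backend=backend,
                            rank=int(os.environ["RANK"]),
                            world_size=int(os.environ["WORLD_SIZE"]))

if __name__ == "__main__":
    # Prefer NCCL when GPUs are present, otherwise fall back to GLOO.
    init_worker("nccl" if torch.cuda.is_available() else "gloo")
    print(f"rank {dist.get_rank()} of {dist.get_world_size()} initialized")
    dist.destroy_process_group()
```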
-
MLCommons is an effort to develop and improve the artificial intelligence (AI) ecosystem through benchmarks, public data sets, and research. It consists of members from start-ups, leading companies, academia, and non-profits from around the world. The goal is to make machine learning better for everyone. To increase participation by others, educational institutions provide valuable opportunities for engagement. In this article, we identify numerous insights obtained from different viewpoints as part of efforts to utilize high-performance computing (HPC) big-data systems in existing education while developing and conducting science benchmarks for earthquake prediction. As this activity was conducted across multiple educational efforts, we project whether and how it is possible to make such efforts available on a wider scale. This includes the integration of sophisticated benchmarks into courses and research activities at universities, exposing students and researchers to topics that are otherwise typically not sufficiently covered in current course curricula, as we witnessed from our practical experience across multiple organizations. As such, we have outlined the many lessons we learned throughout these efforts, culminating in the need for benchmark carpentry for scientists using advanced computational resources. The article also presents the analysis of an earthquake prediction code benchmark while focusing on the accuracy of the results and not only on the runtime; notably, this benchmark was created as a result of our lessons learned. Energy traces were produced throughout these benchmarks, which are vital to analyzing the power expenditure within HPC environments. Additionally, one of the insights is that in the short time of the project, with limited student availability, the activity was only possible by utilizing a benchmark runtime pipeline while developing and using software to generate jobs automatically from the permutation of hyperparameters. The pipeline integrates a templated job management framework for executing tasks and experiments based on hyperparameters while leveraging hybrid compute resources available at different institutions. The software is part of a collection called cloudmesh, with its newly developed components cloudmesh-ee (experiment executor) and cloudmesh-cc (compute coordinator).
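A minimal sketch of the kind of hyperparameter-permutation job generation described above, assuming a Slurm-style batch system and hypothetical parameters (epochs, lr, batch); it is a generic illustration, not the cloudmesh-ee or cloudmesh-cc interface.

```python
# Minimal sketch of generating batch jobs from hyperparameter permutations,
# in the spirit of the templated workflow described above.
import itertools
from string import Template

job_template = Template(
    "#!/bin/bash\n"
    "#SBATCH --job-name=eq-bench-$tag\n"
    "python train.py --epochs $epochs --lr $lr --batch-size $batch\n"
)

# Hypothetical hyperparameter grid for a benchmark run.
grid = {"epochs": [2, 30, 70], "lr": [1e-3, 1e-4], "batch": [32, 64]}

for i, values in enumerate(itertools.product(*grid.values())):
    params = dict(zip(grid.keys(), values))
    # Render one job script per permutation and write it out for submission.
    script = job_template.substitute(tag=i, **params)
    with open(f"job_{i}.slurm", "w") as f:
        f.write(script)
```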
-
Digitization is changing our world, creating innovative finance channels and emerging technologies such as cryptocurrencies, which are applications of blockchain technology. However, cryptocurrency price volatility is one of this technology's main trade-offs. In this paper, we explore a time series analysis using deep learning to study the volatility and to understand this behavior. We apply a long short-term memory (LSTM) model that learns the patterns within cryptocurrency close prices and predicts future prices. The performance of this model is evaluated using the root-mean-square error and by comparing it to an ARIMA model.
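A minimal sketch of an LSTM over windows of close prices like the model described above; the framework (PyTorch), window length, and layer sizes are assumptions, not details taken from the paper.

```python
# Minimal sketch: an LSTM that maps a window of past close prices to the
# next close price, as a generic stand-in for the model described above.
import torch
import torch.nn as nn

class ClosePriceLSTM(nn.Module):
    def __init__(self, hidden_size: int = 64):
        super().__init__()
        self.lstm = nn.LSTM(input_size=1, hidden_size=hidden_size, batch_first=True)
        self.head = nn.Linear(hidden_size, 1)

    def forward(self, x):  # x: (batch, window, 1) of scaled close prices
        out, _ = self.lstm(x)
        return self.head(out[:, -1, :])  # predict the next close value

model = ClosePriceLSTM()
window = torch.randn(8, 30, 1)   # 8 dummy windows of 30 past closes
pred = model(window)             # shape: (8, 1)
# RMSE against a dummy target, mirroring the evaluation metric in the paper.
rmse = torch.sqrt(nn.functional.mse_loss(pred, torch.randn(8, 1)))
```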
-
Data-intensive applications are becoming commonplace in all science disciplines. They comprise a rich set of sub-domains such as data engineering, deep learning, and machine learning, and are built around efficient data abstractions and operators that suit the applications of different domains. Often, the lack of a clear definition of data structures and operators in the field has led to implementations that do not work well together. The HPTMT architecture that we recently proposed identifies a set of data structures, operators, and an execution model for creating rich data applications that efficiently links all aspects of data engineering and data science. This paper elaborates and illustrates this architecture using an end-to-end application in which deep learning and data engineering parts work together. Our analysis shows that the proposed system architecture is better suited for high-performance computing environments than current big data processing systems. Furthermore, our proposed system emphasizes the importance of efficient, compact data structures, such as the Apache Arrow tabular data representation, defined for high performance. Thus, the proposed system integration scales a sequential computation to a distributed computation while retaining optimal performance along with a highly usable application programming interface.
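As an illustration of the compact columnar structures the architecture builds on, the sketch below shows plain Apache Arrow (pyarrow) usage; it is not the HPTMT or Cylon API, and the column names are invented.

```python
# Minimal sketch: an Apache Arrow table holding tabular data that can be
# handed between data engineering and deep learning stages.
import pyarrow as pa
import numpy as np

# Build an Arrow table from typed columnar arrays (a compact, cache-friendly layout).
table = pa.table({
    "station_id": pa.array([1, 2, 3], type=pa.int32()),
    "rainfall_mm": pa.array([12.5, 0.0, 3.2], type=pa.float32()),
})

# Hand a column to a numerical stage as a NumPy array for model input.
features = table.column("rainfall_mm").to_numpy()
print(table.schema, features.dtype, np.mean(features))
```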
-
The COVID-19 (COrona VIrus Disease 2019) pandemic has had profound global consequences on health, economic, social, behavioral, and almost every other major aspect of human life. Therefore, it is of great importance to model COVID-19 and other pandemics in terms of the broader social contexts in which they take place. We present the architecture of an artificial intelligence enhanced COVID-19 analysis framework (AICov for short), which provides an integrative deep learning framework for COVID-19 forecasting with population covariates, some of which may serve as putative risk factors. We have integrated multiple strategies into AICov, including the ability to use deep learning strategies based on Long Short-Term Memory (LSTM) and event modeling. To demonstrate our approach, we have introduced a framework that integrates population covariates from multiple sources. Thus, AICov not only includes data on COVID-19 cases and deaths but, more importantly, the population's socioeconomic, health, and behavioral risk factors at their specific locations. The compiled data are fed into AICov, and integrating them into our model yields improved predictions compared to a model that uses only case and death data. Because we use deep learning, our models adapt over time as they learn from past data.
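A small sketch of how case and death time series might be combined with static population covariates into model-ready features, as described above; the shapes, covariate names, and random data are illustrative and not the actual AICov schema.

```python
# Minimal sketch: per-location COVID-19 case/death series combined with
# static population covariates into (locations, days, features) tensors.
import numpy as np

n_days, n_locations = 60, 3
cases = np.random.poisson(50, size=(n_locations, n_days))    # daily cases (dummy)
deaths = np.random.poisson(2, size=(n_locations, n_days))    # daily deaths (dummy)
covariates = np.random.rand(n_locations, 4)                  # e.g. socioeconomic,
                                                             # health, behavioral factors

# Broadcast static covariates along the time axis and stack with the series,
# giving windows an LSTM-style model can consume.
static = np.repeat(covariates[:, None, :], n_days, axis=1)
features = np.concatenate([cases[..., None], deaths[..., None], static], axis=-1)
print(features.shape)  # (3, 60, 6)
```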
-
Today’s problems require a plethora of analytics tasks to tackle state-of-the-art computational challenges in society, impacting many areas including health care, automotive, banking, natural language processing, image detection, and many more data-analytics-related tasks. Sharing existing analytics functions allows reuse and reduces overall effort. However, integrating deployment frameworks in the age of cloud computing is often out of reach for domain experts. Simple frameworks are needed that allow even non-experts to deploy and host services in the cloud. To avoid vendor lock-in, we require a generalized composable analytics service framework that allows users to integrate their own services with those offered not just by one, but by many cloud compute and service providers. We report on work that we conducted to provide a service integration framework for composing generalized analytics frameworks on multi-cloud providers, which we call our Generalized AI Service (GAS) Generator. We demonstrate the framework’s usability by showcasing useful analytics workflows on various cloud providers, including AWS, Azure, and Google, and on edge computing IoT devices. The examples are based on scikit-learn so they can be used in educational settings, replicated, and expanded upon. Benchmarks are used to compare the different services and showcase general replicability.
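As a minimal sketch of exposing an analytics function as a hosted service in the spirit of the framework described above, the example below wraps a scikit-learn model with plain Flask; it is not the GAS Generator interface, and the route and payload format are assumptions.

```python
# Minimal sketch: a scikit-learn model served as a small analytics service.
from flask import Flask, request, jsonify
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression

app = Flask(__name__)

# Train a toy model at startup so the service is self-contained.
X, y = load_iris(return_X_y=True)
model = LogisticRegression(max_iter=1000).fit(X, y)

@app.route("/predict", methods=["POST"])
def predict():
    # Expect JSON like {"features": [[5.1, 3.5, 1.4, 0.2]]}.
    features = request.get_json()["features"]
    return jsonify({"labels": model.predict(features).tolist()})

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=8080)
```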
