While artificial intelligence and machine learning (AI/ML) frameworks gain prominence in science and engineering, most researchers face significant challenges in adopting complex AI/ML workflows to campus and national cyberinfrastructure (CI) environments. Data from the Texas A&M High Performance Computing (HPRC) researcher training program indicate that researchers increasingly want to learn how to migrate and work with their pre-existing AI/ML frameworks on large scale computing environments. Building on the continuing success of our work in developing innovative pedagogical approaches for CI- training approaches, we expand CI-infused pedagogical approaches to teach technology-based AI and data sciences. We revisit the pedagogical approaches used in the decades-old tradition of laboratories in the Physical Sciences that taught concepts via experiential learning. Here, we structure a series of exercises on interactive computing environments that give researchers immediate hands-on experience in AI/ML and data science technologies that they will use as they work on larger CI resources. These exercises, called “tech-labs,” assume that participating researchers are familiar with AI/ML approaches and focus on hands-on exercises that teach researchers how to use these approaches on large-scale CI. The tech-labs offer four consecutive sessions, each introducing a learner to specific technologies offered in CI environments for AI/ML and data workflows. We report on our tech-lab offered for Python-based AI/ML approaches during which learners are introduced to Jupyter Notebooks followed by exercises using Pandas, Matplotlib, Scikit-learn, and Keras. The program includes a series of enhancements such as container support and easy launch of virtual environments in our Web-based computing interface. The approach is scalable to programs using a command line interface (CLI) as well. In all, the program offers a shift in focus from teaching AI/ML toward increasing adoption of AI/ML in large-scale CI.
more »
« less
This content will become publicly available on July 17, 2025
Building a Cybertraining program for Climate Scientists in the Pacific to integrate Cyberinfrastructure and Open Science
The Cyberinfrastructure Training and Capacity Building in Climate and Environmental Sciences (CI-TRACS) program represents a pioneering initiative aimed at enhancing cyberinfrastructure proficiency within Hawaii’s academic community. This paper outlines the program’s comprehensive strategy, which integrates curriculum development, hands-on workshops, and professional growth opportunities to cultivate a robust foundation in CI practices. The initiative’s core objective is to elevate CI literacy, promote cross-disciplinary cooperation, and endorse the principles of open science. Significant contributions from the CI-TRACS program include a suite of educational materials and resources tailored for integration into higher education syllabi. Collaboration with the Hawaii Data Science Institute has been instrumental in nurturing a burgeoning network of data science professionals. The CI-TRACS program is instrumental in realizing the shared vision of equipping Hawaii’s emerging workforce with the sophisticated CI skills necessary to navigate and excel in the evolving landscape of climate and environmental sciences.
more »
« less
- PAR ID:
- 10545441
- Publisher / Repository:
- ACM
- Date Published:
- ISBN:
- 9798400704192
- Page Range / eLocation ID:
- 1 to 4
- Subject(s) / Keyword(s):
- Cyberinfrastructure Climate Science Data Science
- Format(s):
- Medium: X
- Location:
- Providence RI USA
- Sponsoring Org:
- National Science Foundation
More Like this
-
-
This paper provides an overview of the Hour of Cyberinfrastructure (Hour of CI), a project creating a suite of self-paced, hour-long lessons aimed at helping learners in the areas of spatial, social, and environmental sciences take their first steps in the path toward cyberinfrastructure. Using collaboratively developed lessons written in Jupyter Notebooks, the Hour of CI aims to lower barriers to cyberinfrastructure for next-generation scientists and scholars from broad and diverse backgrounds. Early findings based on a pilot of four lessons suggest our approach has created engaging and appropriately challenging lessons for diverse learners. The project will continue developing lessons to help learners build cyber literacy for GIScience and prepare them to tackle global problems.more » « less
-
In 2017, National Center for Supercomputing Applications (NCSA) at the University of Illinois at Urbana-Champaign (UIUC) established a pilot internship program for cyberinfrastructure (CI) professionals. The program, funded by NSF’s Office of Advanced Cyberinfrastructure (OAC) (award 1730519), was designed to address the shortage of a workforce with the specialized skills needed to support advanced CI operations. The program was envisioned to provide internship opportunities for individuals who want to gain first-hand experience in the CI operations at a supercomputing center, and develop and refine instructional materials to serve as a template that is openly distributed for use by other centers and institutions to train CI professionals. Program interns are selected from a pool of applicants with the main selection criteria of having a completed classwork equivalent to an associate degree and a demonstrated interest in a career in CI operations. Interns work directly with a group of NCSA engineers in one of the areas of CI focus to gain hands-on experience in the deployment and operation of high-performance computing (HPC) infrastructure at a leading HPC center. The expectation is that interns will enter a workforce that will develop, deploy, manage and support advanced CI at other universities, centers, and industry to meet the needs of the national computational science research community across academia and industry.more » « less
-
null (Ed.)Abstract—Networks have entered the mainstream lexicon over the last ten years. This coincides with the pervasive use of networks in a host of disciplines of interest to industry and academia, including biology, neurology, genomics, psychology, social sciences, economics, psychology, and cyber-physical systems and infrastructure. Several dozen journals and conferences regularly contain articles related to networks. Yet, there are no general purpose cyberinfrastructures (CI) that can be used across these varied disciplines and domains. Furthermore, while there are scientific gateways that include some network science capabilities for particular domains (e.g., biochemistry, genetics), there are no general-purpose network-based scientific gateways. In this work, we introduce net.science, a CI for Network Engineering and Science, that is designed to be a community resource. This paper provides an overview of net.science, addressing key requirements and concepts, CI components, the types of applications that our CI will support, and various dimensions of our evaluation process. Index Terms—cyberinfrastructure, network science, net.sciencemore » « less
-
The needs of cyberinfrastructure (CI) Users are different from those of CI Contributors. Typically, much of the training in advanced CI addresses developer topics such as MPI, OpenMP, CUDA and application profiling, leaving a gap in training for these users. To remedy this situation, we developed a new program: COMPrehensive Learning for end-users to Effectively utilize CyberinfraStructure (COMPLECS). COMPLECS focuses exclusively on helping CI Users acquire the skills and knowledge they need to efficiently accomplish their compute- and data-intensive research, covering topics such as parallel computing concepts, data management, batch computing, cybersecurity, HPC hardware overview, and high throughput computing.more » « less