NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

DIFFSERVE: EFFICIENTLY SERVING TEXT-TO-IMAGE DIFFUSION MODELS WITH QUERY-AWARE MODEL SCALING

Ahmad, Sohaib; Yang, Qizheng; Wang, Haoliang; Sitaraman, Ramesh; Guan, Hui (May 2025, Proceedings of the 8 th MLSys Conference)

Free, publicly-accessible full text available May 18, 2026
DIFFSERVE: EFFICIENTLY SERVING TEXT-TO-IMAGE DIFFUSION MODELS WITH QUERY-AWARE MODEL SCALING

Ahmad, Sohaib; Yang, Qizheng; Wang, Haoliang; Sitaraman, Ramesh; Guan, Hui (May 2025, Proceedings of the 8 th MLSys Conference)

Free, publicly-accessible full text available May 17, 2026
GSPLIT: SCALING GRAPH NEURAL NETWORK TRAINING ON LARGE GRAPHS VIA PROBABILISTIC SPLITTING

Polisetty, Sandeep; Liu, Juelin; Falus, Jacob; Fung, Yiren; Lim, Seung Hwan; Guan, Hui; Serafini, Marco (May 2025, Proceedings of the 8 th MLSys Conference)

Free, publicly-accessible full text available May 17, 2026
What Makes a Visualization Visually Complex?

https://doi.org/10.1145/3706599.3719983

Lin, Kylie; Ru, Sean Sheng-tse; Rapp, David N; Guan, Hui; Xiong_Bearfield, Cindy (April 2025, ACM)

Free, publicly-accessible full text available April 25, 2026
An Empirical Study of Microscaling Formats for Low-Precision LLM Training

Yang, Hanmei; Deng, Summer; Nagpal, Amit; Naumov, Maxim; Janani, Mohammad; Liu, Tongping; Guan, Hui (April 2025, 2025 IEEE 32nd Symposium on Computer Arithmetic (ARITH))

Free, publicly-accessible full text available April 17, 2026
An Empirical Study of Microscaling Formats for Low-Precision LLM Training

Yang, Hanmei; Deng, Summer; Nagpal, Amit; Naumov, Maxim; Janani, Mohammad; Liu, Tongping; Guan, Hui (April 2025, 2025 IEEE 32nd Symposium on Computer Arithmetic (ARITH))

Free, publicly-accessible full text available April 17, 2026
Reimagining Parameter Space Exploration with Diffusion Models

Zhang, Lijun; Liu, Xiao; Guan, Hui (December 2024, First Exploration in AI Today Workshop at ICML (EXAIT at ICML 2025))

Free, publicly-accessible full text available December 9, 2025
Reimagining Parameter Space Exploration with Diffusion Models

Zhang, Lijun; Liu, Xiao; Guan, Hui (December 2024, First Exploration in AI Today Workshop at ICML (EXAIT at ICML 2025))

Free, publicly-accessible full text available December 9, 2025
Graph Neural Network Training Systems: A Performance Comparison of Full-Graph and Mini-Batch

https://doi.org/10.14778/3717755.3717776

Bajaj, Saurabh; Son, Hojae; Liu, Juelin; Guan, Hui; Serafini, Marco (December 2024, Proceedings of the VLDB Endowment)

Graph Neural Networks (GNNs) have gained significant attention in recent years due to their ability to learn representations of graph-structured data. Two common methods for training GNNs are mini-batch training and full-graph training. Since these two methods require different training pipelines and systems optimizations, two separate classes of GNN training systems emerged, each tailored for one method. Works that introduce systems belonging to a particular category predominantly compare them with other systems within the same category, offering limited or no comparison with systems from the other category. Some prior work also justifies its focus on one specific training method by arguing that it achieves higher accuracy than the alternative. The literature, however, has incomplete and contradictory evidence in this regard. In this paper, we provide a comprehensive empirical comparison of representative full-graph and mini-batch GNN training systems. We find that the mini-batch training systems consistently converge faster than the full-graph training ones across multiple datasets, GNN models, and system configurations. We also find that minibatch training techniques converge to similar to or often higher accuracy values than full-graph training ones, showing that minibatch sampling is not necessarily detrimental to accuracy. Our work highlights the importance of comparing systems across different classes, using time-to-accuracy rather than epoch time for performance comparison, and selecting appropriate hyperparameters for each training method separately.
more » « less
Free, publicly-accessible full text available December 1, 2025
Thinking Forward: Memory-Efficient Federated Finetuning of Language Models

Panchal, Kunjal; Parikh, Nisarg; Choudhary, Sunav; Zhang, Lijun; Brun, Yuriy; Guan, Hui (December 2024, NeurIPS)

Free, publicly-accessible full text available December 9, 2025

« Prev Next »

Search for: All records