NSF PAR Search | NSF Public Access Repository

SION: Elastic Serverless Cloud Storage

Zhang, Jingyuan; Wang, Ao; Ma, Xiaolong; Carver, Benjamin; Newman, Nicholas; Anwar, Ali; Rupprecht, Lukas; Skourtis, Dimitrios; Tarasov, Vasily; Yan, Feng; et al (August 2023, International Conference on Very Large Data Bases (VLDB 2023))

Full Text Available
ScaleNAS: Multi-Path One-Shot NAS for Scale-Aware High-Resolution Representation

Cheng, Hsin-Pai; Liang, Feng; Li, Meng; Cheng, Bowen; Yan, Feng; Li, Hai; Chandra, Vikas; Chen, Yiran (July 2023, The AutoML Conference 2022)

Full Text Available
HDFL: A Heterogeneity and Client Dropout-Aware Federated Learning Framework

Zawad, Syed; Anwar, Ali; Zhou, Yi; Baracaldo, Nathalie; Yan, Feng (May 2023, IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID 2023))

Full Text Available
DySR: Adaptive Super-Resolution via Algorithm and System Co-design

Zawad, Syed; Li, Cheng; Yao, Zhewei; Zheng, Elton; He, Yuxiong; Yan, Feng (May 2023, International Conference on Learning Representations (ICLR 2023))

Full Text Available
NASRec: Weight Sharing Neural Architecture Search for Recommender Systems

Zhang, Tunhou; Cheng, Dehua; He, Yuchen; Chen, Zhengxing; Dai, Xiaoliang; Xiong, Liang; Yan, Feng; Li, Hai; Chen, Yiran; Wen, Wei (May 2023, 2023 ACM Web Conference (WWW 2023))

Full Text Available
MPress: Democratizing Billion-Scale Model Training on Multi-GPU Servers via Memory-Saving Inter-Operator Parallelism

https://doi.org/10.1109/HPCA56546.2023.10071077

Zhou, Quan; Wang, Haiquan; Yu, Xiaoyan; Li, Cheng; Bai, Youhui; Yan, Feng; Xu, Yinlong (February 2023, IEEE International Symposium on High-Performance Computer Architecture)

Full Text Available
PIDS: Joint Point Interaction-Dimension Search for 3D Point Cloud

https://doi.org/10.1109/WACV56688.2023.00135

Zhang, Tunhou; Ma, Mingyuan; Yan, Feng; Li, Hai; Chen, Yiran (January 2023, 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV))

The interaction and dimension of points are two important axes in designing point operators to serve hierarchical 3D models. Yet these two axes are heterogeneous and challenging to fully explore. Existing works craft a point operator under a single axis and reuse the crafted operator in all parts of a 3D model. This overlooks the opportunity to better combine point interactions and dimensions by exploiting the varying geometry/density of 3D point clouds. In this work, we establish PIDS, a novel paradigm to jointly explore point interactions and point dimensions to serve semantic segmentation on point cloud data. We establish a large search space to jointly consider versatile point interactions and point dimensions. This supports point operators with various geometry/density considerations. The enlarged search space with heterogeneous search components calls for a better ranking of candidate models. To achieve this, we improve the search space exploration by leveraging predictor-based Neural Architecture Search (NAS), and enhance the quality of prediction by assigning unique encodings to heterogeneous search components based on their priors. We thoroughly evaluate the networks crafted by PIDS on two semantic segmentation benchmarks, showing ∼1% mIOU improvement on SemanticKITTI and S3DIS over state-of-the-art 3D models.
Full Text Available
Optimizing inference serving on serverless platforms

https://doi.org/10.14778/3547305.3547313

Ali, Ahsan; Pinciroli, Riccardo; Yan, Feng; Smirni, Evgenia (June 2022, Proceedings of the VLDB Endowment)

Serverless computing is gaining popularity for machine learning (ML) serving workloads due to its autonomous resource scaling, ease of use, and pay-per-use cost model. Existing serverless platforms work well for image-based ML inference, where requests are homogeneous in service demands. That said, recent advances in natural language processing cannot fully benefit from existing serverless platforms, as their requests are intrinsically heterogeneous. Batching requests for processing can significantly increase ML serving efficiency while reducing monetary cost, thanks to the pay-per-use pricing model adopted by serverless platforms. Yet batching heterogeneous ML requests leads to additional computation overhead, as small requests need to be "padded" to the same size as large requests within the same batch. Reaching effective batching decisions (i.e., which requests should be batched together and why) is non-trivial: the padding overhead coupled with the serverless auto-scaling forms a complex optimization problem. To address this, we develop Multi-Buffer Serving (MBS), a framework that optimizes the batching of heterogeneous ML inference serving requests to minimize their monetary cost while meeting their service level objectives (SLOs). The core of MBS is a performance and cost estimator driven by analytical models supercharged by a Bayesian optimizer. MBS is prototyped and evaluated on AWS using bursty workloads. Experimental results show that MBS preserves SLOs while outperforming the state-of-the-art by up to 8× in terms of cost savings, while reducing the padding overhead by up to 37× with 3× fewer serverless function invocations.
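The padding overhead described in this abstract can be illustrated with a small sketch. It is a toy example, not the MBS algorithm: the request lengths, the buffer boundaries, and the function names `padding_overhead` and `split_into_buffers` are all assumptions made up for illustration. It only shows why grouping requests of similar size into separate buffers reduces the number of padded tokens compared with one mixed batch.

```python
from typing import Dict, List, Sequence


def padding_overhead(lengths: Sequence[int]) -> int:
    """Padded tokens when every request in a batch is padded to the longest one."""
    if not lengths:
        return 0
    longest = max(lengths)
    return sum(longest - n for n in lengths)


def split_into_buffers(
    lengths: Sequence[int], boundaries: Sequence[int]
) -> Dict[int, List[int]]:
    """Assign each request to the first buffer whose upper bound fits it.

    `boundaries` are hypothetical, hand-picked upper bounds; a real system
    would have to choose them (and the batch sizes) by optimization.
    """
    buffers: Dict[int, List[int]] = {b: [] for b in sorted(boundaries)}
    for n in lengths:
        for bound in buffers:
            if n <= bound:
                buffers[bound].append(n)
                break
        else:
            raise ValueError(f"request of length {n} exceeds all buffer bounds")
    return buffers


# Hypothetical token lengths of six heterogeneous requests.
requests = [12, 15, 20, 180, 200, 210]

# One mixed batch: short requests are padded up to length 210.
single = padding_overhead(requests)

# Two buffers: short and long requests are batched separately.
buffers = split_into_buffers(requests, boundaries=[32, 256])
multi = sum(padding_overhead(batch) for batch in buffers.values())

print(f"single batch padding: {single} tokens")  # 623
print(f"two-buffer padding:   {multi} tokens")   # 53
```

In this toy setting the single batch wastes 623 padded tokens, while the two-buffer split wastes 53. The sketch ignores cost, latency, and auto-scaling, which the abstract identifies as the parts that make the real decision problem hard.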
Full Text Available
Citadel: Protecting Data Privacy and Model Confidentiality for Collaborative Learning

https://doi.org/10.1145/3472883.3486998

Zhang, Chengliang; Xia, Junzhe; Yang, Baichen; Puyang, Huancheng; Wang, Wei; Chen, Ruichuan; Akkus, Istemi Ekin; Aditya, Paarijaat; Yan, Feng (November 2021, ACM Symposium on Cloud Computing 2021 (SoCC 2021))

Full Text Available
Unbalanced Parallel I/O: An Often-Neglected Side Effect of Lossy Scientific Data Compression

https://doi.org/10.1109/DRBSD754563.2021.00008

Wang, Xinying; Wan, Lipeng; Chen, Jieyang; Gong, Qian; Whitney, Ben; Wang, Jinzhen; Gainaru, Ana; Liu, Qing; Podhorszki, Norbert; Zhao, Dongfang; et al (November 2021, 2021 7th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD-7))

Full Text Available
