NASRec: Weight Sharing Neural Architecture Search for Recommender Systems

Zhang, Tunhou; Cheng, Dehua; He, Yuchen; Chen, Zhengxing; Dai, Xiaoliang; Xiong, Liang; Yan, Feng; Li, Hai; Chen, Yiran; Wen, Wei

doi:10.1145/3543507.3583446

Citation Details

NASRec: Weight Sharing Neural Architecture Search for Recommender Systems

The rise of deep neural networks offers new opportunities in optimizing recommender systems. However, optimizing recommender systems using deep neural networks requires delicate architecture fabrication. We propose NASRec, a paradigm that trains a single supernet and efficiently produces abundant models/sub-architectures by weight sharing. To overcome the data multi-modality and architecture heterogeneity challenges in the recommendation domain, NASRec establishes a large supernet (i.e., search space) to search the full architectures. The supernet incorporates versatile choice of operators and dense connectivity to minimize human efforts for finding priors. The scale and heterogeneity in NASRec impose several challenges, such as training inefficiency, operator-imbalance, and degraded rank correlation. We tackle these challenges by proposing single-operator any-connection sampling, operator-balancing interaction modules, and post-training fine-tuning. Our crafted models, NASRecNet, show promising results on three Click-Through Rates (CTR) prediction benchmarks, indicating that NASRec outperforms both manually designed models and existing NAS methods with state-of-the-art performance. Our work is publicly available here. more »

Award ID(s):: 2120333 2112562 2140247 1937435 2305491

PAR ID:: 10441671

Author(s) / Creator(s):: Zhang, Tunhou; Cheng, Dehua; He, Yuchen; Chen, Zhengxing; Dai, Xiaoliang; Xiong, Liang; Yan, Feng; Li, Hai; Chen, Yiran; Wen, Wei

Date Published:: 2023-04-30

Journal Name:: the ACM Web Conference 2023

Page Range / eLocation ID:: 1199 to 1207

Format(s):: Medium: X

Sponsoring Org:: National Science Foundation

Free Publicly Accessible Full Text
Accepted Manuscript1.0
Conference Paper:
https://doi.org/10.1145/3543507.3583446

More Like this