NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

AutoRAC: Automated Processing-in-Memory Accelerator Design for Recommender Systems

https://doi.org/10.1145/3716368.3735229

Cheng, Feng; Zhang, Tunhou; Zhang, Junyao; Ku, Jonathan Hao-Cheng; Wang, Yitu; Yang, Xiaoxuan; Li, Hai; Chen, Yiran (June 2025, ACM)

Free, publicly-accessible full text available June 29, 2026
DistDNAS: Search Efficient Feature Interactions within 2 Hours

Zhang, Tunhou; Wen, Wei; Fedorov, Igor; Liu, Xi; Zhang, Buyun; Han, Fangqiu; Chen, Wen-Yen; Han, Yiping; Han, Feng; Li, Hai; et al (December 2024, IEEE International Conference on Big ata (IEEE BigData))

Full Text Available
DistDNAS: Search Efficient Feature Interactions within 2 Hours

https://doi.org/10.1109/BigData62323.2024.10825061

Zhang, Tunhou; Wen, Wei; Fedorov, Igor; Liu, Xi; Zhang, Buyun; Han, Fangqiu; Chen, Wen-Yen; Han, Yiping; Yan, Feng; Li, Hai; et al (December 2024, IEEE)

Full Text Available
Towards Automated Model Design on Recommender Systems

https://doi.org/10.1145/3706124

Zhang, Tunhou; Cheng, Dehua; He, Yuchen; Chen, Zhengxing; Dai, Xiaoliang; Xiong, Liang; Liu, Yudong; Cheng, Feng; Cao, Yufan; Yan, Feng; et al (December 2024, ACM Transactions on Recommender Systems)

The increasing popularity of deep learning models has created new opportunities for developing AI-based recommender systems. Designing recommender systems using deep neural networks requires careful architecture design, and further optimization demands extensive co-design efforts on jointly optimizing model architecture and hardware. Design automation, such as Automated Machine Learning (AutoML), is necessary to fully exploit the potential of recommender model design, including model choices and model-hardware co-design strategies. We introduce a novel paradigm that utilizes weight sharing to explore abundant solution spaces. Our paradigm creates a large supernet to search for optimal architectures and co-design strategies to address the challenges of data multi-modality and heterogeneity in the recommendation domain. From a model perspective, the supernet includes a variety of operators, dense connectivity, and dimension search options. From a co-design perspective, it encompasses versatile Processing-In-Memory (PIM) configurations to produce hardware-efficient models. Our solution space’s scale, heterogeneity, and complexity pose several challenges, which we address by proposing various techniques for training and evaluating the supernet. Our crafted models show promising results on three Click-Through Rates (CTR) prediction benchmarks, outperforming both manually designed and AutoML-crafted models with state-of-the-art performance when focusing solely on architecture search. From a co-design perspective, we achieve 2 × FLOPs efficiency, 1.8 × energy efficiency, and 1.5 × performance improvements in recommender models.
more » « less
Full Text Available
CSCO: Connectivity Search of Convolutional Operators

Zhang, Tunhou; Li, Shiyu; Cheng, Hsin-Pai; Yan, Feng; Li, Hai; Chen, Yiran (June 2024, Fifth Workshop on Neural Architecture Search)

Full Text Available
Toward Fully Automated Machine Learning for Routability Estimator Development

https://doi.org/10.1109/TCAD.2023.3330818

Chang, Chen-Chia; Pan, Jingyu; Xie, Zhiyao; Zhang, Tunhou; Hu, Jiang; Chen, Yiran (March 2024, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems)

The rise of machine learning (ML) technology inspires a boom in its applications in electronic design automation (EDA) and helps improve the degree of automation in chip designs. However, manually crafting ML models remains a complex and time-consuming process because it requires extensive human expertise and tremendous engineering efforts to carefully extract features and design model architectures. In this work, we leverage automated ML techniques to automate the ML model development for routability prediction, a well-established technique that can help to guide cell placement toward routable solutions. We present an automated feature selection method to identify suitable features for model inputs. We develop a neural architecture search method to search for high-quality neural architectures without human interference. Our search method supports various operations and highly flexible connections, leading to architectures significantly different from all previous human-crafted models. Our experimental results demonstrate that our automatically generated models clearly outperform multiple representative manually crafted solutions with a superior 9.9% improvement. Moreover, compared with human-crafted models, which easily take weeks or months to develop, our efficient automated machine-learning framework completes the whole model development process in only 1 day.
more » « less
Full Text Available
LISSNAS: Locality-based Iterative Search Space Shrinkage for Neural Architecture Search

https://doi.org/10.24963/ijcai.2023/86

Gopal, Bhavna; Sridhar, Arjun; Zhang, Tunhou; Chen, Yiran (August 2023, The Thirty-Second International Joint Conference on Artificial Intelligence)

Search spaces hallmark the advancement of Neural Architecture Search (NAS). Large and complex search spaces with versatile building operators and structures provide more opportunities to brew promising architectures, yet pose severe challenges on efficient exploration and exploitation. Subsequently, several search space shrinkage methods optimize by selecting a single sub-region that contains some well-performing networks. Small performance and efficiency gains are observed with these methods but such techniques leave room for significantly improved search performance and are ineffective at retaining architectural diversity. We propose LISSNAS, an automated algorithm that shrinks a large space into a diverse, small search space with SOTA search performance. Our approach leverages locality, the relationship between structural and performance similarity, to efficiently extract many pockets of well-performing networks. We showcase our method on an array of search spaces spanning various sizes and datasets. We accentuate the effectiveness of our shrunk spaces when used in one-shot search by achieving the best Top-1 accuracy in two different search spaces. Our method achieves a SOTA Top-1 accuracy of 77.6% in ImageNet under mobile constraints, best-in-class Kendal-Tau, architectural diversity, and search space size.
more » « less
Full Text Available
PIDS: Joint Point Interaction-Dimension Search for 3D Point Cloud

Zhang, Tunhou; Ma, Mingyuan; Yan, Feng; Li, Hai; Chen, Yiran (January 2023, EEE/CVF Winter Conference on Applications of Computer Vision)

The interaction and dimension of points are two important axes in designing point operators to serve hierarchical 3D models. Yet, these two axes are heterogeneous and challenging to fully explore. Existing works craft point operator under a single axis and reuse the crafted operator in all parts of 3D models. This overlooks the opportunity to better combine point interactions and dimensions by exploiting varying geometry/density of 3D point clouds. In this work, we establish PIDS, a novel paradigm to jointly explore point interactions and point dimensions to serve semantic segmentation on point cloud data. We establish a large search space to jointly consider versatile point interactions and point dimensions. This supports point operators with various geometry/density considerations. The enlarged search space with heterogeneous search components calls for a better ranking of candidate models. To achieve this, we improve the search space exploration by leveraging predictor-based Neural Architecture Search (NAS), and enhance the quality of prediction by assigning unique encoding to heterogeneous search components based on their priors. We thoroughly evaluate the networks crafted by PIDS on two semantic segmentation benchmarks, showing 1% mIOU improvement on SemanticKITTI and S3DIS over state-of-the-art 3D models.
more » « less
Full Text Available
: Joint Point Interaction-Dimension Search for 3D Point Cloud

https://doi.org/10.1109/WACV56688.2023.00135

Zhang, Tunhou; Ma, Mingyuan; Yan, Feng; Li, Hai; Chen, Yiran (January 2023, 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV))

The interaction and dimension of points are two important axes in designing point operators to serve hierarchical 3D models. Yet, these two axes are heterogeneous and challenging to fully explore. Existing works craft point operator under a single axis and reuse the crafted operator in all parts of 3D models. This overlooks the opportunity to better combine point interactions and dimensions by exploiting varying geometry/density of 3D point clouds. In this work, we establish PIDS, a novel paradigm to jointly explore point interactions and point dimensions to serve semantic segmentation on point cloud data. We establish a large search space to jointly consider versatile point interactions and point dimensions. This supports point operators with various geometry/density considerations. The enlarged search space with heterogeneous search components calls for a better ranking of candidate models. To achieve this, we improve the search space exploration by leveraging predictor-based Neural Architecture Search (NAS), and enhance the quality of prediction by assigning unique encoding to heterogeneous search components based on their priors. We thoroughly evaluate the networks crafted by PIDS on two semantic segmentation benchmarks, showing ∼ 1% mIOU improvement on SemanticKITTI and S3DIS over state-of-the-art 3D models.
more » « less
Full Text Available
PIDS: Joint Point Interaction-Dimension Search for 3D Point Cloud

Zhang, Tunhou; Ma, Mingyuan; Yan, Feng; Li, Hai; Chen, Yiran. (January 2023, Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV))

Full Text Available

« Prev Next »

Search for: All records