NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Parallel Software for Million-scale Exact Kernel Regression

https://doi.org/10.1145/3577193.3593737

Chen, Yu; Skon, Lucca; Mccombs, James; Liu, Zhenming; Stathopoulos, Andreas (June 2023, ACM)
Symphony in the Latent Space: Provably Integrating High-dimensional Techniques with Non-linear Machine Learning Models

Qiong Wu; Jian Li; Zhenming Liu; Yanhua Li; Mihai Cucuringu (April 2023, Proceedings of the AAAI Conference on Artificial Intelligence)

This paper revisits building machine learning algorithms that involve interactions between entities, such as those between financial assets in an actively managed portfolio, or interac- tions between users in a social network. Our goal is to forecast the future evolution of ensembles of multivariate time series in such applications (e.g., the future return of a financial asset or the future popularity of a Twitter account). Designing ML algorithms for such systems requires addressing the challenges of high-dimensional interactions and non-linearity. Existing approaches usually adopt an ad-hoc approach to integrating high-dimensional techniques into non-linear models and re- cent studies have shown these approaches have questionable efficacy in time-evolving interacting systems. To this end, we propose a novel framework, which we dub as the additive influence model. Under our modeling assump- tion, we show that it is possible to decouple the learning of high-dimensional interactions from the learning of non-linear feature interactions. To learn the high-dimensional interac- tions, we leverage kernel-based techniques, with provable guarantees, to embed the entities in a low-dimensional latent space. To learn the non-linear feature-response interactions, we generalize prominent machine learning techniques, includ- ing designing a new statistically sound non-parametric method and an ensemble learning algorithm optimized for vector re- gressions. Extensive experiments on two common applica- tions demonstrate that our new algorithms deliver significantly stronger forecasting power compared to standard and recently proposed methods.
more » « less
Full Text Available
C3-GAN: Complex-Condition-Controlled Urban Traffic Estimation through Generative Adversarial Networks

https://doi.org/10.1109/ICDM51629.2021.00196

Zhang, Yingxue; Li, Yanhua; Zhou, Xun; Liu, Zhenming; Luo, Jun (December 2021, IEEE International Conference on Data Mining)

Full Text Available
Equity2Vec: End-to-end Deep Learning Framework for Cross-sectional Asset Pricing

Wu, Q; Brinton, C; Zhang, Z; Cucuringu, M; Pizzoferrato, A; Liu, Z (November 2021, 2nd ACM International Conference on AI in Finance)

Full Text Available
BATS: A Spectral Biclustering Approach to Single Document Topic Modeling and Segmentation

Wu, Q; Tu, Y; Wang, S; Hare, A; Liu, Z; Brinton, C (October 2021, ACM transactions on intelligent systems and technology)

Full Text Available
Toward Efficient Interactions between Python and Native Libraries

Tan, J; Chen, C; Liu, Z; Ren, R; Song, R; Shen, X; Liu, X (August 2021, The 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE))

Full Text Available
Adaptive Reduced Rank Regression

Wu, Q; Kanade, V; Wong, F; Li, Y; Liu, Z (December 2020, The 34nd Annual Conference on Neural Information Processing Systems (NeurIPS))

Full Text Available
Adaptive Reduced Rank Regression.

Qiong Wu, Felix Ming (December 2020, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020)

We study the low rank regression problem $$\my = M\mx + \epsilon$$, where $$\mx$$ and $$\my$$ are d1 and d2 dimensional vectors respectively. We consider the extreme high-dimensional setting where the number of observations n is less than d1+d2. Existing algorithms are designed for settings where n is typically as large as $$\Rank(M)(d_1+d_2)$$. This work provides an efficient algorithm which only involves two SVD, and establishes statistical guarantees on its performance. The algorithm decouples the problem by first estimating the precision matrix of the features, and then solving the matrix denoising problem. To complement the upper bound, we introduce new techniques for establishing lower bounds on the performance of any algorithm for this problem. Our preliminary experiments confirm that our algorithm often out-performs existing baselines, and is always at least competitive.
more » « less
Full Text Available
BATS: A Spectral Biclustering Approach to Single Document Topic Modeling and Segmentation

Wang, S; Tu, Y; Wu, Q; Hare, A; Liu, L; Brinton, C; Li, Y (August 2020, ArXivorg)

Full Text Available
On Efficient Constructions of Checkpoints

Yu Chen, Zhenming Liu (July 2020, Proceedings of the 37th International Conference on Machine Learning, ICML 2020, 13-18 July 2020)
null (Ed.)
Efficient construction of checkpoints/snapshots is a critical tool for training and diagnosing deep learning models. In this paper, we propose a lossy compression scheme for checkpoint constructions (called LC-Checkpoint). LC-Checkpoint simultaneously maximizes the compression rate and optimizes the recovery speed, under the assumption that SGD is used to train the model. LC-Checkpointuses quantization and priority promotion to store the most crucial information for SGD to recover, and then uses a Huffman coding to leverage the non-uniform distribution of the gradient scales. Our extensive experiments show that LC-Checkpoint achieves a compression rate up to 28× and recovery speedup up to 5.77× over a state-of-the-art algorithm (SCAR).
more » « less
Full Text Available

« Prev Next »

Search for: All records