NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Equity2Vec: End-to-end Deep Learning Framework for Cross-sectional Asset Pricing

Wu, Q; Brinton, C; Zhang, Z; Cucuringu, M; Pizzoferrato, A; Liu, Z (November 2021, 2nd ACM International Conference on AI in Finance)

Full Text Available
BATS: A Spectral Biclustering Approach to Single Document Topic Modeling and Segmentation

Wu, Q; Tu, Y; Wang, S; Hare, A; Liu, Z; Brinton, C (October 2021, ACM transactions on intelligent systems and technology)

Full Text Available
Toward Efficient Interactions between Python and Native Libraries

Tan, J; Chen, C; Liu, Z; Ren, R; Song, R; Shen, X; Liu, X (August 2021, The 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE))

Full Text Available
Adaptive Reduced Rank Regression.

Qiong Wu, Felix Ming (December 2020, Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020)

We study the low rank regression problem $$\my = M\mx + \epsilon$$, where $$\mx$$ and $$\my$$ are d1 and d2 dimensional vectors respectively. We consider the extreme high-dimensional setting where the number of observations n is less than d1+d2. Existing algorithms are designed for settings where n is typically as large as $$\Rank(M)(d_1+d_2)$$. This work provides an efficient algorithm which only involves two SVD, and establishes statistical guarantees on its performance. The algorithm decouples the problem by first estimating the precision matrix of the features, and then solving the matrix denoising problem. To complement the upper bound, we introduce new techniques for establishing lower bounds on the performance of any algorithm for this problem. Our preliminary experiments confirm that our algorithm often out-performs existing baselines, and is always at least competitive.
more » « less
Full Text Available
BATS: A Spectral Biclustering Approach to Single Document Topic Modeling and Segmentation

Wang, S; Tu, Y; Wu, Q; Hare, A; Liu, L; Brinton, C; Li, Y (August 2020, ArXivorg)

Full Text Available
On Efficient Constructions of Checkpoints

Chen, Y; Liu, Z; Ren, R; Jin, X (January 2020, International Conference on Machine Learning 2020)

Full Text Available
Is Reinforcement Learning the Choice of Human Learners? A Case Study of Taxi Drivers

Pan, M; Huang, W; Li, Y; Zhou, X; Liu, Z; Bao, J; Zheng, Y; Luo, J (January 2020, ACM SIGSPATIAL 2020)

Full Text Available
A Deep Learning Framework for Pricing Financial Instruments

Wu, Q; Zhang, Z; Pizzoferroto, A; Cucuringu, M; Liu, Z (September 2019, ArXivorg)

Full Text Available
Dissecting the Learning Curve of Taxi Drivers: A Data-Driven Approach

https://doi.org/https://doi.org/10.1137/1.9781611975673.88

Pan, Menghai; Li, Yanhua; Zhou, Xun; Liu, Zhenming; Song, Rui; Lu, Hui; Luo, Jun (May 2019, Proceedings of the ... SIAM International Conference on Data Mining)

Full Text Available
Near-Neighbor Methods in Random Preference Completion

https://doi.org/https://doi.org/10.1609/aaai.v33i01.33014336

Liu, Ao; Wu, Qiong; Liu, Zhenming; Xia, Lirong (January 2019, Proceedings of the ... AAAI Conference on Artificial Intelligence)

This paper studies a stylized, yet natural, learning-to-rank problem and points out the critical incorrectness of a widely used nearest neighbor algorithm. We consider a model with n agents (users) {xi}i∈[n] and m alternatives (items) {yl}l∈[m], each of which is associated with a latent feature vector. Agents rank items nondeterministically according to the Plackett-Luce model, where the higher the utility of an item to the agent, the more likely this item will be ranked high by the agent. Our goal is to identify near neighbors of an arbitrary agent in the latent space for prediction. We first show that the Kendall-tau distance based kNN produces incorrect results in our model. Next, we propose a new anchor-based algorithm to find neighbors of an agent. A salient feature of our algorithm is that it leverages the rankings of many other agents (the so-called “anchors”) to determine the closeness/similarities of two agents. We provide a rigorous analysis for one-dimensional latent space, and complement the theoretical results with experiments on synthetic and real datasets. The experiments confirm that the new algorithm is robust and practical.
more » « less
Full Text Available

Search for: All records