Search for: All records

Creators/Authors contains: "Liang, P."

« Prev Next »

Total Resources

10

Resource Type
Conference Paper

9

Conference Proceeding

0

Dataset

0

Journal Article

1

Workshop Report

0

Availability
Full Text / Resource Available

10

Citation Only

0

Save Results
Excel (limit 2000)
CSV (limit 5000)
XML (limit 5000)

Have feedback or suggestions for a way to improve these results?
!

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

GreaseLM: Graph REASoning Enhanced Language Models for Question Answering

https://doi.org/10.48550/arXiv.2201.08860

Zhang, X ; Bosselut, A ; Yasunaga, M ; Ren, H ; Liang, P ; Manning, C ; Leskovec, J ( January 2022 , International Conference on Representation Learning (ICLR))

Answering complex questions about textual narratives requires reasoning over both stated context and the world knowledge that underlies it. However, pretrained language models (LM), the foundation of most modern QA systems, do not robustly represent latent relationships between concepts, which is necessary for reasoning. While knowledge graphs (KG) are often used to augment LMs with structured representations of world knowledge, it remains an open question how to effectively fuse and reason over the KG representations and the language context, which provides situational constraints and nuances. In this work, we propose GreaseLM, a new model that fuses encoded representations from pretrained LMs and graph neural networks over multiple layers of modality interaction operations. Information from both modalities propagates to the other, allowing language context representations to be grounded by structured world knowledge, and allowing linguistic nuances (e.g., negation, hedging) in the context to inform the graph representations of knowledge. Our results on three benchmarks in the commonsense reasoning (i.e., CommonsenseQA, OpenbookQA) and medical question answering (i.e., MedQA-USMLE) domains demonstrate that GreaseLM can more reliably answer questions that require reasoning over both situational constraints and structured knowledge, even outperforming models 8x larger.
more » « less
Full Text Available
Multiple Mechanisms in Proton-Induced Nucleon Removal at ∼100 MeV/Nucleon

https://doi.org/10.1103/PhysRevLett.130.172501

Pohl, T. ; Sun, Y. L. ; Obertelli, A. ; Lee, J. ; Gómez-Ramos, M. ; Ogata, K. ; Yoshida, K. ; Cai, B. S. ; Yuan, C. X. ; Brown, B. A. ; et al ( April 2023 , Physical Review Letters)

Full Text Available
Towards Understanding and Mitigating Social Biases in Language Models

Liang, P. P. ; Wu, C. ; Morency, L-P. ; Salakhutdino, R ( January 2021 , Proceedings of the 38th International Conference on Machine Learning)

Full Text Available
MultiBench: Multiscale Benchmarks for Multimodal Representation Learning

Liang, P. ; Lyu, Y. ; Fan, X. ; Wu, Z. ; Cheng, Y ; Wu, J. ; Chen, L.Y. ; Wu, P. ; Lee, M.A. ; Zhu, Y. ; et al ( January 2021 , In Proceedings of the Neural Information Processing Systems Conference (Neurips))

Full Text Available
Strategies For Pre-training Graph Neural Networks

Hu, W. ; Liu, B. ; Gomes, J. ; Zitnik, M. ; Liang, P. ; Pande, V. ; Leskovec, J. ( April 2020 , International Conference on Learning Representations (ICLR))

Many applications of machine learning require a model to make accurate predictions on test examples that are distributionally different from training ones, while task-specific labels are scarce during training. An effective approach to this challenge is to pre-train a model on related tasks where data is abundant, and then fine-tune it on a downstream task of interest. While pre-training has been effective in many language and vision domains, it remains an open question how to effectively use pre-training on graph datasets. In this paper, we develop a new strategy and self-supervised methods for pre-training Graph Neural Networks (GNNs). The key to the success of our strategy is to pre-train an expressive GNN at the level of individual nodes as well as entire graphs so that the GNN can learn useful local and global representations simultaneously. We systematically study pre-training on multiple graph classification datasets. We find that naïve strategies, which pre-train GNNs at the level of either entire graphs or individual nodes, give limited improvement and can even lead to negative transfer on many downstream tasks. In contrast, our strategy avoids negative transfer and improves generalization significantly across downstream tasks, leading up to 9.4% absolute improvements in ROC-AUC over non-pre-trained models and achieving state-of-the-art performance for molecular property prediction and protein function prediction.
more » « less
Full Text Available
MOSEAS: A Multimodal Language Dataset for Spanish, Portuguese, German and French.

Zadeh, A. ; Cao, Y. ; Liang, P. ; Poria, S. ; Morency, L.-P. ( January 2020 , Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP))
null (Ed.)
Full Text Available
Selection via Proxy: Efficient Data Selection for Deep Learning

Coleman, C. ; Yeh, C. ; Mussmann, S. ; Mirzasoleiman, B. ; Bailis, P. ; Liang, P. ; Leskovec, J. ; Zaharia, M. ( April 2020 , International Conference on Learning Representations (ICLR))

Data selection methods, such as active learning and core-set selection, are useful tools for machine learning on large datasets. However, they can be prohibitively expensive to apply in deep learning because they depend on feature representations that need to be learned. In this work, we show that we can greatly improve the computational efficiency by using a small proxy model to perform data selection (e.g., selecting data points to label for active learning). By removing hidden layers from the target model, using smaller architectures, and training for fewer epochs, we create proxies that are an order of magnitude faster to train. Although these small proxy models have higher error rates, we find that they empirically provide useful signals for data selection. We evaluate this “selection via proxy” (SVP) approach on several data selection tasks across five datasets: CIFAR10, CIFAR100, ImageNet, Amazon Review Polarity, and Amazon Review Full. For active learning, applying SVP can give an order of magnitude improvement in data selection runtime (i.e., the time it takes to repeatedly train and select points) without significantly increasing the final error (often within 0.1%). For core-set selection on CIFAR10, proxies that are over 10 faster to train than their larger, more accurate targets can remove up to 50% of the data without harming the final accuracy of the target, leading to a 1:6 end-to-end training time improvement.
more » « less
Full Text Available
MOSEAS: A Multimodal Language Dataset for Spanish, Portuguese, German and French.

Zadeh, A. ; Cao, Y. ; Hessner, S. ; Liang, P. ; Poria, S. ; Morency, L.-P. ( January 2020 , Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020)
null (Ed.)
Full Text Available
Towards Debiasing Sentence Representations

Liang, P. P. ; Li, I. ; Zheng, E. ; Lim, Y. ; Salakhutdinov, R. ; Morency, L.-P. ( January 2020 , Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL))
null (Ed.)
Full Text Available
Deep Gamblers: Learning to Abstain with Portfolio Theory.

Ziyin, L ; Wang, Z ; Liang, P ; Salakhutdinov, R ; Morency, L ; Ueda, M ( January 2019 , Proceedings of the Neural Information Processing Systems Conference)

Full Text Available