NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Topology-aware Embedding Memory for Continual Learning on Expanding Networks

https://doi.org/10.1145/3637528.3671732

Zhang, Xikun; Song, Dongjin; Chen, Yixin; Tao, Dacheng (August 2024, ACM)

Full Text Available
Empowering Time Series Analysis with Large Language Models: A Survey

https://doi.org/10.24963/ijcai.2024/895

Jiang, Yushan; Pan, Zijie; Zhang, Xikun; Garg, Sahil; Schneider, Anderson; Nevmyvaka, Yuriy; Song, Dongjin (August 2024, International Joint Conferences on Artificial Intelligence Organization)

Recently, remarkable progress has been made over large language models (LLMs), demonstrating their unprecedented capability in varieties of natural language tasks. However, completely training a large general-purpose model from the scratch is challenging for time series analysis, due to the large volumes and varieties of time series data, as well as the non-stationarity that leads to concept drift impeding continuous model adaptation and re-training. Recent advances have shown that pre-trained LLMs can be exploited to capture complex dependencies in time series data and facilitate various applications. In this survey, we provide a systematic overview of existing methods that leverage LLMs for time series analysis. Specifically, we first state the challenges and motivations of applying language models in the context of time series as well as brief preliminaries of LLMs. Next, we summarize the general pipeline for LLM-based time series analysis, categorize existing methods into different groups (\textit{i.e.}, direct query, tokenization, prompt design, fine-tune, and model integration), and highlight the key ideas within each group. We also discuss the applications of LLMs for both general and spatial-temporal time series data, tailored to specific domains. Finally, we thoroughly discuss future research opportunities to empower time series analysis with LLMs.
more » « less
Full Text Available
Ricci Curvature-Based Graph Sparsification for Continual Graph Representation Learning

https://doi.org/10.1109/TNNLS.2023.3303454

Zhang, Xikun; Song, Dongjin; Tao, Dacheng (January 2024, IEEE Transactions on Neural Networks and Learning Systems)

Full Text Available
Hierarchical Prototype Networks for Continual Graph Representation Learning

https://doi.org/10.1109/TPAMI.2022.3186909

Zhang, Xikun; Song, Dongjin; Tao, Dacheng (April 2023, IEEE Transactions on Pattern Analysis and Machine Intelligence)

Full Text Available
Sparsified Subgraph Memory for Continual Graph Representation Learning

https://doi.org/10.1109/ICDM54844.2022.00177

Zhang, Xikun; Song, Dongjin; Tao, Dacheng (November 2022, IEEE International Conference on Data Mining (ICDM) 2022)

Full Text Available
How to build the virtual cell with artificial intelligence: Priorities and opportunities

https://doi.org/10.1016/j.cell.2024.11.015

Bunne, Charlotte; Roohani, Yusuf; Rosen, Yanay; Gupta, Ankit; Zhang, Xikun; Roed, Marcel; Alexandrov, Theo; AlQuraishi, Mohammed; Brennan, Patricia; Burkhardt, Daniel B; et al (December 2024, Cell)

Full Text Available
Deep Bidirectional Language-Knowledge Graph Pretraining

Yasunaga, Michihiro; Bosselut, Antoine; Ren, Hongyu; Zhang, Xikun; Manning, Christopher D.; Liang, Percy; Leskovec, Jure (January 2022, Advances in neural information processing systems)

Pretraining a language model (LM) on text has been shown to help various downstream NLP tasks. Recent works show that a knowledge graph (KG) can complement text data, offering structured background knowledge that provides a useful scaffold for reasoning. However, these works are not pretrained to learn a deep fusion of the two modalities at scale, limiting the potential to acquire fully joint representations of text and KG. Here we propose DRAGON (Deep Bidirectional Language-Knowledge Graph Pretraining), a self-supervised approach to pretraining a deeply joint language-knowledge foundation model from text and KG at scale. Specifically, our model takes pairs of text segments and relevant KG subgraphs as input and bidirectionally fuses information from both modalities. We pretrain this model by unifying two self-supervised reasoning tasks, masked language modeling and KG link prediction. DRAGON outperforms existing LM and LM+KG models on diverse downstream tasks including question answering across general and biomedical domains, with +5% absolute gain on average. In particular, DRAGON achieves notable performance on complex reasoning about language and knowledge (+10% on questions involving long contexts or multi-step reasoning) and low-resource QA (+8% on OBQA and RiddleSense), and new state-of-the-art results on various BioNLP tasks. Our code and trained models are available at https://github.com/michiyasunaga/dragon.
more » « less
Full Text Available
TruePIE: Discovering Reliable Patterns in Pattern-Based Information Extraction

https://doi.org/10.1145/3219819.3220017

Li, Qi; Jiang, Meng; Zhang, Xikun; Qu, Meng; Hanratty, Timothy P.; Gao, Jing; Han, Jiawei (August 2018, Proceedings of the 24th {ACM} {SIGKDD} International Conference on Knowledge Discovery {\&} Data Mining, {KDD} 2018)

Pattern-based methods have been successful in information extraction and NLP research. Previous approaches learn the quality of a textual pattern as relatedness to a certain task based on statistics of its individual content (e.g., length, frequency) and hundreds of carefully-annotated labels. However, patterns of good contentquality may generate heavily conflicting information due to the big gap between relatedness and correctness. Evaluating the correctness of information is critical in (entity, attribute, value)-tuple extraction. In thiswork,we propose a novel method, called TruePIE, that finds reliable patterns which can extract not only related but also correct information. TruePIE adopts the self-training framework and repeats the training-predicting-extracting process to gradually discover more and more reliable patterns. To better represent the textual patterns, pattern embeddings are formulated so that patterns with similar semantic meanings are embedded closely to each other. The embeddings jointly consider the local pattern information and the distributional information of the extractions. To conquer the challenge of lacking supervision on patterns’ reliability, TruePIE can automatically generate high quality training patterns based on a couple of seed patterns by applying the arity-constraints to distinguish highly reliable patterns (i.e., positive patterns) and highly unreliable patterns (i.e., negative patterns). Experiments on a huge news dataset (over 25GB) demonstrate that the proposed TruePIE significantly outperforms baseline methods on each of the three tasks: reliable tuple extraction, reliable pattern extraction, and negative pattern extraction.
more » « less
Full Text Available

Search for: All records