NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Ontology Enrichment for Effective Fine-grained Entity Typing

https://doi.org/10.1145/3637528.3671857

Ouyang, Siru; Huang, Jiaxin; Pillai, Pranav; Zhang, Yunyi; Zhang, Yu; Han, Jiawei (August 2024, ACM)
Baeza-Yates, Ricardo; Bonchi, Francesco (Ed.)
Fine-grained entity typing (FET) is the task of identifying specific entity types at a fine-grained level for entity mentions based on their contextual information. Conventional methods for FET require extensive human annotation, which is time-consuming and costly given the massive scale of data. Recent studies have been developing weakly supervised or zero-shot approaches.We study the setting of zero-shot FET where only an ontology is provided. However, most existing ontology structures lack rich supporting information and even contain ambiguous relations, making them ineffective in guiding FET. Recently developed language models, though promising in various few-shot and zero-shot NLP tasks, may face challenges in zero-shot FET due to their lack of interaction with task-specific ontology. In this study, we propose OnEFET, where we (1) enrich each node in the ontology structure with two categories of extra information: instance information for training sample augmentation and topic information to relate types with contexts, and (2) develop a coarse-to-fine typing algorithm that exploits the enriched information by training an entailment model with contrasting topics and instance-based augmented training samples. Our experiments show that OnEFET achieves high-quality fine-grained entity typing without human annotation, outperforming existing zero-shot methods by a large margin and rivaling supervised methods. OnEFET also enjoys strong transferability to unseen and finer-grained types. Code is available at https://github.com/ozyyshr/OnEFET.
more » « less
Full Text Available
Pretrained Language Representations for Text Understanding: A Weakly-Supervised Perspective

https://doi.org/10.1145/3580305.3599569

Meng, Yu; Huang, Jiaxin; Zhang, Yu; Zhang, Yunyi; Han, Jiawei (August 2023, ACM)

Full Text Available
Tuning language models as training data generators for augmentation-enhanced few-shot learning

Meng, Yu; Michalski, Martin; Huang, Jiaxin; Zhang, Yu; Abdelzaher, Tarek; Han, Jiawei (July 2023, ACM)

Full Text Available
FineSum: Target-Oriented, Fine-Grained Opinion Summarization

https://doi.org/10.1145/3539597.3570397

Ge, Suyu; Huang, Jiaxin; Meng, Yu; Han, Jiawei (February 2023, ACM)
Proc. 2023 ACM Int. Conf. on Web Search and Data Mining (Ed.)
Target-oriented opinion summarization is to profile a target by extracting user opinions from multiple related documents. Instead of simply mining opinion ratings on a target (e.g., a restaurant) or on multiple aspects (e.g., food, service) of a target, it is desirable to go deeper, to mine opinion on fine-grained sub-aspects (e.g., fish). However, it is expensive to obtain high-quality annotations at such fine-grained scale. This leads to our proposal of a new framework, FineSum, which advances the frontier of opinion analysis in three aspects: (1) minimal supervision, where no document-summary pairs are provided, only aspect names and a few aspect/sentiment keywords are available; (2) fine-grained opinion analysis, where sentiment analysis drills down to a specific subject or characteristic within each general aspect; and (3) phrase-based summarization, where short phrases are taken as basic units for summarization, and semantically coherent phrases are gathered to improve the consistency and comprehensiveness of summary. Given a large corpus with no annotation, FineSum first automatically identifies potential spans of opinion phrases, and further reduces the noise in identification results using aspect and sentiment classifiers. It then constructs multiple fine-grained opinion clusters under each aspect and sentiment. Each cluster expresses uniform opinions towards certain sub-aspects (e.g., “fish” in “food” aspect) or characteristics (e.g., “Mexican” in “food” aspect). To accomplish this, we train a spherical word embedding space to explicitly represent different aspects and sentiments. We then distill the knowledge from embedding to a contextualized phrase classifier, and perform clustering using the contextualized opinion-aware phrase embedding. Both automatic evaluations on the benchmark and quantitative human evaluation validate the effectiveness of our approach.
more » « less
Full Text Available
Generating training data with language models: Towards zero-shot language understanding

Meng, Yu; Huang, Jiaxin; Zhang, Yu; Han, Jiawei (October 2022, NeurIPS/OpenReview)

Full Text Available
Few-Shot Fine-Grained Entity Typing with Automatic Label Interpretation and Instance Generation

https://doi.org/10.1145/3534678.3539443

Huang, Jiaxin; Meng, Yu; Han, Jiawei (August 2022, KDD'22:The 28th {ACM} {SIGKDD} Conference on Knowledge Discovery and Data Mining, August 14-18, 2021)

We study the problem of few-shot Fine-grained Entity Typing (FET), where only a few annotated entity mentions with contexts are given for each entity type. Recently, prompt-based tuning has demonstrated superior performance to standard fine-tuning in few-shot scenarios by formulating the entity type classification task as a “fill-in-the-blank” problem. This allows effective utilization of the strong language modeling capability of Pre-trained Language Models (PLMs). Despite the success of current prompt-based tuning approaches, two major challenges remain: (1) the verbalizer in prompts is either manually designed or constructed from external knowledge bases, without considering the target corpus and label hierarchy information, and (2) current approaches mainly utilize the representation power of PLMs, but have not explored their generation power acquired through extensive general-domain pre-training. In this work, we propose a novel framework for fewshot FET consisting of two modules: (1) an entity type label interpretation module automatically learns to relate type labels to the vocabulary by jointly leveraging few-shot instances and the label hierarchy, and (2) a type-based contextualized instance generator produces new instances based on given instances to enlarge the training set for better generalization. On three benchmark datasets, our model outperforms existing methods by significant margins.
more » « less
Full Text Available
Adapting Pretrained Representations for Text Mining

https://doi.org/10.1145/3534678.3542607

Meng, Yu; Huang, Jiaxin; Zhang, Yu; Han, Jiawei (August 2022, ACM)

Full Text Available
Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations

https://doi.org/10.1145/3485447.3512034

Meng, Yu; Zhang, Yunyi; Huang, Jiaxin; Zhang, Yu; Han, Jiawei (April 2022, ACM)

Full Text Available
On the Power of Pre-Trained Text Representations: Models and Applications in Text Mining

https://doi.org/10.1145/3447548.3470810

Meng, Yu; Huang, Jiaxin; Zhang, Yu; Han, Jiawei (August 2021, KDD'21:The 27th {ACM} {SIGKDD} Conference on Knowledge Discovery and Data Mining, August 14-18, 2021)
null (Ed.)
Recent years have witnessed the enormous success of text representation learning in a wide range of text mining tasks. Earlier word embedding learning approaches represent words as fixed low-dimensional vectors to capture their semantics. The word embeddings so learned are used as the input features of task-specific models. Recently, pre-trained language models (PLMs), which learn universal language representations via pre-training Transformer-based neural models on large-scale text corpora, have revolutionized the natural language processing (NLP) field. Such pre-trained representations encode generic linguistic features that can be transferred to almost any text-related applications. PLMs outperform previous task-specific models in many applications as they only need to be fine-tuned on the target corpus instead of being trained from scratch. In this tutorial, we introduce recent advances in pre-trained text embeddings and language models, as well as their applications to a wide range of text mining tasks. Specifically, we first overview a set of recently developed self-supervised and weakly-supervised text embedding methods and pre-trained language models that serve as the fundamentals for downstream tasks. We then present several new methods based on pre-trained text embeddings and language models for various text mining applications such as topic discovery and text classification. We focus on methods that are weakly-supervised, domain-independent, language-agnostic, effective and scalable for mining and discovering structured knowledge from large-scale text corpora. Finally, we demonstrate with real world datasets how pre-trained text representations help mitigate the human annotation burden and facilitate automatic, accurate and efficient text analyses.
more » « less
Full Text Available
Embedding-Driven Multi-Dimensional Topic Mining and Text Analysis

https://doi.org/10.1145/3394486.3406483

Meng, Yu; Huang, Jiaxin; Han, Jiawei (August 2020, KDD:20 The 26th {ACM} {SIGKDD} Conference on Knowledge Discovery and Data Mining)
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records