Search for: All records

Award ID contains: 1749917

Automated neural patent landscaping in the small data regime using citations and CPC codes

https://doi.org/10.1007/s10506-025-09483-5

Islam_Erana, Tisa; Finlayson, Mark A (October 2025, Artificial Intelligence and Law)

Patent landscaping is the process of identifying all patents related to a particular technological area, and is important for assessing various aspects of the intellectual property context. Traditionally, constructing patent landscapes is intensely laborious and expensive, and the rapid expansion of patenting activity in recent decades has driven an increasing need for efficient and effective automated patent landscaping approaches. In particular, it is critical that we be able to construct patent landscapes using a minimal number of labeled examples, as labeling patents for a narrow technology area requires highly specialized (and hence expensive) technical knowledge. We present an automated neural patent landscaping system that demonstrates significantly improved performance on difficult examples (0.69 on ‘hard’ examples, versus 0.6 for previously reported systems), and also significant improvements with much less training data (overall 0.75 on as few as 24 examples). Furthermore, in evaluating such automated landscaping systems, acquiring good data is a challenge; we demonstrate a higher-quality training data generation procedure by merging the “seed/anti-seed” approach of Abood and Feltenberger (Artif Intell Law 26:103–125, 2018) with active learning to collect difficult labeled examples near the decision boundary. Using this procedure we created a new dataset of labeled AI patents for training and testing. As in prior work we compare our approach with a number of baseline systems, and we release our code and data for others to build upon. (Code and data may be downloaded from https://doi.org/10.34703/gzx1-9v95/QDLKVW. Code and data are released under the Creative Commons NC-BY 4.0 license at https://creativecommons.org/licenses/by-nc/4.0/.)
Free, publicly-accessible full text available October 4, 2026
The artificial intelligence patent dataset (AIPD) 2023 update

https://doi.org/10.1007/s10961-025-10189-8

Pairolero, Nicholas A; Giczy, Alexander V; Torres, Gerard; Islam_Erana, Tisa; Finlayson, Mark A; Toole, Andrew A (February 2025, The Journal of Technology Transfer)

The 2023 update to the Artificial Intelligence Patent Dataset (AIPD) extends the original AIPD to all United States Patent and Trademark Office (USPTO) patent documents (i.e., patents and pre-grant publications, or PGPubs) published through 2023, while incorporating an improved patent landscaping methodology to identify AI within patents and PGPubs. This new approach substitutes BERT for Patents for the Word2Vec embeddings used previously, and uses active learning to incorporate additional training data closer to the “decision boundary” between AI and not-AI to help improve predictions. We show that this new approach achieves substantially better performance than the original methodology on a set of patent documents where the two methods disagreed—on this set, the AIPD 2023 achieved precision of 68.18 percent and recall of 78.95 percent, while the original AIPD achieved 50 percent and 21.05 percent, respectively. To help researchers, practitioners, and policy-makers better understand the determinants and impacts of AI invention, we have made the AIPD 2023 publicly available on the USPTO’s economic research web page.
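The idea of adding training data close to the “decision boundary” can be illustrated with uncertainty sampling, one common form of active learning. The sketch below is not the AIPD pipeline: the function name `select_uncertain`, the 0.5 threshold, and the toy scores are all assumptions made for illustration.

```python
from typing import List, Tuple


def select_uncertain(scores: List[Tuple[str, float]], k: int) -> List[str]:
    """Return the ids of the k documents whose predicted P(AI) is closest to 0.5.

    Documents near 0.5 are the ones the classifier is least sure about, so
    labeling them is assumed to be the most informative.
    """
    ranked = sorted(scores, key=lambda item: abs(item[1] - 0.5))
    return [doc_id for doc_id, _ in ranked[:k]]


# Hypothetical classifier outputs: (document id, predicted probability of "AI").
toy_scores = [
    ("doc-a", 0.97),
    ("doc-b", 0.52),
    ("doc-c", 0.08),
    ("doc-d", 0.41),
    ("doc-e", 0.60),
]

# Pick the two most ambiguous documents to send to a human annotator.
to_label = select_uncertain(toy_scores, k=2)
print(to_label)
```

In a full loop, the newly labeled documents would be added to the training set and the classifier retrained before the next round of selection.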
Free, publicly-accessible full text available February 22, 2026
pyTLEX: A Python Library for TimeLine EXtraction

Singh, Akul; Hummer, Jared (March 2024, Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations)

pyTLEX is an implementation of the TimeLine EXtraction algorithm (TLEX; Finlayson et al., 2021) that enables users to work with TimeML annotations and perform advanced temporal analysis, offering a comprehensive suite of features. TimeML is a standardized markup language for temporal information in text. pyTLEX allows users to parse TimeML annotations, construct TimeML graphs, and execute the TLEX algorithm to effect complete timeline extraction. In contrast to previous implementations (i.e., jTLEX for Java), pyTLEX sets itself apart with a range of advanced features. It introduces a React-based visualization system, enhancing the exploration of temporal data and the comprehension of temporal connections within textual information. Furthermore, pyTLEX incorporates an algorithm for increasing connectivity in temporal graphs, which identifies graph disconnectivity and recommends links based on temporal reasoning, thus enhancing the coherence of the graph representation. Additionally, pyTLEX includes a built-in validation algorithm, ensuring compliance with TimeML annotation guidelines, which is essential for maintaining data quality and reliability. pyTLEX equips researchers and developers with an extensive toolkit for temporal analysis, and its testing across various datasets validates its accuracy and reliability.
jTLEX: a Java Library for TimeLine EXtraction

https://doi.org/10.18653/v1/2023.eacl-demo.4

Ocal, Mustafa; Singh, Akul; Hummer, Jared; Radas, Antonela; Finlayson, Mark (January 2023, Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations)

jTLEX is a programming library that provides a Java implementation of the TimeLine EXtraction algorithm (TLEX; Finlayson et al., 2021), along with utilities for programmatic manipulation of TimeML graphs. Timelines are useful for a number of natural language understanding tasks, such as question answering, cross-document event coreference, and summarization & visualization. jTLEX provides functionality for (1) parsing TimeML annotations into Java objects, (2) constructing TimeML graphs from scratch, (3) partitioning TimeML graphs into temporally connected subgraphs, (4) transforming temporally connected subgraphs into point algebra (PA) graphs, (5) extracting the exact timeline of TimeML graphs, (6) detecting inconsistent subgraphs, and (7) calculating indeterminate sections of the timeline. The library has been tested on the entire TimeBank corpus, and comes with a suite of unit tests. We release the software as open source with a free license for non-commercial use.
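Step (3), partitioning a graph into temporally connected subgraphs, can be pictured as finding connected components when temporal links are treated as undirected edges. The sketch below illustrates that general idea in Python; it is not jTLEX's or pyTLEX's API, and the function name `connected_subgraphs` and the toy event/time identifiers are hypothetical.

```python
from collections import defaultdict, deque
from typing import Dict, Iterable, List, Set, Tuple


def connected_subgraphs(
    nodes: Iterable[str], links: Iterable[Tuple[str, str]]
) -> List[Set[str]]:
    """Group nodes into components, ignoring the direction and type of each link."""
    adjacency: Dict[str, Set[str]] = defaultdict(set)
    for source, target in links:
        adjacency[source].add(target)
        adjacency[target].add(source)

    seen: Set[str] = set()
    components: List[Set[str]] = []
    for start in nodes:
        if start in seen:
            continue
        # Breadth-first search collects everything reachable from `start`.
        component = {start}
        seen.add(start)
        queue = deque([start])
        while queue:
            current = queue.popleft()
            for neighbor in adjacency[current]:
                if neighbor not in seen:
                    seen.add(neighbor)
                    component.add(neighbor)
                    queue.append(neighbor)
        components.append(component)
    return components


# Hypothetical events (e*) and time expressions (t*) with temporal links between them.
toy_nodes = ["e1", "e2", "e3", "t1", "e4"]
toy_links = [("e1", "e2"), ("e2", "t1"), ("e3", "e4")]
print(connected_subgraphs(toy_nodes, toy_links))
```

Each resulting component could then be processed independently, since no temporal link relates a node in one component to a node in another.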
Inducing Stereotypical Character Roles from Plot Structure

https://doi.org/10.18653/v1/2021.emnlp-main.39

Jahan, Labiba; Mittal, Rahul; Finlayson, Mark (November 2021, Proceedings of the 25th Conference on Empirical Methods in Natural Language Processing (EMNLP 2021))

Stereotypical character roles, also known as archetypes or dramatis personae, play an important function in narratives: they facilitate efficient communication with bundles of default characteristics and associations and ease understanding of those characters’ roles in the overall narrative. We present a fully unsupervised k-means clustering approach for learning stereotypical roles given only structural plot information. We demonstrate the technique on Vladimir Propp’s structural theory of Russian folktales (captured in the extended ProppLearner corpus, with 46 tales), showing that our approach can induce six out of seven of Propp’s dramatis personae with F1 measures of up to 0.70 (0.58 average), with an additional category for minor characters. We have explored various feature sets and variations of a cluster evaluation method. The best-performing feature set comprises plot functions, unigrams, tf-idf weights, and embeddings over coreference chain heads. Roles that are mentioned more often (Hero, Villain), or have clearly distinct plot patterns (Princess), are more strongly differentiated than less frequent or distinct roles (Dispatcher, Helper, Donor). Detailed error analysis suggests that the quality of the coreference chain and plot function annotations is critical for this task. We provide all our data and code for reproducibility.
Full Text Available
Confirming the Generalizability of a Chain-Based Animacy Detector

Jahan, Labiba; Yarlott, W. Victor; Mittal, Rahul; Finlayson, Mark A. (January 2021, 1st Workshop on Artificial Intelligence for Narratives (AI4N 2020))
Animacy is the characteristic of a referent being able to independently carry out actions in a storyworld (e.g., movement, communication). It is a necessary property of characters in stories, and so detecting animacy is an important step in automatic story understanding; it is also potentially useful for many other natural language processing tasks such as word sense disambiguation, coreference resolution, character identification, and semantic role labeling. Recent work by Jahan et al. [2018] demonstrated a new approach to detecting animacy where animacy is considered a direct property of coreference chains (and referring expressions) rather than words. In Jahan et al., they combined hand-built rules and machine learning (ML) to identify the animacy of referring expressions and used majority voting to assign the animacy of coreference chains, and reported high performance of up to 0.90 F1. In this short report we verify that the approach generalizes to two different corpora (OntoNotes and the Corpus of English Novels) and we confirmed that the hybrid model performs best, with the rule-based model in second place. Our tests apply the animacy classifier to almost twice as much data as Jahan et al.’s initial study. Our results also strongly suggest, as would be expected, the dependence of the models on coreference chain quality. We release our data and code to enable reproducibility.
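The chain-level step described above, assigning a coreference chain's animacy by majority vote over its referring expressions, can be sketched as follows. This is a minimal illustration and not the authors' released code: the function name `chain_animacy`, the tie-breaking rule (ties count as inanimate), and the toy labels are assumptions.

```python
from collections import Counter
from typing import Dict, List


def chain_animacy(expression_labels: List[bool]) -> bool:
    """Label a chain animate if more of its referring expressions are animate than not.

    Ties (including an empty chain) resolve to inanimate; this is an assumed
    convention, chosen only to make the sketch deterministic.
    """
    votes = Counter(expression_labels)
    return votes[True] > votes[False]


# Hypothetical per-expression predictions for three coreference chains
# (True = the expression was classified as animate).
toy_chains: Dict[str, List[bool]] = {
    "the old king / he / the ruler": [True, True, False],
    "the castle / it": [False, False],
    "the wind / it / he / it": [False, False, True, True],
}

for chain, labels in toy_chains.items():
    print(chain, "->", "animate" if chain_animacy(labels) else "inanimate")
```

Because the vote is taken over a whole chain, an error in the coreference chain itself (for example, merging two different referents) propagates directly into the animacy label, which is consistent with the reported dependence on chain quality.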
Full Text Available
Story Fragment Stitching: The Case of the Story of Moses

Aldawsari, Mohammed; Asgari, Ehsaneddin; Finlayson, Mark A. (January 2021, 1st Workshop on Artificial Intelligence for Narratives (AI4N 2020))
We introduce the task of story fragment stitching, which is the process of automatically aligning and merging event sequences of partial tellings of a story (i.e., story fragments). We assume that each fragment contains at least one event from the story of interest, and that every fragment shares at least one event with another fragment. We propose a graph-based unsupervised approach to solving this problem in which event mentions are represented as nodes in the graph, and the graph is compressed using a variant of model merging to combine nodes. The goal is for each node in the final graph to contain only coreferent event mentions. To find coreferent events, we use BERT contextualized embedding in conjunction with a tf-idf vector representation. Constraints on the merge compression preserve the overall timeline of the story, and the final graph represents the full story timeline. We evaluate our approach using a new annotated corpus of the partial tellings of the story of Moses found in the Quran, which we release for public use. Our approach achieves a performance of 0.63 F1 score.
Full Text Available
Distinguishing Between Foreground and Background Events in News

https://doi.org/10.18653/v1/2020.coling-main.453

Aldawsari, Mohammed; Perez, Adrian; Banisakher, Deya; Finlayson, Mark (January 2020, 28th International Conference on Computational Linguistics (COLING 2020))
Determining whether an event in a news article is a foreground or background event would be useful in many natural language processing tasks, for example, temporal relation extraction, summarization, or storyline generation. We introduce the task of distinguishing between foreground and background events in news articles as well as identifying the general temporal position of background events relative to the foreground period (past, present, future, and their combinations). We achieve good performance (0.73 F1 for background vs. foreground and temporal position, and 0.79 F1 for background vs. foreground only) on a dataset of news articles by leveraging discourse information in a featurized model. We release our implementation and annotated data for other researchers.
Full Text Available
A Straightforward Approach to Narratologically Grounded Character Identification

https://doi.org/10.18653/v1/2020.coling-main.536

Jahan, Labiba; Mittal, Rahul; Yarlott, W. Victor; Finlayson, Mark (January 2020, 28th International Conference on Computational Linguistics (COLING 2020))
One of the most fundamental elements of narrative is character: if we are to understand a narrative, we must be able to identify the characters of that narrative. Therefore, character identification is a critical task in narrative natural language understanding. Most prior work has lacked a narratologically grounded definition of character, instead relying on simplified or implicit definitions that do not capture essential distinctions between characters and other referents in narratives. In prior work we proposed a preliminary definition of character that was based in clear narratological principles: a character is an animate entity that is important to the plot. Here we flesh out this concept, demonstrate that it can be reliably annotated (0.78 Cohen’s κ), and provide annotations of 170 narrative texts, drawn from 3 different corpora, containing 1,347 character co-reference chains and 21,999 non-character chains that include 3,937 animate chains. Furthermore, we have shown that a supervised classifier using a simple set of easily computable features can effectively identify these characters (overall F1 of 0.90). A detailed error analysis shows that character identification is first and foremost affected by co-reference quality, and further, that the shorter a chain is the harder it is to effectively identify as a character. We release our code and data for the benefit of other researchers.
Full Text Available
Improving the Identification of the Discourse Function of News Article Paragraphs

https://doi.org/10.18653/v1/2020.nuse-1.3

Banisakher, Deya; Yarlott, W. Victor; Aldawsari, Mohammed; Rishe, Naphtali; Finlayson, Mark (January 2020, 1st Joint Workshop on Narrative Understanding, Storylines, and Events (NUSE 2020))
Identifying the discourse structure of documents is an important task in understanding written text. Building on prior work, we demonstrate an improved approach to automatically identifying the discourse function of paragraphs in news articles. We start with the hierarchical theory of news discourse developed by van Dijk (1988) which proposes how paragraphs function within news articles. This discourse information is a level intermediate between phrase- or sentence-sized discourse segments and document genre, characterizing how individual paragraphs convey information about the events in the storyline of the article. Specifically, the theory categorizes the relationships between narrated events and (1) the overall storyline (such as Main Events, Background, or Consequences) as well as (2) commentary (such as Verbal Reactions and Evaluations). We trained and tested a linear chain conditional random field (CRF) with new features to model van Dijk’s labels and compared it against several machine learning models presented in previous work. Our model significantly outperformed all baselines and prior approaches, achieving an average of 0.71 F1 score which represents a 31.5% improvement over the previously best-performing support vector machine model.
Full Text Available