NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

TyGr: Type Inference on Stripped Binaries using Graph Neural Networks

Zhu, Chang; Li, Ziyang; Xue, Anton; Bajaj, Ati Priya; Gibbs, Wil; Liu, Yibo; Alur, Rajeev; Bao, Tiffany; Dai, Hanjun; Doupé, Adam; et al (August 2024, USENIX)

Full Text Available
TYGR: type inference on stripped binaries using graph neural networks

Zhu, Chang; Li, Ziyang; Xue, Anton; Bajaj, Ati Priya; Gibbs, Wil; Liu, Yibo; Alur, Rajeev; Bao, Tiffany; Dai, Hanjun; Doupé, Adam; et al (August 2024, Proceedings of the 33rd USENIX Conference on Security Symposium)

Full Text Available
SMORE: Knowledge Graph Completion and Multi-hop Reasoning in Massive Knowledge Graphs

https://doi.org/10.1145/3534678.3539405

Ren, Hongyu; Dai, Hanjun; Dai, Bo; Chen, Xinyun; Zhou, Denny; Leskovec, Jure; Schuurmans, Dale (August 2022, Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining)

Knowledge graphs (KGs) capture knowledge in the form of head– relation–tail triples and are a crucial component in many AI systems. There are two important reasoning tasks on KGs: (1) single-hop knowledge graph completion, which involves predicting individual links in the KG; and (2), multi-hop reasoning, where the goal is to predict which KG entities satisfy a given logical query. Embedding-based methods solve both tasks by first computing an embedding for each entity and relation, then using them to form predictions. However, existing scalable KG embedding frameworks only support single-hop knowledge graph completion and cannot be applied to the more challenging multi-hop reasoning task. Here we present Scalable Multi-hOp REasoning (SMORE), the first general framework for both single-hop and multi-hop reasoning in KGs. Using a single machine SMORE can perform multi-hop reasoning in Freebase KG (86M entities, 338M edges), which is 1,500× larger than previously considered KGs. The key to SMORE’s runtime performance is a novel bidirectional rejection sampling that achieves a square root reduction of the complexity of online training data generation. Furthermore, SMORE exploits asynchronous scheduling, overlapping CPU-based data sampling, GPU-based embedding computation, and frequent CPU–GPU IO. SMORE increases throughput (i.e., training speed) over prior multi-hop KG frameworks by 2.2× with minimal GPU memory requirements (2GB for training 400-dim embeddings on 86M-node Freebase) and achieves near linear speed-up with the number of GPUs. Moreover, on the simpler single-hop knowledge graph completion task SMORE achieves comparable or even better runtime performance to state-of-the-art frameworks on both single GPU and multi-GPU settings.
more » « less
Full Text Available
CodeTrek: Flexible Modeling of Code using an Extensible Relational Representation

Pashakhanloo, Pardis; Naik, Aaditya; Wang, Yuepeng; Dai, Hanjun; Maniatis, Petros; Naik, Mayur (January 2022, International Conference on Learning Representations)

Full Text Available
Molecule optimization by explainable evolution

Chen, Binghong; Wang, Tianzhe; Li, Chengtao; Dai, Hanjun; Song, Le (January 2021, International Conference on Learning Representation (ICLR))

Optimizing molecules for desired properties is a fundamental yet challenging task in chemistry, material science, and drug discovery. This paper develops a novel algorithm for optimizing molecular properties via an Expectation- Maximization (EM) like explainable evolutionary process. The algorithm is designed to mimic human experts in the process of searching for desirable molecules and alternate between two stages: the first stage on explainable local search which identifies rationales, i.e., critical subgraph patterns accounting for desired molecular properties, and the second stage on molecule completion which explores the larger space of molecules containing good rationales. We test our approach against various baselines on a real-world multi-property optimization task where each method is given the same number of queries to the property oracle. We show that our evolution-by-explanation algorithm is 79% better than the best baseline in terms of a generic metric combining aspects such as success rate, novelty, and diversity. Human expert evaluation on optimized molecules shows that 60% of top molecules obtained from our methods are deemed successful.
more » « less
Full Text Available
Differentiable Top-k Operator with Optimal Transport

Xie, Yujia; Dai, Hanjun; Chen, Minshuo; Dai, Bo; Zhao, Tuo; Zha, Hongyuan; Wei, Wei; Pfister, Tomas. (December 2020, Advances in neural information processing systems)
null (Ed.)
Full Text Available
LEGO: Latent Execution-Guided Reasoning for Multi-Hop Question Answering on Knowledge Graphs

Ren, Hongyu; Dai, Hanjun; Dai, Bo; Chen, Xinyun; Yasunaga, Michihiro; Sun, Haitian; Schuurmans, Dale; Leskovec, Jure; Zhou, Denny (January 2021, Proceedings of Machine Learning Research)
null (Ed.)
Answering complex natural language questions on knowledge graphs (KGQA) is a challenging task. It requires reasoning with the input natural language questions as well as a massive, incomplete heterogeneous KG. Prior methods obtain an abstract structured query graph/tree from the input question and traverse the KG for answers following the query tree. However, they inherently cannot deal with missing links in the KG. Here we present LEGO, a Latent ExecutionGuided reasOning framework to handle this challenge in KGQA. LEGO works in an iterative way, which alternates between (1) a Query Synthesizer, which synthesizes a reasoning action and grows the query tree step-by-step, and (2) a Latent Space Executor that executes the reasoning action in the latent embedding space to combat against the missing information in KG. To learn the synthesizer without step-wise supervision, we design a generic latent execution guided bottom-up search procedure to find good execution traces efficiently in the vast query space. Experimental results on several KGQA benchmarks demonstrate the effectiveness of our framework compared with previous state of the art.
more » « less
Full Text Available
Code2Inv: A Deep Learning Framework for Program Verification

https://doi.org/10.1007/978-3-030-53291-8_9

Si, Xujie; Naik, Aaditya; Dai, Hanjun; Naik, Mayur; Song, Le (January 2020, Computer Aided Verification (CAV))
null (Ed.)
Full Text Available
Scan B -statistic for kernel change-point detection

https://doi.org/10.1080/07474946.2019.1686886

Li, Shuang; Xie, Yao; Dai, Hanjun; Song, Le (October 2019, Sequential Analysis)

Full Text Available
Hoppity: Learning Graph Transformations to Detect and Fix Bugs in Programs

Dinella, Elizabeth; Dai, Hanjun; Li, Ziyang; Naik, Mayur; Song, Le; Wang, Ke (January 2020, International Conference on Learning Representations (ICLR))
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records