NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Methods and Benchmark for Detecting Cryptographic API Misuses in Python

https://doi.org/10.1109/TSE.2024.3377182

Frantz, Miles; Xiao, Ya; Pias, Tanmoy Sarkar; Meng, Na; Yao, Danfeng (May 2024, IEEE Transactions on Software Engineering)

Extensive research has been conducted to explore cryptographic API misuse in Java. However, despite the tremendous popularity of the Python language, uncovering similar issues has not been fully explored. The current static code analysis tools for Python are unable to scan the increasing complexity of the source code. This limitation decreases the analysis depth, resulting in more undetected cryptographic misuses. In this research, we propose Cryptolation, a Static Code Analysis (SCA) tool that provides security guarantees for complex Python cryptographic code. Most existing analysis tools for Python solely focus on specific Frameworks such as Django or Flask. However, using a SCA approach, Cryptolation focuses on the language and not any framework. Cryptolation performs an inter-procedural data-flow analysis to handle many Python language features through variable inference (statically predicting what the variable value is) and SCA. Cryptolation covers 59 Python cryptographic modules and can identify 18 potential cryptographic misuses that involve complex language features. In this paper, we also provide a comprehensive analysis and a state-of-the-art benchmark for understanding the Python cryptographic Application Program Interface (API) misuses and their detection. Our state-of-the-art benchmark PyCryptoBench includes 1,836 Python cryptographic test cases that cover both 18 cryptographic rules and five language features. PyCryptoBench also provides a framework for evaluating and comparing different cryptographic scanners for Python. To evaluate the performance of our proposed cryptographic Python scanner, we evaluated Cryptolation against three other state-of-the-art tools: Bandit, Semgrep, and Dlint. We evaluated these four tools using our benchmark PyCryptoBench and manual evaluation of (four Top-Ranked and 939 Un-Ranked) real-world projects. Our results reveal that, overall, Cryptolation achieved the highest precision throughout our testing; and the highest accuracy on our benchmark. Cryptolation had 100% precision on PyCryptoBench, and the highest precision on real-world projects.
more » « less
Full Text Available
Broadly Enabling KLEE to Effortlessly Find Unrecoverable Errors in Rust

https://doi.org/10.1145/3639477.3639714

Zhang, Ying; Li, Peng; Ding, Yu; Wang, Lingxiang; Williams, Dan; Meng, Na (April 2024, ACM)

Full Text Available
Measurement of Embedding Choices on Cryptographic API Completion Tasks

https://doi.org/10.1145/3625291

Xiao, Ya; Song, Wenjia; Ahmed, Salman; Ge, Xinyang; Viswanath, Bimal; Meng, Na; Yao, Danfeng Daphne (March 2024, ACM Transactions on Software Engineering and Methodology)

In this article, we conduct a measurement study to comprehensively compare the accuracy impacts of multiple embedding options in cryptographic API completion tasks. Embedding is the process of automatically learning vector representations of program elements. Our measurement focuses on design choices of three important aspects,program analysis preprocessing,token-level embedding, andsequence-level embedding. Our findings show that program analysis is necessary even under advanced embedding. The results show 36.20% accuracy improvement, on average, when program analysis preprocessing is applied to transfer bytecode sequences into API dependence paths. With program analysis and the token-level embedding training, the embeddingdep2vecimproves the task accuracy from 55.80% to 92.04%. Moreover, only a slight accuracy advantage (0.55%, on average) is observed by training the expensive sequence-level embedding compared with the token-level embedding. Our experiments also suggest the differences made by the data. In the cross-app learning setup and a data scarcity scenario, sequence-level embedding is more necessary and results in a more obvious accuracy improvement (5.10%).
more » « less
Full Text Available
SpanL: Creating Algorithms for Automatic API Misuse Detection with Program Analysis Compositions

Rahaman, Sazzadur; Frantz, Miles; Miller, Barton; Yao, Danfeng Daphne (June 2023, Springer)

Full Text Available
SpanL: Creating Algorithms for Automatic API Misuse Detection with Program Analysis Compositions

Rahaman, S; Frantz, M; Miller, B; Yao, D (June 2023, Springer)

Full Text Available
Evaluation of Static Vulnerability Detection Tools with Java Cryptographic API Benchmarks

https://doi.org/10.1109/TSE.2022.3154717

Afrose, Sharmin; Xiao, Ya; Rahaman, Sazzadur; Miller, Barton; Yao, Danfeng Daphne (February 2023, IEEE Transactions on Software Engineering)

Several studies showed that misuses of cryptographic APIs are common in real-world code (e.g., Apache projects and Android apps). There exist several open-sourced and commercial security tools that automatically screen Java programs to detect misuses. To compare their accuracy and security guarantees, we develop two comprehensive benchmarks named CryptoAPI-Bench and ApacheCryptoAPI-Bench. CryptoAPI-Bench consists of 181 unit test cases that cover basic cases, as well as complex cases, including interprocedural, field sensitive, multiple class test cases, and path sensitive data flow of misuse cases. The benchmark also includes correct cases for testing false-positive rates. The ApacheCryptoAPI-Bench consists of 121 cryptographic cases from 10 Apache projects. We evaluate four tools, namely, SpotBugs, CryptoGuard, CrySL, and another tool (anonymous) using both benchmarks. We present their performance and comparative analysis. The ApacheCryptoAPI-Bench also examines the scalability of the tools. Our benchmarks are useful for advancing state-of-the-art solutions in the space of misuse detection.
more » « less
Full Text Available
Specializing Neural Networks for Cryptographic Code Completion Applications

https://doi.org/10.1109/TSE.2023.3265362

Xiao, Ya; Song, Wenjia; Qi, Jingyuan; Viswanath, Bimal; McDaniel, Patrick; Yao, Danfeng (January 2023, IEEE Transactions on Software Engineering)

Full Text Available
Being the Developers’ Friend: Our Experience Developing a High-Precision Tool for Secure Coding

https://doi.org/10.1109/MSEC.2022.3159481

Yao, Danfeng Daphne; Rahaman, Sazzadur; Xiao, Ya; Afrose, Sharmin; Frantz, Miles; Tian, Ke; Meng, Na; Cifuentes, Cristina; Zhao, Yang; Allen, Nicholas; et al (November 2022, IEEE Security & Privacy)
Industrial Strength Static Detection for Cryptographic API Misuses

https://doi.org/10.1109/SecDev53368.2022.00022

Xiao, Ya; Zhao, Yang; Allen, Nicholas; Keynes, Nathan; Yao, Danfeng; Cifuentes, Cristina (October 2022, 2022 IEEE Secure Development Conference (SecDev))

Full Text Available
The Relevance of Classic Fuzz Testing: Have We Solved This One?

https://doi.org/10.1109/TSE.2020.3047766

Miller, Barton P.; Zhang, Mengxiao; Heymann, Elisa R. (June 2022, IEEE Transactions on Software Engineering)

Full Text Available

« Prev Next »

Search for: All records