NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Power Side-Channel Key Recovery Attack On a Hardware Implementation of BIKE

Beckwith, Luke; Zhou, Huizhen; Kaps, Jens-Peter; Gaj, Kris (December 2024, IEEE Xplore)
Hayashi, Yuichi; Cui, Aijiao (Ed.)
BIKE is a code-based Key Encapsulation Mechanism (KEM) currently under consideration for standardization by the National Institute of Standards and Technology (NIST). BIKE, along with several other candidates, is being evaluated in the fourth round of the NIST Post-Quantum Cryptography (PQC) competition. In comparison to the lattice-based candidates, relatively little effort has been focused on analyzing this algorithm for side-channel vulnerabilities, especially in hardware. There have been several works on side-channel attacks and countermeasures on software implementations of BIKE, but as of yet, there have been no works focused on hardware. This work presents the first side-channel attack on a hardware implementation of BIKE. The attack targets a public implementation of the algorithm and is able to fully recover the long-term secret key with only several dozen traces. This work reveals BIKE’s significant susceptibilities to side-channel attacks when implemented in hardware and the need for investigation of hardware countermeasures.
more » « less
Free, publicly-accessible full text available December 16, 2025
PUF-Kyber: Design of a PUF-Based Kyber Architecture Benchmarked on Diverse ARM Processors

https://doi.org/10.1109/TCAD.2024.3399669

Aghapour, Saeed; Ahmadi, Kasra; Anastasova, Mila; Mozaffari_Kermani, Mehran; Azarderakhsh, Reza (December 2024, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems)

Free, publicly-accessible full text available December 1, 2025
Efficient Error Detection Cryptographic Architectures Benchmarked on FPGAs for Montgomery Ladder

https://doi.org/10.1109/TVLSI.2024.3419700

Ahmadi, Kasra; Aghapour, Saeed; Mozaffari_Kermani, Mehran; Azarderakhsh, Reza (November 2024, IEEE Transactions on Very Large Scale Integration (VLSI) Systems)

Free, publicly-accessible full text available November 1, 2025
Universal Gaussian elimination hardware for cryptographic purposes

https://doi.org/10.1007/s13389-024-00355-3

Hu, Jingwei; Wang, Wen; Gaj, Kris; Chen, Donglong; Wang, Huaxiong (June 2024, Journal of Cryptographic Engineering)

In this paper, we investigate the possibility of performing Gaussian elimination for arbitrary binary matrices on hardware. In particular, we presented a generic approach for hardware-based Gaussian elimination, which is able to process both non-singular and singular matrices. Previous works on hardware-based Gaussian elimination can only process non-singular ones. However, a plethora of cryptosystems, for instance, quantum-safe key encapsulation mechanisms based on rank-metric codes, ROLLO and RQC, which are among NIST post-quantum cryptography standardization round-2 candidates, require performing Gaussian elimination for random matrices regardless of the singularity. We accordingly implemented an optimized and parameterized Gaussian eliminator for (singular) matrices over binary fields, making the intense computation of linear algebra feasible and efficient on hardware. To the best of our knowledge, this work solves for the first time eliminating a singular matrix on reconfigurable hardware and also describes the a generic hardware architecture for rank-code based cryptographic schemes. The experimental results suggest hardware-based Gaussian elimination can be done in linear time regardless of the matrix type.
more » « less
Full Text Available
Cryptographic Engineering a Fast and Efficient SIKE in FPGA

https://doi.org/10.1145/3584919

Elkhatib, Rami; Koziel, Brian; Azarderakhsh, Reza; Mozaffari_Kermani, Mehran (March 2024, ACM Transactions on Embedded Computing Systems)

Recent attacks have shown that SIKE is not secure and should not be used in its current state. However, this work was completed before these attacks were discovered and might be beneficial to other cryptosystems such as SQISign. The primary downside of SIKE is its performance. However, this work achieves new SIKE speed records even using less resources than the state-of-the-art. Our approach entails designing and optimizing a new field multiplier, SIKE-optimized Keccak unit, and high-level controller. On a Xilinx Virtex-7 FPGA, this architecture performs the NIST Level 1 SIKE scheme key encapsulation and key decapsulation functions in 2.23 and 2.39 ms, respectively. The combined key encapsulation and decapsulation time is 4.62 ms, which outperforms the next best Virtex-7 implementation by nearly 2 ms. Our implementation achieves speed records for the NIST Level 1, 2, and 3 parameter sets. Only our NIST Level 5 parameter set was beat by an all-out performance implementation. Our implementations also efficiently utilize the FPGA resources, achieving new records in area-time product metrics for all parameter sets. Overall, this work continues to push the bar for accelerating SIKE computations to make a stronger case for SIKE standardization.
more » « less
Full Text Available
Efficient and side-channel resistant Ed25519 on ARM Cortex-M4

https://doi.org/10.1109/TCSI.2024.3384414

Owens, Daniel; El Khatib, Rabih; Bisheh-Niasar, Mojtaba; Azarderakhsh, Reza; Mozaffari Kermani, Mehran (November 2023, DAC-SSH-SoC: Safety and Security in Heterogeneous Open System-on-Chip Platforms)

As the cryptographic community turns its focus toward post-quantum cryptography, the demand for classical cryptographic schemes such as Elliptic Curve Cryptography (ECC) remains high. In this work, we present an optimized implementation of the Edwards Curve Digital Signature Algorithm (EdDSA) operations Keygen, Sign, and Verify using the Ed25519 parameter on the ARM Cortex-M4 using optimized assembly code. We discuss the optimization of field and group arithmetic to produce high-throughput cryptographic primitives. Then, we present the first SCA-resistant implementation of the Signed Comb method, and Test Vector Leakage Assessment (TVLA) measurements. Our fastest implementation performs Ed25519 Keygen in 200,000 cycles, Sign in 240,000 cycles, and Verify in 720,000 cycles on the ARM Cortex-M4.
more » « less
Full Text Available
Hardware Accelerators for Digital Signature Algorithms Dilithium and FALCON

https://doi.org/10.1109/MDAT.2023.3305156

Beckwith, Luke; Nguyen, Duc T.; Gaj, Kris (October 2023, IEEE design test)
Bhasin, Shivam; Chattopadhyay, Anupam; Güneysu, Tim; Bhunia, Swarup (Ed.)
Digital signature algorithms are the foundation of many secure communication protocols, including those used in Internet of Things (IoT) applications. While the current generation of signature schemes is secure against classical attacks, they are potentially vulnerable to attacks using quantum computers. Because of this threat, multiple new schemes have been developed and evaluated in recent years. From among these schemes, the National Institute of Standards and Technology standardized two and selected additional three for near-term standardization. For use in IoT, these schemes must be sufficiently efficient in terms of their public-key and signature sizes and the timing of major operations. In this paper, we analyze the choice between two primary schemes considered for extensive use in IoT, CRYSTALS-Dilithium and FALCON, from the point of view of developing efficient hardware accelerators supporting cryptographic operations performed by IoT clients and servers.
more » « less
Full Text Available
A High-Performance Hardware Implementation of the LESS Digital Signature Scheme

https://doi.org/10.1007/978-3-031-40003-2_3

Beckwith, Luke; Wallace, Robert; Mohajerani, Kamyar; Gaj, Kris (August 2023, 14th International Conference on Post-Quantum Cryptography, PQCrypto 2023)
Johansson, Thomas; Smith-Tone, Daniel (Ed.)
In 2022, NIST selected the first set of four post-quantum cryptography schemes for near-term standardization. Three of them - CRYSTALS-Kyber, CRYSTALS-Dilithium, and FALCON - belong to the lattice-based family and one - SPHINCS+ - to the hash-based family. NIST has also announced an ”on-ramp” for new digital signature candidates to add greater diversity to the suite of new standards. One promising set of schemes - a subfamily of code-based cryptography - is based on the linear code equivalence problem. This well-studied problem can be used to design flexible and efficient digital signature schemes. One of these schemes, LESS, was submitted to the NIST standardization process in June 2023. In this work, we present a high-performance hardware implementation of LESS targeting Xilinx FPGAs. The obtained results are compared with those for the state-of-the-art hardware implementations of CRYSTALS-Dilithium, SPHINCS+, and FALCON.
more » « less
Full Text Available
Fast Falcon Signature Generation and Verification Using ARMv8 NEON Instructions

https://doi.org/10.1007/978-3-031-37679-5_18

Nguyen, Duc T.; Gaj Kris (July 2023, 14th International Conference on Cryptology, AFRICACRYPT 2023)
El Mrabet, N.; De Feo, L.; Duquesne, S. (Ed.)
We present our speed records for Falcon signature generation and verification on ARMv8-A architecture. Our implementations are benchmarked on Apple M1 ‘Firestorm’, Raspberry Pi 4 Cortex-A72, and Jetson AGX Xavier. Our optimized signature generation is 2x slower, but signature verification is 3–3.9x faster than the state-of-the-art CRYSTALS-Dilithium implementation on the same platforms. Faster signature verification may be particularly useful for the client side on con-strained devices. Our Falcon implementation outperforms the previous work targeting Jetson AGX Xavier by the factors 1.48x for signing in falcon512 and falcon1024, 1.52x for verifying in falcon512, and 1.70x for verifying in falcon1024. We achieve improvement in Falcon signature generation by supporting a larger subset of possible parameter values for FFT-related functions and applying our compressed twiddle-factor table to reduce memory usage. We also demonstrate that the recently proposed signature scheme Hawk, sharing optimized functionality with Falcon, has 3.3x faster signature generation and 1.6–1.9x slower signature verification when implemented on the same ARMv8 processors as Falcon.
more » « less
Full Text Available
Engineering Practical Rank-Code-Based Cryptographic Schemes on Embedded Hardware. A Case Study on ROLLO

https://doi.org/10.1109/TC.2022.3225080

Hu, Jingwei; Wang, Wen; Gaj, Kris; Wang, Liping; Wang, Huaxiong (July 2023, IEEE Transactions on Computers)

In this paper, we investigate the practical performance of rank-code based cryptography on FPGA platforms by presenting a case study on the quantum-safe KEM scheme based on LRPC codes called ROLLO, which was among NIST post-quantum cryptography standardization round-2 candidates. Specifically, we present an FPGA implementation of the encapsulation and decapsulation operations of the ROLLO KEM scheme with some variations to the original specification. The design is fully parameterized, using code-generation scripts to support a wide range of parameter choices for security levels specified in ROLLO. At the core of the ROLLO hardware, we presented a generic approach for hardware-based Gaussian elimination, which can process both non-singular and singular matrices. Previous works on hardware-based Gaussian elimination can only process non-singular ones. However, a plethora of cryptosystems, for instance, quantum-safe key encapsulation mechanisms based on rank-metric codes, ROLLO and RQC, which are among NIST post-quantum cryptography standardization round-2 candidates, require performing Gaussian elimination for random matrices regardless of the singularity. To the best of our knowledge, this work is the first hardware implementation for rank-code-based cryptographic schemes. The experimental results suggest rank-code-based schemes can be highly efficient.
more » « less
Full Text Available

« Prev Next »

Search for: All records