Can Today's Hardware Protect Tomorrow's Secrets?

Quantum computing transforms cybersecurity. This blog covers research on accelerating CRYSTALS-Kyber with adaptive FPGA architectures and quantum methods for high-performance post-quantum cryptography.
Like

Share this post

Choose a social network to share with, or copy the URL to share elsewhere

This is a representation of how your post may appear on social media. The actual post will vary between social networks

Explore the Research

MDPI
MDPI MDPI

Adaptive FPGA-Based Mixed-Radix NTT Architectures with Classical and Quantum Evaluation for CRYSTALS-Kyber

The imminent threat of large-scale quantum computers motivates the deployment of post-quantum cryptography (PQC). CRYSTALS-Kyber, a leading lattice-based Key Encapsulation Mechanism, relies heavily on Number Theoretic Transform (NTT) operations, which remain a major performance and resource bottleneck. This paper presents a cross-platform NTT evaluation framework for CRYSTALS-Kyber, centered on an adaptive FPGA-based mixed-radix accelerator supporting radix-2, radix-4, and radix-8 configurations, together with comparative classical implementations and exploratory quantum-circuit prototypes. Classical evaluations show that an iterative Cooley–Tukey implementation outperforms a matrix-based baseline (≈3.6× faster for the forward NTT, ≈6.3× faster for the inverse NTT). Quantum prototypes implemented in Qiskit demonstrate proof-of-concept QFT-based NTT constructions under classical simulation environments, highlighting circuit-depth growth and noise sensitivity rather than practical hardware acceleration. The proposed FPGA design, based on a Xilinx Virtex UltraScale+ platform, employs an adaptive radix controller, LUT-based twiddle management, and Montgomery/Barrett modular arithmetic. Montgomery reduction provides superior timing and area trade-offs, with an estimated Fmax of up to 231.48 MHz and only 5 DSPs for radix-2. At the same time, radix-2 offers the best resource/performance balance with a latency of approximately 32,804 cycles. The hybrid approach strikes a balance between near-term FPGA practicality and long-term quantum potential while preserving Kyber’s MLWE-based security. Experimental results and comparative analysis indicate that the adaptive design substantially reduces resource usage and timing overhead compared to recent HLS-based NTT accelerators.

Quantum Threat Is Closer Than You Think


Every day, billions rely on public-key cryptography for digital security, from online banking to healthcare records, depending on math problems today's computers can't solve. But what if tomorrow's computers can?

The rapid progress of quantum computing is turning a distant possibility into a real threat. Powerful quantum computers could crack public-key systems like RSA and ECC, endangering digital security worldwide. This has driven a major shift in cybersecurity towards post-quantum cryptography (PQC).

CRYSTALS-Kyber, chosen by NIST, is the top candidate for future key exchange security. But replacing current cryptography with quantum-resistant algorithms isn’t enough; they must also be efficient for real-world devices from cloud servers to IoT systems.

That challenge became the motivation behind our latest research.

Looking Beyond Cryptographic Algorithms

One key lesson from our research is that creating a secure cryptographic algorithm is just the first step. Successful deployment relies equally on efficient implementation and mathematical security.

In CRYSTALS-Kyber, the dominant operation is the Number Theoretic Transform (NTT), which speeds up polynomial multiplication and boosts performance but also causes the main computational bottleneck. Improving NTT efficiency can greatly enhance overall system performance.

Many studies focus on specific hardware or software; we wanted to see if a broader view reveals new insights. Instead of asking, "How can we make NTT faster?" we asked a different question:

How does the same cryptographic engine behave across classical software, modern FPGA hardware, and emerging quantum computing platforms?

Bridging Three Computing Worlds

To answer this question, we developed a unified evaluation framework that investigates NTT implementations across three computational domains.

First, we implemented classical software versions to establish algorithmic baselines and study computational complexity. Next, we designed adaptive FPGA-based hardware architectures supporting multiple mixed-radix configurations to explore performance, latency, and hardware resource utilization. Finally, we investigated proof-of-concept quantum implementations using Qiskit to understand both the opportunities and current limitations of quantum circuit realizations.

Although these implementations target different computing environments, comparing them within a common framework clarifies the trade-offs future post-quantum systems will face.

Sometimes Simpler Is Better

One interesting outcome was that increased architectural complexity doesn't always mean better performance. People often assume larger radix configurations or more sophisticated hardware will always outperform simpler designs, but our evaluation shows a more nuanced reality.

Among the investigated architectures, the adaptive radix-2 implementation combined with Montgomery modular reduction consistently provided the best balance between operating frequency, latency, and hardware resource consumption. This configuration achieved an estimated operating frequency exceeding 230 MHz while requiring remarkably few hardware resources, making it particularly attractive for practical FPGA implementations.

This finding reinforces that the most effective solution isn't always the most complex. Careful architectural optimization often provides more practical value than just increasing computational sophistication.

What About Quantum Computing?

No discussion of post-quantum cryptography is complete without considering quantum computers. While current quantum hardware can't perform full cryptographic workloads, we explored future possibilities. Using IBM's Qiskit, we developed proof-of-concept quantum circuits for key arithmetic operations needed by the number theoretic transform.

These experiments weren't meant to replace FPGA implementations but to explore practical challenges like circuit depth, qubit needs, and noise sensitivity—factors restricting current quantum tech. Understanding these guides future research as hardware improves.

More Than Performance Numbers

For us, this project represents more than another hardware optimization study.

Preparing for the quantum era requires collaboration among cryptographers, hardware engineers, and quantum researchers, integrating secure algorithms, efficient hardware, and new computational paradigms. This unified approach helps transition post-quantum cryptography from theory to practice.

Looking Ahead

Quantum computers that could break current cryptography may be years away, but the shift to quantum-resistant systems has begun. Organizations worldwide are planning their migration before that day arrives.

Research on post-quantum cryptography now focuses on making algorithms practical, efficient, and deployable across diverse modern platforms, not just inventing stronger algorithms.

We hope our work aids this effort by offering a unified view on software implementations, FPGA acceleration, and quantum feasibility. Each platform has distinct advantages and challenges, but together they shed light on secure computing in the quantum age.

The journey toward quantum-safe cybersecurity has only just begun—and it is an exciting time to be part of it.

Final Thoughts

Preparing for the quantum era requires more than new cryptographic algorithms—it demands efficient and practical implementations. Through this research, we explored how software, FPGA hardware, and quantum computing can work together to advance CRYSTALS-Kyber toward real-world deployment.

We look forward to continuing our research on scalable, high-performance, and quantum-ready cryptographic solutions for the next generation of secure digital systems.

Our Related article:  Adaptive FPGA-Based Mixed-Radix NTT Architectures with Classical and Quantum Evaluation for CRYSTALS-Kyber 

Journal: Applied Sciences, 2026. 

Published article: https://doi.org/10.3390/app16126183

ResearchGate: https://www.researchgate.net/publication/407264992

Please sign in or register for FREE

If you are a registered user on Research Communities by Springer Nature, please sign in

Follow the Topic

Quantum Communications and Cryptography
Physical Sciences > Physics and Astronomy > Quantum Physics > Quantum Communications and Cryptography
Cyber-Physical Systems
Technology and Engineering > Electrical and Electronic Engineering > Electronic Circuits and Systems > Cyber-Physical Systems
Hardware Performance and Reliability
Mathematics and Computing > Computer Science > Computer Hardware > Hardware Performance and Reliability
Data and Information Security
Mathematics and Computing > Computer Science > Data and Information Security