Status | 已发表Published |
Title | Area-Time Efficient Architecture of FFT-Based Montgomery Multiplication |
Creator | |
Date Issued | 2017-03-01 |
Source Publication | IEEE Transactions on Computers
![]() |
ISSN | 0018-9340 |
Volume | 66Issue:3Pages:375-388 |
Abstract | The modular multiplication operation is the most time-consuming operation for number-theoretic cryptographic algorithms involving large integers, such as RSA and Diffie-Hellman. Implementations reveal that more than 75 percent of the time is spent in the modular multiplication function within the RSA for more than 1,024-bit moduli. There are fast multiplier architectures to minimize the delay and increase the throughput using parallelism and pipelining. However such designs are large in terms of area and low in efficiency. In this paper, we integrate the fast Fourier transform (FFT) method into the McLaughlin's framework, and present an improved FFT-based Montgomery modular multiplication (MMM) algorithm achieving high area-time efficiency. Compared to the previous FFT-based designs, we inhibit the zero-padding operation by computing the modular multiplication steps directly using cyclic and nega-cyclic convolutions. Thus, we reduce the convolution length by half. Furthermore, supported by the number-theoretic weighted transform, the FFT algorithm is used to provide fast convolution computation. We also introduce a general method for efficient parameter selection for the proposed algorithm. Architectures with single and double butterfly structures are designed obtaining low area-latency solutions, which we implemented on Xilinx Virtex-6 FPGAs. The results show that our work offers a better area-latency efficiency compared to the state-of-the-art FFT-based MMM architectures from and above 1,024-bit operand sizes. We have obtained area-latency efficiency improvements up to 50.9 percent for 1,024-bit, 41.9 percent for 2,048-bit, 37.8 percent for 4,096-bit and 103.2 percent for 7,680-bit operands. Furthermore, the operating latency is also outperformed with high clock frequency for length-64 transform and above. |
Keyword | fast Fourier transform (FFT) field-programmable gate array (FPGA) Montgomery modular multiplication number-theoretic weighted transform |
DOI | 10.1109/TC.2016.2601334 |
URL | View source |
Indexed By | SCIE |
Language | 英语English |
WOS Research Area | Computer Science ; Engineering |
WOS Subject | Computer Science, Hardware & ArchitectureEngineering ; Electrical & Electronic |
WOS ID | WOS:000395629500001 |
Scopus ID | 2-s2.0-85013031137 |
Citation statistics | |
Document Type | Journal article |
Identifier | http://repository.uic.edu.cn/handle/39GCC9TT/9114 |
Collection | Research outside affiliated institution |
Corresponding Author | Chen, Donald Donglong |
Affiliation | 1.Department of Electronic Engineering,City University of Hong Kong,Kowloon,Hong Kong 2.Department of Computer Science,Universty of California Santa Barbara,Santa Barbara,93106,United States |
Recommended Citation GB/T 7714 | Dai, Wangchen,Chen, Donald Donglong,Cheung, Ray C.C.et al. Area-Time Efficient Architecture of FFT-Based Montgomery Multiplication[J]. IEEE Transactions on Computers, 2017, 66(3): 375-388. |
APA | Dai, Wangchen, Chen, Donald Donglong, Cheung, Ray C.C., & Koç, Çetin Kaya. (2017). Area-Time Efficient Architecture of FFT-Based Montgomery Multiplication. IEEE Transactions on Computers, 66(3), 375-388. |
MLA | Dai, Wangchen,et al."Area-Time Efficient Architecture of FFT-Based Montgomery Multiplication". IEEE Transactions on Computers 66.3(2017): 375-388. |
Files in This Item: | There are no files associated with this item. |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment