Parallelized Radix-4 Scalable Montgomery Multipliers

David M Harris,Nathaniel Pinckney

doi:10.29292/jics.v3i1.280

Parallelized Radix-4 Scalable Montgomery Multipliers

David M Harris, Nathaniel Pinckney

Open Access

https://doi.org/10.29292/jics.v3i1.280

Copy DOI

Journal: Journal of Integrated Circuits and Systems	Publication Date: Nov 18, 2020
Citations: 16	License type: CC BY-NC-ND 4.0

#Short Ones #Processing Elements + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This paper describes a parallelized radix-4 scalable Montgomery multiplier implementation. The design does not require hardware multipliers, and uses parallelized multiplication to shorten the critical path. By left-shifting the sources rather than right-shifting the result, the latency between processing elements is shortened from two cycles to nearly one. Multiplexers are used to select precomputed products. Carry-save adders propagate carry bits before words are discarded. The new design can perform 1024-bit modular exponentiation in 9.4 ms and 256-bit exponentiation in 0.38 ms using 4997 Virtex2 4-input lookup tables, while consuming 30% fewer LUTs than a previous parallelized radix-4 design. This is comparable to radix-2 for long multiplies and nearly twice as fast for short ones.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: Journal of Integrated Circuits and Systems

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.