2015
DOI: 10.1007/978-3-319-17533-1_31
|View full text |Cite
|
Sign up to set email alerts
|

Accelerating RSA with Fine-Grained Parallelism Using GPU

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

1
7
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 19 publications
(8 citation statements)
references
References 14 publications
1
7
0
Order By: Relevance
“…Emmart and Weems [7] applied one thread to a row oriented multiplication for 1024-bit modular exponentiation and a distributed model based on CIOS method for 2048bit modular exponentiation. Yang [24] used an Integrated Operand Scanning (IOS) method with single limb or two limbs for Montgomery multiplication algorithm to implement RSA-1024/2048 decryption.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Emmart and Weems [7] applied one thread to a row oriented multiplication for 1024-bit modular exponentiation and a distributed model based on CIOS method for 2048bit modular exponentiation. Yang [24] used an Integrated Operand Scanning (IOS) method with single limb or two limbs for Montgomery multiplication algorithm to implement RSA-1024/2048 decryption.…”
Section: Related Workmentioning
confidence: 99%
“…Note that RSA-1024 decryption of [22] has latency of about 150 ms, while 2048bit RSA decryption of ours reaches 21.52 ms when it reaches the throughput peak. Yang [24] reported the latency for RSA-2048 throughput of 5,244 (scaled as 31,782) is 195.27 ms (scaled as 225.60 ms). The throughput of the proposed RSA-2048 implementation is 132% performance of Yang's work, but the latency is 9.4% of their work [24].…”
Section: Proposed Versusmentioning
confidence: 99%
See 1 more Smart Citation
“…With specialized hardware that incorporates many custom modular exponentiation modules, overall execution time can be reduced significantly. Similarly, a GPU implementation is reported [82] to reach the peak throughtput [20]. On the other hand, our solution requires 449 ms for the same authentication setup.…”
Section: Complexity Analysis Of the Proposed Systemmentioning
confidence: 75%
“…On the other hand, one advantage of the proposed system from the time complexity point of view is that majority of the expensive operations (i.e., mainly modular exponentiations) can be performed in parallel. Therefore, custom ASIC [81] or GPU implementations [82] can accelerate protocol considerably. For instance, the custom modular exponentiation circuit for RSA in [81] reports 0.89 ms execution time on a circuit of 153,000 equivalent gate counts.…”
Section: Complexity Analysis Of the Proposed Systemmentioning
confidence: 99%