2015
DOI: 10.1007/978-3-319-15943-0_20
|View full text |Cite
|
Sign up to set email alerts
|

Montgomery Modular Multiplication on ARM-NEON Revisited

Abstract: Abstract. Montgomery modular multiplication constitutes the "arithmetic foundation" of modern public-key cryptography with applications ranging from RSA, DSA and Diffie-Hellman over elliptic curve schemes to pairing-based cryptosystems. The increased prevalence of SIMD-type instructions in commodity processors (e.g. Intel SSE, ARM NEON) has initiated a massive body of research on vector-parallel implementations of Montgomery modular multiplication. In this paper, we introduce the Cascade Operand Scanning (COS)… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
17
0

Year Published

2016
2016
2021
2021

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 15 publications
(17 citation statements)
references
References 10 publications
0
17
0
Order By: Relevance
“…In ICISC 2014, Seo et al . introduced a novel two‐way cascade operand scanning (COS) multiplication . This method processes the partial products in a non‐conventional order to reduce the number of data dependencies in the carry propagations from the least to most significant words.…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…In ICISC 2014, Seo et al . introduced a novel two‐way cascade operand scanning (COS) multiplication . This method processes the partial products in a non‐conventional order to reduce the number of data dependencies in the carry propagations from the least to most significant words.…”
Section: Introductionmentioning
confidence: 99%
“…However, there are still two interesting topics for Montgomery algorithm on ARM‐NEON processors . First, the previous work mainly focused on multiplication not squaring.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…This approach allows one to split the computation of the interleaved Montgomery multiplication into two parts which can be computed in parallel. Note that in a follow-up work [65] by Seo, Liu, Großschädl, and Choi it is shown how to improve the performance on 2-way SIMD architectures even further. Instead of computing the two multiplications concurrently, as is presented in Section 4.1, they compute every multiplication using 2-way SIMD instructions.…”
Section: Concurrent Computing Of Montgomery Multiplicationmentioning
confidence: 94%
“…This results in a software implementation which outperforms the one presented in [17]. It would be interesting to see if these two approaches (from [17] and [65]) can be merged on platforms which support efficient 4-way SIMD instructions.…”
Section: Concurrent Computing Of Montgomery Multiplicationmentioning
confidence: 99%