“…The implementation shows up to 152-fold performance improvement when benchmarked against a reference CPU version, and good scalability over the number of GPUs. There is no previous research on the topic of using GPUs in DEC. Esqueda et al [28] mention the usage of a single GPU in their numerical experiments, but give no description of the implementation nor its computing time performance, therefore this paper aims to fill those gaps.…”