“…Perez, Stafford, Beivide, Mateo, Turel, Ayguadé and Martorell propose in Auto-tuned OpenCL kernel co-execution in OmpSs for heterogeneous systems [7] a novel extension to the OmpSs programming model to allow the co-execution of a single OpenCL kernel in several devices, including the Auto-Tune algorithm that provides adaptive load balancing strategies. Experimental results reveal that the co-execution of single kernels on all the devices in the node is beneficial in terms of performance and energy consumption, and that the proposed scheduling algorithm gives the best overall results.…”