Performance Optimisation of Parallelized ADAS Applications in FPGA-GPU Heterogeneous Systems: A Case Study With Lane Detection

Wang, Xiebing; Huang, Kai; Knoll, Alois

doi:10.1109/tiv.2019.2938092

Cited by 10 publications

(3 citation statements)

References 39 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The proposed algorithms are validated using an estimated few scheduling metrics such as task miss rate, execution time (makespan), cluster utilization, and prediction accuracy were estimated as per the below equations. φ = µ ρ × 100 (11) where φ accuracy rate of core prediction µ denotes the predicted number of cores divided by ρ an actual number of active cores.…”

Section: Resultsmentioning

confidence: 99%

See 1 more Smart Citation

An Intelligent Task Scheduling Mechanism for Autonomous Vehicles via Deep Learning

2021

View full text Add to dashboard Cite

With the rapid development of the Internet of Things (IoT) and artificial intelligence, autonomous vehicles have received much attention in recent years. Safe driving is one of the essential concerns of self-driving cars. The main problem in providing better safe driving requires an efficient inference system for real-time task management and autonomous control. Due to limited battery life and computing power, reducing execution time and resource consumption can be a daunting process. This paper addressed these challenges and developed an intelligent task management system for IoT-based autonomous vehicles. For each task processing, a supervised resource predictor is invoked for optimal hardware cluster selection. Tasks are executed based on the earliest hyper period first (EHF) scheduler to achieve optimal task error rate and schedule length performance. The single-layer feedforward neural network (SLFN) and lightweight learning approaches are designed to distribute each task to the appropriate processor based on their emergency and CPU utilization. We developed this intelligent task management module in python and experimentally tested it on multicore SoCs (Odroid Xu4 and NVIDIA Jetson embedded platforms).Connected Autonomous Vehicles (CAV) and Internet of Medical Things (IoMT) benchmarks are used for training and testing purposes. The proposed modules are validated by observing the task miss rate, resource utilization, and energy consumption metrics compared with state-of-art heuristics. SLFN-EHF task scheduler achieved better results in an average of 98% accuracy, and in an average of 20–27% reduced in execution time and 32–45% in task miss rate metric than conventional methods.

show abstract

Section: Resultsmentioning

confidence: 99%

“…Similarly, Xiebang Wang et al adopted the HMPSoC computing system with openCL software kernels for Advance Driver-Assistance System (ADAS) operations [11]. Tosiron et al studied the role of multicore processor optimization in IoT systems.…”

Section: Introductionmentioning

confidence: 99%

An Intelligent Task Scheduling Mechanism for Autonomous Vehicles via Deep Learning

2021

View full text Add to dashboard Cite

show abstract

“…Kojima et al [ 21 ] presents an autonomous driving system consisting of lane-keeping, localization, driving planning, and obstacle avoidance that are implemented as software in the embedded processor on FPGA. Wang et al [ 22 ] propose a detailed procedure that helps guide the performance optimization of parallelized ADAS applications in an FPGA-Graphics Processing Unit (GPU) combined heterogeneous system. Kamimae et al [ 23 ] develop an SoC FPGA based on the Helmholtz Principle to control unmanned mobile vehicles for the FPGA design competition.…”

Section: Introductionmentioning

confidence: 99%

QuantLaneNet: A 640-FPS and 34-GOPS/W FPGA-Based CNN Accelerator for Lane Detection

Khai

Pham

2023

Sensors

View full text Add to dashboard Cite

Lane detection is one of the most fundamental problems in the rapidly developing field of autonomous vehicles. With the dramatic growth of deep learning in recent years, many models have achieved a high accuracy for this task. However, most existing deep-learning methods for lane detection face two main problems. First, most early studies usually follow a segmentation approach, which requires much post-processing to extract the necessary geometric information about the lane lines. Second, many models fail to reach real-time speed due to the high complexity of model architecture. To offer a solution to these problems, this paper proposes a lightweight convolutional neural network that requires only two small arrays for minimum post-processing, instead of segmentation maps for the task of lane detection. This proposed network utilizes a simple lane representation format for its output. The proposed model can achieve 93.53% accuracy on the TuSimple dataset. A hardware accelerator is proposed and implemented on the Virtex-7 VC707 FPGA platform to optimize processing time and power consumption. Several techniques, including data quantization to reduce data width down to 8-bit, exploring various loop-unrolling strategies for different convolution layers, and pipelined computation across layers, are optimized in the proposed hardware accelerator architecture. This implementation can process at 640 FPS while consuming only 10.309 W, equating to a computation throughput of 345.6 GOPS and energy efficiency of 33.52 GOPS/W.

show abstract

GPU-Accelerated Vehicle Detection for Roads

Halim

Ishak

Bakar

et al. 2021

Advanced Structured Materials

View full text Add to dashboard Cite

Performance Optimisation of Parallelized ADAS Applications in FPGA-GPU Heterogeneous Systems: A Case Study With Lane Detection

Cited by 10 publications

References 39 publications

An Intelligent Task Scheduling Mechanism for Autonomous Vehicles via Deep Learning

An Intelligent Task Scheduling Mechanism for Autonomous Vehicles via Deep Learning

QuantLaneNet: A 640-FPS and 34-GOPS/W FPGA-Based CNN Accelerator for Lane Detection

GPU-Accelerated Vehicle Detection for Roads

Contact Info

Product

Resources

About