Carl Busart scite author profile

Carl Busart

5Publications

14Citation Statements Received

58Citation Statements Given

How they've been cited

How they cite others

Affiliations

United States Army Combat Capabilities Development Command, DEVCOM Army Research Laboratory

Publications

Order By: Most citations

TinyM2Net: A Flexible System Algorithm Co-designed Multimodal Learning Framework for Tiny Devices

Rashid¹,

Ovi²,

Busart³

et al. 2022

Preprint

View full text Add to dashboard Cite

With the emergence of Artificial Intelligence (AI), new attention has been given to implement AI algorithms on resource constrained tiny devices to expand the application domain of IoT. Multimodal Learning has recently become very popular with the classification task due to its impressive performance for both image and audio event classification. This paper presents \emph{\sys{}} - a flexible system algorithm co-designed multimodal learning framework for resource constrained tiny devices. The framework was designed to be evaluated on two different case-studies: COVID-19 detection from multimodal audio recordings and battle field object detection from multimodal images and audios. In order to compress the model to implement on tiny devices, substantial network architecture optimization and mixed precision quantization were performed (mixed 8-bit and 4-bit). \emph{\sys{}} shows that even a tiny multimodal learning model can improve the classification performance than that of any unimodal frameworks. The most compressed \emph{\sys{}} achieves 88.4\% COVID-19 detection accuracy (14.5\% improvement from unimodal base model) and 96.8\% battle field object detection accuracy (3.9\% improvement from unimodal base model). Finally, we test our \emph{\sys{}} models on a Raspberry Pi 4 to see how they perform when deployed to a resource constrained tiny device.

show abstract

Model-Architecture Co-Design for High Performance Temporal GNN Inference on FPGA

Zhou¹,

Zhang²,

Kannan³

et al. 2022

View full text Add to dashboard Cite

Model-Architecture Co-Design for High Performance Temporal GNN Inference on FPGA

Zhou¹,

Zhang²,

Kannan³

et al. 2022

Preprint

View full text Add to dashboard Cite

Temporal Graph Neural Networks (TGNNs) are powerful models to capture temporal, structural, and contextual information on temporal graphs. The generated temporal node embeddings outperform other methods in many downstream tasks. Real-world applications require high performance inference on real-time streaming dynamic graphs. However, these models usually rely on complex attention mechanisms to capture relationships between temporal neighbors. In addition, maintaining vertex memory suffers from intrinsic temporal data dependency that hinders task-level parallelism, making it inefficient on general-purpose processors. In this work, we present a novel model-architecture co-design for inference in memory-based TGNNs on FPGAs. The key modeling optimizations we propose include a light-weight method to compute attention scores and a related temporal neighbor pruning strategy to further reduce computation and memory accesses. These are holistically coupled with key hardware optimizations that leverage FPGA hardware. We replace the temporal sampler with an on-chip FIFO based hardware sampler and the time encoder with a look-up-table. We train our simplified models using knowledge distillation to ensure similar accuracy vis-á-vis the original model. Taking advantage of the model optimizations, we propose a principled hardware architecture using batching, pipelining, and prefetching techniques to further improve the performance. We also propose a hardware mechanism to ensure the chronological vertex updating without sacrificing the computation parallelism. We evaluate the performance of the proposed hardware accelerator on three real-world datasets. The proposed model reduces the computation complexity by 84% and memory accesses by 67% with less than 0.33% accuracy loss. Compared with CPU/GPU, our FPGA accelerator achieves 16.4/2.3× speedup in latency and 0.27% improvement in accuracy compared with the stateof-the-art inference algorithm. To the best of our knowledge, this is the first work that performs model-architecture co-design on memory-based Temporal Graph Neural Networks.

show abstract

TinyM$^2$Net: A Flexible System Algorithm Co-designed Multimodal Learning Framework for Tiny Devices

Rashid¹,

Ovi²,

Busart³

et al. 2022

Preprint

View full text Add to dashboard Cite

With the emergence of Artificial Intelligence (AI), new attention has been given to implement AI algorithms on resource constrained tiny devices to expand the application domain of IoT. Multimodal Learning has recently become very popular with the classification task due to its impressive performance for both image and audio event classification. This paper presents TinyM 2 Net -a flexible system algorithm co-designed multimodal learning framework for resource constrained tiny devices. The framework was designed to be evaluated on two different case-studies: COVID-19 detection from multimodal audio recordings and battle field object detection from multimodal images and audios. In order to compress the model to implement on tiny devices, substantial network architecture optimization and mixed precision quantization were performed (mixed 8-bit and 4-bit). TinyM 2 Net shows that even a tiny multimodal learning model can improve the classification performance than that of any unimodal frameworks. The most compressed TinyM 2 Net achieves 88.4% COVID-19 detection accuracy (14.5% improvement from unimodal base model) and 96.8% battle field object detection accuracy (3.9% improvement from unimodal base model). Finally, we test our TinyM 2 Net models on a Raspberry Pi 4 to see how they perform when deployed to a resource constrained tiny device.

show abstract

Accurate, Low-latency, Efficient SAR Automatic Target Recognition on FPGA

Zhang¹,

Kannan²,

Prasanna³

et al. 2022

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Carl Busart

TinyM2Net: A Flexible System Algorithm Co-designed Multimodal Learning Framework for Tiny Devices

Model-Architecture Co-Design for High Performance Temporal GNN Inference on FPGA

Model-Architecture Co-Design for High Performance Temporal GNN Inference on FPGA

TinyM$^2$Net: A Flexible System Algorithm Co-designed Multimodal Learning Framework for Tiny Devices

Accurate, Low-latency, Efficient SAR Automatic Target Recognition on FPGA

Contact Info

Product

Resources

About