Towards Light Weight Object Detection System

K. C. Dharma, Venkata Ravi Kiran Dayana, Meng Lin Wu, Venkateswara Rao Cherukuri, Hau Hwang, Clayton T. Morrison

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Transformers are a popular choice for classification tasks and as backbones for object detection tasks. However, their high latency brings challenges in their adaptation to lightweight classification and object detection systems. We present an approximation of the self-attention layers used in the transformer architecture. This approximation significantly reduces the latency of the classification system while incurring minimal loss in accuracy. We also present a method that uses a transformer encoder layer for multi-resolution feature fusion for object detection. This feature fusion improves the accuracy of the state-of-the-art lightweight object detection system without significantly increasing the number of parameters. These modules can be easily integrated into existing CNN and Transformer architecture to reduce latency and increase the accuracy of the system. Finally, we provide an abstraction for the transformer architecture called Generalized Transformer (gFormer) that can guide the design of novel transformer-like architectures.

Original languageEnglish (US)
Title of host publicationInternational Workshop on Advanced Imaging Technology, IWAIT 2024
EditorsMasayuki Nakajima, Phooi Yee Lau, Jae-Gon Kim, Hiroyuki Kubo, Chuan-Yu Chang, Qian Kemao
PublisherSPIE
ISBN (Electronic)9781510679924
DOIs
StatePublished - 2024
Event2024 International Workshop on Advanced Imaging Technology, IWAIT 2024 - Langkawi, Malaysia
Duration: Jan 7 2024Jan 8 2024

Publication series

NameProceedings of SPIE - The International Society for Optical Engineering
Volume13164
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

Conference2024 International Workshop on Advanced Imaging Technology, IWAIT 2024
Country/TerritoryMalaysia
CityLangkawi
Period1/7/241/8/24

Keywords

  • object detection
  • self-attention
  • Vision transformer

ASJC Scopus subject areas

  • Electronic, Optical and Magnetic Materials
  • Condensed Matter Physics
  • Computer Science Applications
  • Applied Mathematics
  • Electrical and Electronic Engineering

Cite this