Object detection remains a significant challenge in computer vision. Despite advances driven by deep learning, detection models depend heavily on large, diverse annotated training data, which often fails to cover many real-world scenarios. To bridge this gap, we use target images from the original dataset to train a specialized generator whose outputs mimic the appearance of targets across a broader spectrum of real-world conditions. Merged with the primary dataset, these synthetically generated images serve as an effective augmentation of the original training set, covering scenarios and variations previously absent. Because the generator is trained solely on the original dataset, the method requires no external data sources, making it practical in most settings. Our empirical findings show significant improvements: with a ResNet-34 backbone, the mAP of SSD rose from 0.185 to 0.233, and on small objects, Faster R-CNN with a ResNet-101 backbone improved from 0.213 to 0.225. These results underscore the method's efficacy, especially in enhancing detection of underrepresented scenarios and smaller objects.
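The merge step described above can be sketched as follows. This is a minimal illustration, not the paper's exact procedure: the function name, the sample representation, and the cap on the synthetic fraction are all assumptions introduced here for clarity.

```python
import random

def augment_with_synthetic(real_samples, synthetic_samples, max_ratio=0.5, seed=0):
    """Combine real and generator-produced samples into one training list.

    Hypothetical helper: caps synthetic samples at `max_ratio` of the
    combined set so generated images augment rather than dominate training.
    """
    rng = random.Random(seed)
    # Largest synthetic count that keeps the synthetic share <= max_ratio.
    cap = int(len(real_samples) * max_ratio / (1.0 - max_ratio))
    chosen = rng.sample(synthetic_samples, min(cap, len(synthetic_samples)))
    combined = list(real_samples) + chosen
    rng.shuffle(combined)
    return combined

# Toy annotated samples: (image id, list of (x, y, w, h) boxes).
real = [(f"real_{i:03d}.jpg", [(10, 10, 50, 50)]) for i in range(100)]
synthetic = [(f"gen_{i:03d}.jpg", [(12, 8, 48, 52)]) for i in range(300)]

train_set = augment_with_synthetic(real, synthetic, max_ratio=0.5)
print(len(train_set))  # 200: 100 real + 100 synthetic
```

In practice the combined list would feed a standard detector training loop (e.g. SSD or Faster R-CNN); only the composition of the training set changes.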