Skip to main navigation Skip to search Skip to main content

A Lightweight Gaussian-Based Model for Fast Detection and Classification of Moving Objects

Research output: Contribution to journalConference articlepeer-review

1 Scopus citations

Abstract

Moving object detection and classification are fundamental tasks in computer vision. However, current solutions detect all objects, and then another algorithm is used to determine which objects are in motion. Furthermore, diverse solutions employ complex networks that require a lot of computational resources, unlike lightweight solutions that could lead to widespread use. We introduce TRG-Net, a unified model that can be executed on computationally limited devices to detect and classify just moving objects. This proposal is based on the Faster R-CNN architecture, MobileNetV3 as a feature extractor, and a Gaussian mixture model for a fast search of regions of interest based on motion. TRG-Net reduces the inference time by unifying moving object detection and image classification tasks, and by limiting the regions of interest to the number of moving objects. Experiments over surveillance videos and the Kitti dataset for 2D object detection show that our approach improves the inference time of Faster R-CNN (0.221 to 0.138s) using fewer parameters (18.91 M to 18.30 M) while maintaining average precision (AP=0.423). Therefore, TRG-Net achieves a balance between precision and speed, and could be applied in various real-world scenarios.

Keywords

  • Classification
  • Detection
  • Gaussian Mixture
  • Lightweight Model
  • Moving Objects

Fingerprint

Dive into the research topics of 'A Lightweight Gaussian-Based Model for Fast Detection and Classification of Moving Objects'. Together they form a unique fingerprint.

Cite this