Detecting Soccer Balls with Reduced Neural Networks: A Comparison of Multiple Architectures Under Constrained Hardware Scenarios

dc.contributor.authorMENEGUETTI, D. D. R.
dc.contributor.authorHOMEM, T. P. D.
dc.contributor.authorDE OLIVEIRA, J. H. R.
dc.contributor.authorSILVA, I. J.
dc.contributor.authorPERICO, D. H.
dc.contributor.authorReinaldo Bianchi
dc.contributor.authorOrcidhttps://orcid.org/0000-0001-9097-827X
dc.date.accessioned2022-01-12T21:54:09Z
dc.date.available2022-01-12T21:54:09Z
dc.date.issued2021-03-05
dc.description.abstract© 2021, The Author(s), under exclusive licence to Springer Nature B.V. part of Springer Nature.Object detection techniques that achieve state-of-the-art detection accuracy employ convolutional neural networks, implemented to have lower latency in graphics processing units. Some hardware systems, such as mobile robots, operate under constrained hardware situations, but still benefit from object detection capabilities. Multiple network models have been proposed, achieving comparable accuracy with reduced architectures and leaner operations. Motivated by the need to create a near real-time object detection system for a soccer team of mobile robots operating with x86 CPU-only embedded computers, this work analyses the average precision and inference time of multiple object detection systems in a constrained hardware setting. We train open implementations of MobileNetV2 and MobileNetV3 models with different underlying architectures, achieved by changing their input and width multipliers, as well as YOLOv3, TinyYOLOv3, YOLOv4 and TinyYOLOv4 in an annotated image dataset captured using a mobile robot. We emphasize the speed/accuracy trade-off in the models by reporting their average precision on a test data set and their inference time in videos at different resolutions, under constrained and unconstrained hardware configurations. Results show that MobileNetV3 models have a good trade-off between average precision and inference time in constrained scenarios only, while MobileNetV2 with high width multipliers are appropriate for server-side inference. YOLO models in their official implementations are not suitable for inference in CPUs.
dc.description.issuenumber3
dc.description.volume101
dc.identifier.citationMENEGUETTI, D. D. R.; HOMEM, T. P. D.; DE OLIVEIRA, J. H. R.; SILVA, I. J.; PERICO, D. H.; BIANCHI, R. Detecting Soccer Balls with Reduced Neural Networks: A Comparison of Multiple Architectures Under Constrained Hardware Scenarios. Journal of Intelligent and Robotic Systems: Theory and Applications, v. 101, n. 3, March, 2021.
dc.identifier.doi10.1007/s10846-021-01336-y
dc.identifier.issn1573-0409
dc.identifier.urihttps://repositorio.fei.edu.br/handle/FEI/3584
dc.relation.ispartofJournal of Intelligent and Robotic Systems: Theory and Applications
dc.rightsAcesso Restrito
dc.subject.otherlanguageConstrained hardware
dc.subject.otherlanguageConvolutional neural networks
dc.subject.otherlanguageMobile robots
dc.subject.otherlanguageObject detection
dc.titleDetecting Soccer Balls with Reduced Neural Networks: A Comparison of Multiple Architectures Under Constrained Hardware Scenarios
dc.typeArtigo
fei.scopus.citations5
fei.scopus.eid2-s2.0-85101940364
fei.scopus.subjectDetection accuracy
fei.scopus.subjectDetection capability
fei.scopus.subjectDifferent resolutions
fei.scopus.subjectEmbedded computers
fei.scopus.subjectHardware configurations
fei.scopus.subjectMultiple-object detections
fei.scopus.subjectObject detection systems
fei.scopus.subjectOpen implementation
fei.scopus.updated2024-07-01
fei.scopus.urlhttps://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85101940364&origin=inward
Arquivos
Coleções