Title Agrast-6: abridged VGG-based reflected lightweight architecture for binary segmentation of depth images captured by Kinect /
Authors Ryselis, Karolis ; Blažauskas, Tomas ; Damaševičius, Robertas ; Maskeliūnas, Rytis
DOI 10.3390/s22176354
Full Text Download
Is Part of Sensors.. Basel : MDPI. 2022, vol. 22, iss. 17, art. no. 6354, p. 1-16.. ISSN 1424-8220
Keywords [eng] binary segmentation ; convolutional neural network ; depth images
Abstract [eng] Binary object segmentation is a sub-area of semantic segmentation that could be used for a variety of applications. Semantic segmentation models could be applied to solve binary segmentation problems by introducing only two classes, but the models to solve this problem are more complex than actually required. This leads to very long training times, since there are usually tens of millions of parameters to learn in this category of convolutional neural networks (CNNs). This article introduces a novel abridged VGG-16 and SegNet-inspired reflected architecture adapted for binary segmentation tasks. The architecture has 27 times fewer parameters than SegNet but yields 86% segmentation cross-intersection accuracy and 93% binary accuracy. The proposed architecture is evaluated on a large dataset of depth images collected using the Kinect device, achieving an accuracy of 99.25% in human body shape segmentation and 87% in gender recognition tasks.
Published Basel : MDPI
Type Journal article
Language English
Publication date 2022
CC license CC license description