Segmentation over detection via optimal sparse reconstructions

W. Xia, C. Domokos, J. Xiong, L. Cheong, S. Yan, IEEE Transactions on Circuits and Systems for Video Technology 25 (2015) 1295–1308.

Journal Article | Published | English
Author
Wei Xia; Csaba Domokos; Junjun Xiong; Loongfah Cheong; Shuicheng Yan
Abstract
This paper addresses the problem of semantic segmentation, where the possible class labels are drawn from a predefined set. We exploit top-down guidance, i.e., the coarse localization of the objects and their class labels provided by object detectors. For each detected bounding box, figure-ground segmentation is performed, and the final result is obtained by merging the figure-ground segmentations. The main idea of the proposed approach, which was presented in our preliminary work, is to reformulate the figure-ground segmentation problem as a sparse reconstruction that pursues the object mask in a nonparametric manner. The latent segmentation mask should be coherent subject to sparse error caused by intra-category diversity; thus, the object mask is inferred by making use of sparse representations over the training set. To handle local spatial deformations, local patch-level masks are also considered and inferred by sparse representations over the spatially nearby patches. The sparse reconstruction coefficients and the latent mask are alternately optimized by applying the Lasso algorithm and the accelerated proximal gradient method. The proposed formulation results in a convex optimization problem; thus, the globally optimal solution is achieved. In this paper, we provide a theoretical analysis of the convergence and optimality. We also give an extended numerical analysis of the proposed algorithm and a comprehensive comparison with related semantic segmentation methods on the challenging PASCAL Visual Object Classes (VOC) object segmentation datasets and the Weizmann horse dataset. The experimental results demonstrate that the proposed algorithm achieves competitive performance when compared with the state of the art.
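The abstract names two standard tools, the Lasso and the accelerated proximal gradient method. The following is a minimal sketch of how they fit together in the generic case, not the paper's actual formulation: it solves a plain Lasso problem, min_a 0.5 * ||y - D a||^2 + lam * ||a||_1, with an accelerated proximal gradient iteration (FISTA). The dictionary D (imagined here as vectorized training masks stored column-wise), the target y, the weight lam, and all function names are assumptions made for illustration.

```python
import numpy as np


def soft_threshold(v, t):
    """Proximal operator of t * ||.||_1 (element-wise shrinkage)."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)


def lasso_objective(D, y, a, lam):
    """Value of 0.5 * ||D a - y||^2 + lam * ||a||_1."""
    return 0.5 * np.sum((D @ a - y) ** 2) + lam * np.sum(np.abs(a))


def fista_lasso(D, y, lam, n_iter=500):
    """Hypothetical helper: sparse coefficients via accelerated proximal gradient."""
    # Lipschitz constant of the gradient of the smooth term: ||D||_2^2.
    lipschitz = np.linalg.norm(D, 2) ** 2
    a = np.zeros(D.shape[1])
    z = a.copy()  # extrapolated point
    t = 1.0       # momentum parameter
    for _ in range(n_iter):
        grad = D.T @ (D @ z - y)
        a_next = soft_threshold(z - grad / lipschitz, lam / lipschitz)
        t_next = (1.0 + np.sqrt(1.0 + 4.0 * t * t)) / 2.0
        z = a_next + ((t - 1.0) / t_next) * (a_next - a)
        a, t = a_next, t_next
    return a


if __name__ == "__main__":
    # Synthetic data only; nothing here comes from the paper's experiments.
    rng = np.random.default_rng(0)
    D = rng.standard_normal((64, 20))
    a_true = np.zeros(20)
    a_true[[2, 7, 11]] = [1.0, -0.5, 0.8]
    y = D @ a_true
    a_hat = fista_lasso(D, y, lam=0.1)
    print("objective:", lasso_objective(D, y, a_hat, 0.1))
```

Because the objective is convex, any minimizer the iteration converges to is a global one, which is the property the abstract relies on. The paper's alternating scheme additionally updates the latent mask, which this sketch omits.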
Publishing Year
2015
Date Published
2015-08-01
Journal Title
IEEE Transactions on Circuits and Systems for Video Technology
Volume
25
Issue
8
Page
1295–1308

Cite this

Xia W, Domokos C, Xiong J, Cheong L, Yan S. Segmentation over detection via optimal sparse reconstructions. IEEE Transactions on Circuits and Systems for Video Technology. 2015;25(8):1295-1308. doi:10.1109/TCSVT.2014.2379972
Xia, W., Domokos, C., Xiong, J., Cheong, L., & Yan, S. (2015). Segmentation over detection via optimal sparse reconstructions. IEEE Transactions on Circuits and Systems for Video Technology, 25(8), 1295–1308. https://doi.org/10.1109/TCSVT.2014.2379972
Xia, Wei, Csaba Domokos, Junjun Xiong, Loongfah Cheong, and Shuicheng Yan. “Segmentation over Detection via Optimal Sparse Reconstructions.” IEEE Transactions on Circuits and Systems for Video Technology 25, no. 8 (2015): 1295–1308. https://doi.org/10.1109/TCSVT.2014.2379972.
W. Xia, C. Domokos, J. Xiong, L. Cheong, and S. Yan, “Segmentation over detection via optimal sparse reconstructions,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 25, no. 8, pp. 1295–1308, 2015.
Xia W, Domokos C, Xiong J, Cheong L, Yan S. 2015. Segmentation over detection via optimal sparse reconstructions. IEEE Transactions on Circuits and Systems for Video Technology. 25(8), 1295–1308.
Xia, Wei, et al. “Segmentation over Detection via Optimal Sparse Reconstructions.” IEEE Transactions on Circuits and Systems for Video Technology, vol. 25, no. 8, IEEE, 2015, pp. 1295–308, doi:10.1109/TCSVT.2014.2379972.
