Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images.

Wang, X; Zhu, L; Tang, S; Fu, H; Li, P; Wu, F; Yang, Y; Zhuang, Y

Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images.

Wang, X Zhu, L Tang, S Fu, H Li, P Wu, F Yang, Y

Zhuang, Y

Permalink

Publisher:: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Publication Type:: Journal Article
Citation:: IEEE Trans Image Process, 2022, 31, pp. 1107-1119
Issue Date:: 2022

Closed Access

	Filename	Description	Size
	Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images..pdf	Published version	8.6 MB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Wang, X
dc.contributor.author	Zhu, L
dc.contributor.author	Tang, S
dc.contributor.author	Fu, H
dc.contributor.author	Li, P
dc.contributor.author	Wu, F
dc.contributor.author	Yang, Y https://orcid.org/0000-0002-0512-880X
dc.contributor.author	Zhuang, Y
dc.date.accessioned	2023-03-20T21:52:26Z
dc.date.available	2023-03-20T21:52:26Z
dc.date.issued	2022
dc.identifier.citation	IEEE Trans Image Process, 2022, 31, pp. 1107-1119
dc.identifier.issn	1057-7149
dc.identifier.issn	1941-0042
dc.identifier.uri	http://hdl.handle.net/10453/167801
dc.description.abstract	Training deep models for RGB-D salient object detection (SOD) often requires a large number of labeled RGB-D images. However, RGB-D data is not easily acquired, which limits the development of RGB-D SOD techniques. To alleviate this issue, we present a Dual-Semi RGB-D Salient Object Detection Network (DS-Net) to leverage unlabeled RGB images for boosting RGB-D saliency detection. We first devise a depth decoupling convolutional neural network (DDCNN), which contains a depth estimation branch and a saliency detection branch. The depth estimation branch is trained with RGB-D images and then used to estimate the pseudo depth maps for all unlabeled RGB images to form the paired data. The saliency detection branch is used to fuse the RGB feature and depth feature to predict the RGB-D saliency. Then, the whole DDCNN is assigned as the backbone in a teacher-student framework for semi-supervised learning. Moreover, we also introduce a consistency loss on the intermediate attention and saliency maps for the unlabeled data, as well as a supervised depth and saliency loss for labeled data. Experimental results on seven widely-used benchmark datasets demonstrate that our DDCNN outperforms state-of-the-art methods both quantitatively and qualitatively. We also demonstrate that our semi-supervised DS-Net can further improve the performance, even when using an RGB image with the pseudo depth map.
dc.format	Print-Electronic
dc.language	eng
dc.publisher	IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
dc.relation.ispartof	IEEE Trans Image Process
dc.relation.isbasedon	10.1109/TIP.2021.3139232
dc.rights	info:eu-repo/semantics/closedAccess
dc.subject	0801 Artificial Intelligence and Image Processing, 0906 Electrical and Electronic Engineering, 1702 Cognitive Sciences
dc.subject.classification	Artificial Intelligence & Image Processing
dc.subject.mesh	Attention
dc.subject.mesh	Humans
dc.subject.mesh	Neural Networks, Computer
dc.subject.mesh	Supervised Machine Learning
dc.subject.mesh	Humans
dc.subject.mesh	Attention
dc.subject.mesh	Supervised Machine Learning
dc.subject.mesh	Neural Networks, Computer
dc.subject.mesh	Attention
dc.subject.mesh	Humans
dc.subject.mesh	Neural Networks, Computer
dc.subject.mesh	Supervised Machine Learning
dc.title	Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images.
dc.type	Journal Article
utslib.citation.volume	31
utslib.location.activity	United States
utslib.for	0801 Artificial Intelligence and Image Processing
utslib.for	0906 Electrical and Electronic Engineering
utslib.for	1702 Cognitive Sciences
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
utslib.copyright.status	closed_access	*
dc.date.updated	2023-03-20T21:52:24Z
pubs.publication-status	Published
pubs.volume	31

Abstract:

Training deep models for RGB-D salient object detection (SOD) often requires a large number of labeled RGB-D images. However, RGB-D data is not easily acquired, which limits the development of RGB-D SOD techniques. To alleviate this issue, we present a Dual-Semi RGB-D Salient Object Detection Network (DS-Net) to leverage unlabeled RGB images for boosting RGB-D saliency detection. We first devise a depth decoupling convolutional neural network (DDCNN), which contains a depth estimation branch and a saliency detection branch. The depth estimation branch is trained with RGB-D images and then used to estimate the pseudo depth maps for all unlabeled RGB images to form the paired data. The saliency detection branch is used to fuse the RGB feature and depth feature to predict the RGB-D saliency. Then, the whole DDCNN is assigned as the backbone in a teacher-student framework for semi-supervised learning. Moreover, we also introduce a consistency loss on the intermediate attention and saliency maps for the unlabeled data, as well as a supervised depth and saliency loss for labeled data. Experimental results on seven widely-used benchmark datasets demonstrate that our DDCNN outperforms state-of-the-art methods both quantitatively and qualitatively. We also demonstrate that our semi-supervised DS-Net can further improve the performance, even when using an RGB image with the pseudo depth map.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/167801