USST: A two-phase privacy-preserving framework for personalized recommendation with semi-distributed training

Zhou, Y; Liu, J; Wang, JH; Wang, J; Liu, G; Wu, D; Li, C; Yu, S

USST: A two-phase privacy-preserving framework for personalized recommendation with semi-distributed training

Zhou, Y Liu, J Wang, JH Wang, J Liu, G Wu, D Li, C Yu, S

Permalink

Publisher:: Elsevier
Publication Type:: Journal Article
Citation:: Information Sciences, 2022, 606, pp. 688-701
Issue Date:: 2022-08-01

Closed Access

	Filename	Description	Size
	1-s2.0-S0020025522005217-main.pdf	Published version	1.19 MB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Zhou, Y
dc.contributor.author	Liu, J
dc.contributor.author	Wang, JH
dc.contributor.author	Wang, J
dc.contributor.author	Liu, G
dc.contributor.author	Wu, D
dc.contributor.author	Li, C
dc.contributor.author	Yu, S https://orcid.org/0000-0003-4485-6743
dc.date.accessioned	2023-02-13T01:30:08Z
dc.date.available	2023-02-13T01:30:08Z
dc.date.issued	2022-08-01
dc.identifier.citation	Information Sciences, 2022, 606, pp. 688-701
dc.identifier.issn	0020-0255
dc.identifier.issn	1872-6291
dc.identifier.uri	http://hdl.handle.net/10453/166062
dc.description.abstract	Personalized recommendations are becoming indispensable for assisting online users in discovering items of interest. However, existing recommendation algorithms rely heavily on the collection of personal information, which poses significant privacy concerns to users. In this paper, we propose a two-phase privacy-preserving framework called user sampling and semi-distributed training (USST) for personalized recommendations, which can protect user privacy while ensuring high recommendation accuracy. In the USST framework, rather than directly training the model with all user records, a shared model is first trained with a small set of records contributed by sampled users (e.g., paid users and volunteers). This shared model is then distributed to each user, who further trains a personalized model using personal information. Thus, the USST guarantees that all unsampled users never disclose their private information. To validate the effectiveness and practicality of USST, we designed two USST-based privacy-preserving recommendation algorithms, USST-SVD and USST-NCF based on SVD and NCF algorithms, respectively. We conducted evaluations using MovieLens and Netflix Prize datasets, and the results show that, using only 20% of sampled users’ records, the recommendation accuracy of USST-based algorithms is very close to that of all users’ records. Thus, USST can significantly improve the level of privacy protection in recommender systems.
dc.language	English
dc.publisher	Elsevier
dc.relation.ispartof	Information Sciences
dc.relation.isbasedon	10.1016/j.ins.2022.05.083
dc.rights	info:eu-repo/semantics/closedAccess
dc.subject	01 Mathematical Sciences, 08 Information and Computing Sciences, 09 Engineering
dc.subject.classification	Artificial Intelligence & Image Processing
dc.title	USST: A two-phase privacy-preserving framework for personalized recommendation with semi-distributed training
dc.type	Journal Article
utslib.citation.volume	606
utslib.for	01 Mathematical Sciences
utslib.for	08 Information and Computing Sciences
utslib.for	09 Engineering
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Computer Science
utslib.copyright.status	closed_access	*
pubs.consider-herdc	false
dc.date.updated	2023-02-13T01:29:56Z
pubs.publication-status	Published
pubs.volume	606

Abstract:

Personalized recommendations are becoming indispensable for assisting online users in discovering items of interest. However, existing recommendation algorithms rely heavily on the collection of personal information, which poses significant privacy concerns to users. In this paper, we propose a two-phase privacy-preserving framework called user sampling and semi-distributed training (USST) for personalized recommendations, which can protect user privacy while ensuring high recommendation accuracy. In the USST framework, rather than directly training the model with all user records, a shared model is first trained with a small set of records contributed by sampled users (e.g., paid users and volunteers). This shared model is then distributed to each user, who further trains a personalized model using personal information. Thus, the USST guarantees that all unsampled users never disclose their private information. To validate the effectiveness and practicality of USST, we designed two USST-based privacy-preserving recommendation algorithms, USST-SVD and USST-NCF based on SVD and NCF algorithms, respectively. We conducted evaluations using MovieLens and Netflix Prize datasets, and the results show that, using only 20% of sampled users’ records, the recommendation accuracy of USST-based algorithms is very close to that of all users’ records. Thus, USST can significantly improve the level of privacy protection in recommender systems.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/166062