T3SRS: Tensor Train Transformer for compressing sequential recommender systems
- Publisher: Elsevier
- Publication Type: Journal Article
- Citation: Expert Systems with Applications, 2024, 238, pp. 122260
- Issue Date: 2024-03-15
Filename | Description | Size
---|---|---
Binder1.pdf | Accepted version | 1.34 MB
This item is closed access and not available.
In recent years, attention mechanisms have gained popularity in sequential recommender systems (SRSs) because they capture dynamic user preferences efficiently. However, the over-parameterization of these models often increases the risk of overfitting. To address this challenge, we propose a Transformer model based on tensor train networks. First, we introduce a tensor train layer (TTL) that replaces the original weight matrix, reducing the space complexity of the mapping layer. Building on the TTL, we reconfigure the multi-head attention module and the position-wise feed-forward network. Finally, a tensor train layer replaces the output layer to complete the overall compression. Experimental results show that the proposed model compresses SRS parameters effectively, achieving compression rates of 76.2%–85.0% while maintaining or improving sequence recommendation performance. To our knowledge, the Tensor Train Transformer is the first model compression approach for Transformer-based SRSs, and the model is broadly applicable.
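For illustration only, the sketch below shows how a tensor train layer of this general kind can stand in for a dense mapping-layer weight in PyTorch. It is not the authors' implementation: the class name `TTLinear`, the mode shapes, and the TT-ranks are assumptions made purely for the example.

```python
# Illustrative sketch of a tensor train (TT) layer replacing a dense weight
# matrix. Mode shapes and TT-ranks below are hypothetical choices for
# demonstration, not the configuration used in the paper.
import math
import torch
import torch.nn as nn


class TTLinear(nn.Module):
    """Linear layer whose weight is stored in tensor train form.

    The dense weight of shape (prod(in_modes), prod(out_modes)) is factorized
    into d small cores G_k of shape (r_{k-1}, m_k, n_k, r_k), with
    r_0 = r_d = 1, so the parameter count falls from prod(m) * prod(n)
    to sum_k r_{k-1} * m_k * n_k * r_k.
    """

    def __init__(self, in_modes, out_modes, ranks):
        super().__init__()
        assert len(in_modes) == len(out_modes) == len(ranks) - 1
        assert ranks[0] == ranks[-1] == 1
        self.in_modes, self.out_modes = tuple(in_modes), tuple(out_modes)
        self.cores = nn.ParameterList([
            nn.Parameter(0.1 * torch.randn(ranks[k], in_modes[k],
                                           out_modes[k], ranks[k + 1]))
            for k in range(len(in_modes))
        ])

    def full_weight(self):
        # Contract the cores back into the dense weight for clarity; an
        # efficient implementation would contract the input with the cores
        # directly and never materialize this matrix.
        w = self.cores[0]                                # (1, m1, n1, r1)
        for core in self.cores[1:]:
            w = torch.tensordot(w, core, dims=([w.dim() - 1], [0]))
        w = w.squeeze(0).squeeze(-1)                     # (m1, n1, ..., md, nd)
        d = len(self.cores)
        perm = list(range(0, 2 * d, 2)) + list(range(1, 2 * d, 2))
        return w.permute(*perm).reshape(math.prod(self.in_modes),
                                        math.prod(self.out_modes))

    def forward(self, x):                                # x: (batch, prod(in_modes))
        return x @ self.full_weight()


# Example: a 1024 x 1024 mapping layer (about 1.05M dense parameters) stored
# with roughly 8.4K TT parameters; modes and ranks here are arbitrary.
layer = TTLinear(in_modes=(4, 8, 8, 4), out_modes=(4, 8, 8, 4),
                 ranks=(1, 8, 8, 8, 1))
print(sum(p.numel() for p in layer.parameters()))  # 8448
print(layer(torch.randn(2, 1024)).shape)           # torch.Size([2, 1024])
```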