| dc.description.abstract | Multimodal recommendation systems (MMRS) aim to capture user preferences accurately by integrating users’ historical interaction behaviors with the rich multimodal features of recommended items. Prior research has primarily focused on enriching item-side representations by embedding modality features into item vectors. However, user-side modeling has remained underexplored: existing methods typically treat each modality as a monolithic entity and fail to capture the nuanced structure of user interests within modalities, which limits the model’s ability to represent intricate user preferences. To address this challenge, we propose a novel framework named USER (User-Side modality representation Enhancement for multimodal Recommendation). Specifically, our approach constructs a unified cross-modal preference representation that captures users’ co-perception behaviors across modalities. Building upon this representation, we propose a fine-grained preference mining module that extracts users’ preferences at a finer granularity and selectively emphasizes the most relevant preference factors for each modality at the token level, thereby refining the unified cross-modal preference representation to be more discriminative and modality-aware. Extensive experiments on three real-world datasets show that USER achieves notable improvements, with performance gains of 3.24%, 5.76%, and 7.08% on these datasets, respectively, underscoring the effectiveness of USER in enhancing user-side modality representation within multimodal recommendation systems. The source code and data are available at https://github.com/brave-child/USER | es |