Efficient Subhypergraph Matching Based on Hyperedge Features

Su, Y; Gu, Y; Wang, Z; Zhang, Y; Qin, J; Yu, G

Efficient Subhypergraph Matching Based on Hyperedge Features

Su, Y Gu, Y Wang, Z Zhang, Y

Qin, J Yu, G

Permalink

Publisher:: IEEE COMPUTER SOC
Publication Type:: Journal Article
Citation:: IEEE Transactions on Knowledge and Data Engineering, 2023, 35, (6), pp. 5808-5822
Issue Date:: 2023-06-01

Closed Access

	Filename	Description	Size
	Efficient_Subhypergraph_Matching_Based_on_Hyperedge_Features_published.pdf	Published version	1.74 MB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Su, Y
dc.contributor.author	Gu, Y
dc.contributor.author	Wang, Z
dc.contributor.author	Zhang, Y https://orcid.org/0000-0002-2674-1638
dc.contributor.author	Qin, J
dc.contributor.author	Yu, G
dc.date.accessioned	2024-03-04T00:15:23Z
dc.date.available	2024-03-04T00:15:23Z
dc.date.issued	2023-06-01
dc.identifier.citation	IEEE Transactions on Knowledge and Data Engineering, 2023, 35, (6), pp. 5808-5822
dc.identifier.issn	1041-4347
dc.identifier.issn	1558-2191
dc.identifier.uri	http://hdl.handle.net/10453/176034
dc.description.abstract	Hypergraphs consist of vertices and hyperedges that can connect multiple vertices. Since hypergraphs can effectively simulate complex intergroup relationships between entities, they have a wide range of applications such as computer vision and bioinformatics. In this paper, we study the subhypergraph matching problem, which is one of the most challenging problems in the processing of the hypergraphs. We aim to extract all subhypergraph isomorphism embeddings of a query hypergraph $q$q in a large data hypergraph $D$D. The existing methods on subgraph matching are designed for the ordinary graphs, which typically achieve the goal by three phases, i.e., filtering candidate vertices, refining candidate sets, and then enumeration final results in some matching order. However, such a design cannot be trivially extended to efficiently handle hypergraphs due to the inherent difference between ordinary graphs and hypergraphs. This motivates us to enhance the performance by exploiting hyperedge features, such as the typical intersections and inclusion relations between hyperedges. In our work, we present an efficient subhypergraph matching solution with two novel techniques, maximum hyperedge candidate filtering and co-occurrence matrix candidate refinement strategy. Maximum hyperedge candidate filtering is a filtering method based on hyperedge features, which can provide powerful pruning capability. Co-occurrence matrix candidate refinement strategy considers the high-order relationship between vertices in the hypergraph and provides an effective candidate refinement scheme to further reduce the overall search space. In order to find more effective matching order, we design a new enumeration strategy, which calculates the pseudo-isomorphic mapping set and then performs hyperedge verification. On real and synthetic data sets, we conduct extensive experiments to show our method outperforms existing methods by up to 2 orders of magnitude.
dc.language	English
dc.publisher	IEEE COMPUTER SOC
dc.relation.ispartof	IEEE Transactions on Knowledge and Data Engineering
dc.relation.isbasedon	10.1109/TKDE.2022.3160393
dc.rights	info:eu-repo/semantics/closedAccess
dc.subject	08 Information and Computing Sciences
dc.subject.classification	Information Systems
dc.subject.classification	46 Information and computing sciences
dc.title	Efficient Subhypergraph Matching Based on Hyperedge Features
dc.type	Journal Article
utslib.citation.volume	35
utslib.for	08 Information and Computing Sciences
pubs.organisational-group	University of Technology Sydney
pubs.organisational-group	University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	University of Technology Sydney/Strength - AAII - Australian Artificial Intelligence Institute
utslib.copyright.status	closed_access	*
dc.date.updated	2024-03-04T00:15:22Z
pubs.issue	6
pubs.publication-status	Published
pubs.volume	35
utslib.citation.issue	6

Abstract:

Hypergraphs consist of vertices and hyperedges that can connect multiple vertices. Since hypergraphs can effectively simulate complex intergroup relationships between entities, they have a wide range of applications such as computer vision and bioinformatics. In this paper, we study the subhypergraph matching problem, which is one of the most challenging problems in the processing of the hypergraphs. We aim to extract all subhypergraph isomorphism embeddings of a query hypergraph $q$q in a large data hypergraph $D$D. The existing methods on subgraph matching are designed for the ordinary graphs, which typically achieve the goal by three phases, i.e., filtering candidate vertices, refining candidate sets, and then enumeration final results in some matching order. However, such a design cannot be trivially extended to efficiently handle hypergraphs due to the inherent difference between ordinary graphs and hypergraphs. This motivates us to enhance the performance by exploiting hyperedge features, such as the typical intersections and inclusion relations between hyperedges. In our work, we present an efficient subhypergraph matching solution with two novel techniques, maximum hyperedge candidate filtering and co-occurrence matrix candidate refinement strategy. Maximum hyperedge candidate filtering is a filtering method based on hyperedge features, which can provide powerful pruning capability. Co-occurrence matrix candidate refinement strategy considers the high-order relationship between vertices in the hypergraph and provides an effective candidate refinement scheme to further reduce the overall search space. In order to find more effective matching order, we design a new enumeration strategy, which calculates the pseudo-isomorphic mapping set and then performs hyperedge verification. On real and synthetic data sets, we conduct extensive experiments to show our method outperforms existing methods by up to 2 orders of magnitude.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/176034