Learning deep kernels for non-parametric two-sample tests

Liu, F; Xu, W; Lu, J; Zhang, G; Gretton, A; Sutherland, DJ

Publisher: MLR
Publication Type: Conference Proceeding
Citation: 37th International Conference on Machine Learning, ICML 2020, 2020, 119 PartF168147-9, pp. 6272-6282
Issue Date: 2020-01-01

This item is open access.

Full metadata record

Field	Value	Language
dc.contributor.author	Liu, F https://orcid.org/0000-0002-5005-9129
dc.contributor.author	Xu, W
dc.contributor.author	Lu, J https://orcid.org/0000-0003-0690-4732
dc.contributor.author	Zhang, G https://orcid.org/0000-0003-3960-0583
dc.contributor.author	Gretton, A
dc.contributor.author	Sutherland, DJ
dc.date	2020-07-13
dc.date.accessioned	2021-06-06T00:47:52Z
dc.date.available	2021-06-06T00:47:52Z
dc.date.issued	2020-01-01
dc.identifier.citation	37th International Conference on Machine Learning, ICML 2020, 2020, 119 PartF168147-9, pp. 6272-6282
dc.identifier.isbn	9781713821120
dc.identifier.uri	http://hdl.handle.net/10453/149428
dc.description.abstract	We propose a class of kernel-based two-sample tests, which aim to determine whether two sets of samples are drawn from the same distribution. Our tests are constructed from kernels parameterized by deep neural nets, trained to maximize test power. These tests adapt to variations in distribution smoothness and shape over space, and are especially suited to high dimensions and complex data. By contrast, the simpler kernels used in prior kernel testing work are spatially homogeneous, and adaptive only in lengthscale. We explain how this scheme includes popular classifier-based two-sample tests as a special case, but improves on them in general. We provide the first proof of consistency for the proposed adaptation method, which applies both to kernels on deep features and to simpler radial basis kernels or multiple kernel learning. In experiments, we establish the superior performance of our deep kernels in hypothesis testing on benchmark and real-world data.
dc.language	en
dc.publisher	MLR
dc.relation.ispartof	37th International Conference on Machine Learning, ICML 2020
dc.relation.ispartof	International Conference on Machine Learning
dc.relation.ispartofseries	Proceedings of Machine Learning Research
dc.rights	info:eu-repo/semantics/openAccess
dc.title	Learning deep kernels for non-parametric two-sample tests
dc.type	Conference Proceeding
utslib.citation.volume	119 PartF168147-9
utslib.location.activity	Virtual
utslib.for	0801 Artificial Intelligence and Image Processing
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAII - Australian Artificial Intelligence Institute
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Computer Science
utslib.copyright.status	open_access	*
pubs.consider-herdc	false
dc.date.updated	2021-06-06T00:47:48Z
pubs.finish-date	2020-07-18
pubs.place-of-publication	USA
pubs.publication-status	Published
pubs.start-date	2020-07-13
pubs.volume	119 PartF168147-9
dc.location	USA

Abstract:

We propose a class of kernel-based two-sample tests, which aim to determine whether two sets of samples are drawn from the same distribution. Our tests are constructed from kernels parameterized by deep neural nets, trained to maximize test power. These tests adapt to variations in distribution smoothness and shape over space, and are especially suited to high dimensions and complex data. By contrast, the simpler kernels used in prior kernel testing work are spatially homogeneous, and adaptive only in lengthscale. We explain how this scheme includes popular classifier-based two-sample tests as a special case, but improves on them in general. We provide the first proof of consistency for the proposed adaptation method, which applies both to kernels on deep features and to simpler radial basis kernels or multiple kernel learning. In experiments, we establish the superior performance of our deep kernels in hypothesis testing on benchmark and real-world data.
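
For orientation, the kind of fixed-kernel test that the abstract contrasts with the deep-kernel approach can be sketched as below. This is a minimal illustration, not the authors' method: it uses a single Gaussian kernel rather than one parameterized by a neural network, and the median-heuristic bandwidth, the permutation-based p-value, and every function name (`gaussian_kernel_matrix`, `mmd2_unbiased`, `mmd_permutation_test`) are assumptions chosen for this example rather than details taken from the record.

```python
import math
import random


def gaussian_kernel_matrix(points, bandwidth):
    """Gaussian kernel values for every pair of pooled points."""
    n = len(points)
    scale = 2.0 * bandwidth ** 2
    return [
        [
            math.exp(-sum((a - b) ** 2 for a, b in zip(points[i], points[j])) / scale)
            for j in range(n)
        ]
        for i in range(n)
    ]


def median_pairwise_distance(points):
    """Median Euclidean distance between distinct points (assumed bandwidth rule)."""
    dists = sorted(
        math.dist(points[i], points[j])
        for i in range(len(points))
        for j in range(i + 1, len(points))
    )
    return dists[len(dists) // 2]


def mmd2_unbiased(k, idx_x, idx_y):
    """Unbiased estimate of squared MMD from a precomputed kernel matrix."""
    n, m = len(idx_x), len(idx_y)
    xx = sum(k[i][j] for i in idx_x for j in idx_x if i != j) / (n * (n - 1))
    yy = sum(k[i][j] for i in idx_y for j in idx_y if i != j) / (m * (m - 1))
    xy = sum(k[i][j] for i in idx_x for j in idx_y) / (n * m)
    return xx + yy - 2.0 * xy


def mmd_permutation_test(x, y, n_perms=200, seed=0):
    """Return (statistic, p_value) for the null hypothesis that x and y share a distribution."""
    rng = random.Random(seed)
    pooled = list(x) + list(y)
    n = len(x)
    # Fall back to bandwidth 1.0 if the median distance is zero (e.g. many ties).
    bandwidth = median_pairwise_distance(pooled) or 1.0
    k = gaussian_kernel_matrix(pooled, bandwidth)
    indices = list(range(len(pooled)))
    observed = mmd2_unbiased(k, indices[:n], indices[n:])
    exceed = 0
    for _ in range(n_perms):
        # Reassign pooled points to the two groups at random.
        rng.shuffle(indices)
        if mmd2_unbiased(k, indices[:n], indices[n:]) >= observed:
            exceed += 1
    # The +1 terms count the observed statistic as one of the permutations.
    return observed, (exceed + 1) / (n_perms + 1)


if __name__ == "__main__":
    data_rng = random.Random(1)
    sample_x = [(data_rng.gauss(0, 1), data_rng.gauss(0, 1)) for _ in range(30)]
    sample_y = [(data_rng.gauss(2, 1), data_rng.gauss(2, 1)) for _ in range(30)]
    statistic, p_value = mmd_permutation_test(sample_x, sample_y)
    print(f"MMD^2 estimate: {statistic:.4f}, permutation p-value: {p_value:.4f}")
```

In this sketch the only tunable quantity is the kernel bandwidth, which is the sense in which such kernels are "adaptive only in lengthscale". The approach described in the abstract instead learns the kernel itself, through a neural network trained to maximize test power.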

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/149428