Distributed open-domain answer sentence selection by federated learning
- Publication Type: Thesis
- Issue Date: 2023
Open Access
This item is open access.
Natural Language Processing (NLP) has achieved remarkable success, largely attributable to large pre-trained language models. Open-Domain Question Answering (OD-QA), a task of significant importance in industry, has likewise advanced substantially through the application of these large-scale pre-trained models. A specialized subset of Open-Domain Question Answering, Open-Domain Answer Sentence Selection (OD-AS2), seeks to answer a query with a sentence drawn from a document collection. A compelling application of this technology is deploying OD-AS2 models on edge devices such as computers and smartphones, creating a personalized, intelligent question-answering assistant built from a user's personal documents. Recently, Dense Retrieval has garnered interest from both the academic and industrial communities as a novel approach to OD-QA/OD-AS2. Dense Retrieval models play an indispensable role by striking a balance between efficiency and performance across various solution paradigms. However, their effectiveness largely depends on the availability of ample labeled positive QA pairs and a diverse range of hard negative samples during training. Fulfilling these requirements is challenging in a privacy-preserving distributed scenario, where each client possesses fewer in-domain pairs and a relatively small collection, unsuitable for effective Dense Retrieval training. To address this issue, we introduce a new deep-learning framework for Privacy-preserving Distributed OD-AS2, dubbed PDD-AS2. Drawing on the principles of Federated Learning, this framework incorporates a client-customized query encoding method for personalization and a cross-client negative sampling method, called Fed-Negative, to enhance learning effectiveness. To assess our learning framework, we first construct a novel OD-AS2 dataset, termed FedNewsQA, built on NewsQA to simulate distributed clients with data from varying genres/domains. Experimental results indicate that our learning framework outperforms baseline models and demonstrates impressive personalization capabilities.
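The abstract names cross-client negative sampling (Fed-Negative) as a key ingredient for training a dense retriever when each client holds too few in-domain pairs. The sketch below is purely illustrative and is not the thesis implementation: it assumes a standard InfoNCE-style contrastive loss commonly used for dense retrieval, and the function and tensor names (contrastive_step, cross_client_neg_emb, etc.) are hypothetical placeholders for encoder outputs shared across federated clients.

```python
# Illustrative sketch (not the thesis code): one contrastive training step for a
# dense retriever where in-batch negatives are augmented with passage/sentence
# embeddings contributed by other federated clients ("cross-client negatives").
import torch
import torch.nn.functional as F

def contrastive_step(query_emb, pos_emb, cross_client_neg_emb, temperature=0.05):
    """
    query_emb:            (B, d) query embeddings from the local client's encoder
    pos_emb:              (B, d) embeddings of the matching positive answer sentences
    cross_client_neg_emb: (N, d) negative embeddings gathered from other clients
    Returns an InfoNCE-style loss; the gold answer for each query sits on the
    diagonal of the query-vs-positive similarity block.
    """
    local_scores = query_emb @ pos_emb.T              # (B, B): in-batch negatives
    cross_scores = query_emb @ cross_client_neg_emb.T # (B, N): cross-client negatives
    logits = torch.cat([local_scores, cross_scores], dim=1) / temperature
    labels = torch.arange(query_emb.size(0))          # gold index = diagonal position
    return F.cross_entropy(logits, labels)

# Toy usage with random vectors standing in for encoder outputs.
if __name__ == "__main__":
    B, N, d = 4, 16, 8
    q = F.normalize(torch.randn(B, d), dim=-1)
    p = F.normalize(torch.randn(B, d), dim=-1)
    neg = F.normalize(torch.randn(N, d), dim=-1)
    print(contrastive_step(q, p, neg).item())
```

In this hypothetical setup, sharing only embeddings (rather than raw documents) is what would keep the negative exchange compatible with a privacy-preserving federated scenario; the actual Fed-Negative protocol may differ.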