More than Minorities and Majorities: Understanding Multilateral Bias in Language Generation

Publisher:
Association for Computational Linguistics (ACL)
Publication Type:
Conference Proceeding
Citation:
Proceedings of the Annual Meeting of the Association for Computational Linguistics, 2024, pp. 9987-10001
Issue Date:
2024-01-01
Pretrained models learned from real corpora often capture undesirable features, leading to bias against different demographic groups. Most existing work on bias dataset construction or bias mitigation focuses on a single pair of demographic groups for a given bias, e.g., Black vs. White for racial bias. In real-world applications, however, more than two demographic groups are often at risk of the same bias. In this paper, we propose to analyze and reduce biases across multiple demographic groups. We collect and build a multi-demographic bias dataset covering five commonly discussed bias dimensions. To mitigate multi-demographic bias, we adopt several novel debiasing methods, both regularization-based and augmentation-based, together with evaluation metrics suited to measuring multi-demographic bias. Experimental results on the proposed multi-demographic dataset show that a fairer model can be achieved with a multi-demographic debiasing approach. Moreover, a model debiased with the proposed multi-demographic methods transfers better to unseen demographics without sacrificing the performance of the pretrained model.
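The abstract does not spell out the debiasing methods, so the Python below is only a rough, non-authoritative sketch of two plausible instances of what it describes: counterfactual augmentation that rewrites a sentence into every other group along the same bias dimension, and a variance-style regularizer over per-group losses. All identifiers here (RELIGION_GROUPS, augment_multi_demographic, fairness_regularizer) are hypothetical illustrations, not taken from the paper.

# Hypothetical multi-demographic counterfactual augmentation: each
# demographic term found in a sentence is rewritten into every other
# group along the same bias dimension, so the model sees all groups
# in otherwise identical contexts (not the paper's exact method).

RELIGION_GROUPS = ["Christian", "Muslim", "Jewish", "Buddhist", "Hindu"]

def augment_multi_demographic(sentence: str, groups: list[str]) -> list[str]:
    """Return one counterfactual sentence per alternative group."""
    augmented = []
    for source in groups:
        if source in sentence:
            for target in groups:
                if target != source:
                    augmented.append(sentence.replace(source, target))
    return augmented

def fairness_regularizer(group_losses: dict[str, float]) -> float:
    """Variance of per-group losses: one plausible regularization-based
    objective that penalizes unequal treatment across all groups."""
    losses = list(group_losses.values())
    mean = sum(losses) / len(losses)
    return sum((l - mean) ** 2 for l in losses) / len(losses)

if __name__ == "__main__":
    print(augment_multi_demographic("The Muslim doctor was kind.", RELIGION_GROUPS))
    print(fairness_regularizer({"Muslim": 2.1, "Jewish": 2.3, "Christian": 1.9}))

A penalty of this shape goes to zero only when all groups incur the same loss, which is one way to encode the multi-group (rather than pairwise) fairness goal the abstract emphasizes; the augmentation similarly covers all group pairs at once instead of a single minority-majority pair.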