Dataset diversity: measuring and mitigating geographical bias
in image search and retrieval

Mandal, Abhishek; Leavy, Susan; Little, Suzanne

Mandal, Abhishek, Leavy, Susan and Little, Suzanne ORCID: 0000-0003-3281-3471 (2021) Dataset diversity: measuring and mitigating geographical bias in image search and retrieval. In: 1st International Workshop on Trustworthy AI for Multimedia Computing, 24 Oct 2021, Chengdu, China. ISBN 978-1-4503-8674-6

Abstract

Many popular visual datasets used to train deep neural networks for computer vision applications, especially for facial analytics, are created by retrieving images from the internet. Search engines are often used to perform this task. However, because search engines localise and personalise their results, and because of the image indexing methods they use, the retrieved images overrepresent the demographics of the region from which the query was issued. As most visual datasets are created in Western countries, they tend to have a Western-centric bias, and deep neural networks trained on these datasets tend to inherit it. Researchers studying bias in visual datasets have focused on the racial aspect of these biases; we approach the problem from a geographical perspective. In this paper, we 1) study how linguistic variations in search queries and geographical variations in the querying region affect the social and cultural aspects of retrieved images, focusing on facial analytics, 2) explore how geographical bias in image search and retrieval can cause racial, cultural and stereotypical bias in visual datasets, and 3) propose methods to mitigate such biases.

Metadata

Item Type:	Conference or Workshop Item (Paper)
Event Type:	Workshop
Refereed:	Yes
Uncontrolled Keywords:	dataset bias; computer vision fairness; visual datasets; image search and retrieval
Subjects:	Computer Science > Artificial intelligence; Computer Science > Information retrieval
DCU Faculties and Centres:	DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing; Research Institutes and Centres > INSIGHT Centre for Data Analytics
Published in:	Trustworthy AI'21: Proceedings of the 1st International Workshop on Trustworthy AI for Multimedia Computing. Association for Computing Machinery (ACM). ISBN 978-1-4503-8674-6
Publisher:	Association for Computing Machinery (ACM)
Official URL:	https://doi.org/10.1145/3475731.3484956
Copyright Information:	© 2021 The Author
Use License:	This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License.
Funders:	Science Foundation Ireland SFI/12/RC/2289_P2, cofunded by the European Regional Development Fund; <A+> Alliance / Women at the Table
ID Code:	26268
Deposited On:	22 Oct 2021 10:51 by Abhishek Mandal. Last Modified 09 Nov 2021 16:22

Documents

Full text available as:

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
825kB

DORAS | DCU Research Repository