Mandal, Abhishek, Leavy, Susan and Little, Suzanne ORCID: 0000-0003-3281-3471 (2021) Dataset diversity: measuring and mitigating geographical bias in image search and retrieval. In: 1st International Workshop on Trustworthy AI for Multimedia Computing, 24 Oct 2021, Chengdu, China. ISBN 978-1-4503-8674-6
Abstract
Many popular visual datasets used to train deep neural networks
for computer vision applications, especially for facial analytics,
are created by retrieving images from the internet. Search engines
are often used to perform this task. However, due to localisation
and personalisation of search results by the search engines along
with the image indexing method used by these search engines, the
resultant images overrepresent the demographics of the region from
where they were queried from. As most of the visual datasets are
created in western countries, they tend to have a western centric
bias and when these datasets are used to train deep neural networks,
they tend to inherit these biases. Researchers studying the issue of
bias in visual datasets have focused on the racial aspect of these
biases. We approach this from a geographical perspective. In this
paper, we 1) study how linguistic variations in search queries and
geographical variations in the querying region affect the social and
cultural aspects of retrieved images focusing on facial analytics, 2)
explore how geographical bias in image search and retrieval can
cause racial, cultural and stereotypical bias in visual datasets and
3) propose methods to mitigate such biases.
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Workshop |
Refereed: | Yes |
Uncontrolled Keywords: | dataset bias; computer vision fairness; visual datasets; image search and retrieval |
Subjects: | Computer Science > Artificial intelligence Computer Science > Information retrieval |
DCU Faculties and Centres: | DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing Research Institutes and Centres > INSIGHT Centre for Data Analytics |
Published in: | Trustworthy AI'21: Proceedings of the 1st International Workshop on Trustworthy AI for Multimedia Computing. . Association for Computing Machinery (ACM). ISBN 978-1-4503-8674-6 |
Publisher: | Association for Computing Machinery (ACM) |
Official URL: | https://doi.org/10.1145/3475731.3484956 |
Copyright Information: | © 2021 The Author |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License |
Funders: | Science Foundation Ireland SFI/12/RC/2289_P2, cofunded by the European Regional Development Fund., <A+> Alliance / Women at the Table |
ID Code: | 26268 |
Deposited On: | 22 Oct 2021 10:51 by Abhishek Mandal . Last Modified 09 Nov 2021 16:22 |
Documents
Full text available as:
Preview |
PDF
- Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
825kB |
Metrics
Altmetric Badge
Dimensions Badge
Downloads
Downloads
Downloads per month over past year
Archive Staff Only: edit this record