Illusion of truth: analysing and classifying COVID-19 fake
news in Brazilian Portuguese language

Endo, Patricia Takako; Santos, Guto Leoni; Eduarda de Lima Xavier, Maria; Nascimento Campos, Gleyson Rhuan; Conceição de Lima, Luciana; Silva, Ivanovitch; Egli, Antonia; Lynn, Theo

Endo, Patricia Takako ORCID: 0000-0002-9163-5583, Santos, Guto Leoni ORCID: 0000-0002-0257-4214, Eduarda de Lima Xavier, Maria, Nascimento Campos, Gleyson Rhuan ORCID: 0000-0003-1245-3106, Conceição de Lima, Luciana, Silva, Ivanovitch ORCID: 0000-0002-0116-6489, Egli, Antonia ORCID: 0000-0002-0151-0884 and Lynn, Theo ORCID: 0000-0001-9284-7580 (2022) Illusion of truth: analysing and classifying COVID-19 fake news in Brazilian Portuguese language. Big Data Cognitive Computing, 6 (2). ISSN 2504-2289

Abstract
Metadata
Downloads
Documents
Metrics

[+][-]

Abstract

Public health interventions to counter the COVID-19 pandemic have accelerated and increased digital adoption and use of the Internet for sourcing health information. Unfortunately, there is evidence to suggest that it has also accelerated and increased the spread of false information relating to COVID-19. The consequences of misinformation, disinformation and misinterpretation of health information can interfere with attempts to curb the virus, delay or result in failure to seek or continue legitimate medical treatment and adherence to vaccination, as well as interfere with sound public health policy and attempts to disseminate public health messages. While there is a significant body of literature, datasets and tools to support countermeasures against the spread of false information online in resource-rich languages such as English and Chinese, there are few such resources to support Portuguese, and Brazilian Portuguese specifically. In this study, we explore the use of machine learning and deep learning techniques to identify fake news in online communications in the Brazilian Portuguese language relating to the COVID-19 pandemic. We build a dataset of 11,382 items comprising data from January 2020 to February 2021. Exploratory data analysis suggests that fake news about the COVID-19 vaccine was prevalent in Brazil, much of it related to government communications. To mitigate the adverse impact of fake news, we analyse the impact of machine learning to detect fake news based on stop words in communications. The results suggest that stop words improve the performance of the models when keeping them within the message. Random Forest was the machine learning model with the best results, achieving 97.91% of precision, while Bi-GRU was the best deep learning model with an F1 score of 94.03%.

Metadata

Item Type:	Article (Published)
Refereed:	Yes
Additional Information:	Article number: 36
Uncontrolled Keywords:	COVID-19; fake news; health misinformation; Brazilian Portuguese language; exploratory data analysis; machine learning; deep learning
Subjects:	UNSPECIFIED
DCU Faculties and Centres:	DCU Faculties and Schools > DCU Business School
Publisher:	MDPI
Official URL:	https://doi.org/10.3390/bdcc6020036
Copyright Information:	© 2022 The Authors.
Funders:	Irish Institute of Digital Business (IIDB).
ID Code:	28137
Deposited On:	09 Mar 2023 13:11 by INVALID USER. Last Modified 09 Mar 2023 13:11

Documents

Full text available as:

Preview

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Creative Commons: Attribution 4.0
1MB

Metrics

Downloads

Downloads per month over past year

Archive Staff Only: edit this record

DORAS | DCU Research Repository

Illusion of truth: analysing and classifying COVID-19 fake news in Brazilian Portuguese language

Altmetric Badge

Dimensions Badge

Downloads