This paper describes the system proposed by the Random team for SemEval-2020 Task 1: Unsupervised Lexical Semantic Change Detection. We focus our approach on the detection problem. Given the semantics of words captured by temporal word embeddings in different time periods, we investigate the use of unsupervised methods to detect when the target word has gained or lost senses. To this end, we define a new algorithm based on Gaussian Mixture Models to cluster the target similarities computed over the two periods. We compare the proposed approach with a number of similarity-based thresholds. We found that, although the performance of the detection methods varies across the word embedding algorithms, the combination of Gaussian Mixture with Temporal Referencing resulted in our best system.
Science Foundation Ireland SFI 13/RC/2106, Science Foundation Ireland SFI Research Centres Programme and is co-funded under the European Regional Development Fund (ERDF) Grant # 13/RC/2106.
ID Code:
25945
Deposited On:
02 Jun 2021 10:38 by
Annalina Caputo
. Last Modified 02 Jun 2021 10:45