A label noise filtering and label missing supplement framework based on game theory

Date

2023-08-31

Publisher

Elsevier B.V. on behalf of KeAi Communications Co Ltd for the Chongqing University of Posts and Telecommunications

Rights

CC BY 4.0

Abstract

Labeled data is widely used in classification tasks. A major challenge, however, is that labels are often added manually, and wrong labels added by malicious users degrade model training. This unreliability of labeled data has hindered research. To address these problems, we propose a framework for Label Noise Filtering and Missing Label Supplement (LNFS), and we implement it using location labels in Location-Based Social Networks (LBSN) as an example. For label noise filtering, we first use FastText to transform a restaurant's labels into vectors. Then, based on the assumption that the label most similar to all other labels of the location is the most representative, we use cosine similarity to select the most representative label. For missing labels, we use a simple common-word similarity to judge the similarity of users' comments, and then use the label of the most similar restaurant to supplement the missing label. To improve the reliability of the model, we introduce game theory to simulate the game between malicious users and the model. Finally, a case study illustrates the effectiveness and reliability of LNFS.
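
The two similarity steps described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the label vectors are toy values standing in for FastText embeddings, "common word similarity" is assumed here to be the Jaccard overlap of word sets (the abstract does not define it), and the function names `most_representative_label`, `common_word_similarity` and `supplement_label` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def most_representative_label(label_vectors):
    """Return the label whose mean cosine similarity to all other labels is highest.

    label_vectors maps each label to its embedding. In the paper the embeddings
    come from FastText; here they are assumed to be precomputed.
    """
    best_label, best_score = None, -math.inf
    for label, vec in label_vectors.items():
        sims = [cosine(vec, other_vec)
                for other, other_vec in label_vectors.items() if other != label]
        score = sum(sims) / len(sims) if sims else 0.0
        if score > best_score:
            best_label, best_score = label, score
    return best_label


def common_word_similarity(comment_a, comment_b):
    """Assumed definition: Jaccard overlap of the lowercase word sets."""
    words_a = set(comment_a.lower().split())
    words_b = set(comment_b.lower().split())
    if not words_a or not words_b:
        return 0.0
    return len(words_a & words_b) / len(words_a | words_b)


def supplement_label(comment, labeled_comments):
    """Return the label of the labeled comment most similar to `comment`.

    labeled_comments is a list of (comment_text, label) pairs.
    """
    best = max(labeled_comments,
               key=lambda pair: common_word_similarity(comment, pair[0]))
    return best[1]


# Toy 2-D vectors standing in for FastText embeddings (illustrative values only).
toy_vectors = {
    "italian": [1.0, 0.0],
    "pizza": [0.8, 0.6],
    "pasta": [0.8, -0.6],
    "gym": [-1.0, 0.0],  # a noisy label, far from the others
}
labeled = [
    ("great pizza and pasta", "italian"),
    ("fresh sushi and sashimi", "japanese"),
]
representative = most_representative_label(toy_vectors)
filled = supplement_label("best pizza and pasta in town", labeled)
```

In this toy setup the outlier label `gym` has a negative mean similarity to the rest, so it is never chosen as representative; the game-theoretic component of LNFS is not covered by the sketch.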

Keywords

Label noise, FastText, Cosine similarity, Game theory, LSTM

Citation

Liu Y, Yao R, Jia S, Wang F, Wang R, Ma R, Qi L. (2023). A label noise filtering and label missing supplement framework based on game theory. Digital Communications and Networks, 9(4), 887-895.

Creative Commons license

Except where otherwise noted, this item's license is described as CC BY 4.0