AIP: A Named Entity Recognition Method Combining Glyphs and Sounds

dc.citation.issue: 6
dc.citation.volume: 21
dc.contributor.author: Liu B
dc.contributor.author: Su Z
dc.contributor.author: Qu G
dc.date.accessioned: 2026-04-08T22:07:44Z
dc.date.issued: 2022-11-12
dc.description.abstract: In recent years, a large number of Chinese electronic texts have been produced as various fields have digitized their information. Identifying specific entities in these texts has become a major research focus. Most existing methods use radicals to extract the glyph features of Chinese characters, but this approach has limitations. This paper extracts the features of Chinese characters from three aspects (glyph features, phonetic features, and character features) and improves the conventional extraction method for each kind of feature. A new named entity recognition method (AIP) is proposed that transforms Chinese characters into corresponding images for glyph feature extraction, divides pinyin into initials, vowels, and tones for phonetic feature extraction, and fine-tunes the ALBERT (A Lite BERT) model for character feature extraction, with the aim of improving model performance. The paper compares the AIP model with mainstream neural network models on Chinese named entity recognition tasks, using commonly used data sets and data sets from specific domains. The results show that AIP outperforms the related work. The F1 values on the two data sets are 94.4% and 80.5%, respectively, which validates the model's versatility.
dc.description.confidential: false
dc.edition.edition: November 2022
dc.identifier.citation: Liu B, Su Z, Qu G. (2022). AIP: A Named Entity Recognition Method Combining Glyphs and Sounds. ACM Transactions on Asian and Low Resource Language Information Processing. 21. 6.
dc.identifier.doi: 10.1145/3522736
dc.identifier.eissn: 2375-4702
dc.identifier.elements-type: journal-article
dc.identifier.issn: 2375-4699
dc.identifier.number: 127
dc.identifier.uri: https://mro.massey.ac.nz/handle/10179/74406
dc.language: English
dc.publisher: Association for Computing Machinery
dc.publisher.uri: https://dl.acm.org/doi/10.1145/3522736
dc.relation.isPartOf: ACM Transactions on Asian and Low Resource Language Information Processing
dc.rights: (c) The author/s [en]
dc.rights.license: CC BY-NC 4.0 [en]
dc.rights.uri: https://creativecommons.org/licenses/by-nc/4.0/deed.en [en]
dc.subject: Named entity recognition
dc.subject: glyph features
dc.subject: phonetic features
dc.title: AIP: A Named Entity Recognition Method Combining Glyphs and Sounds
dc.type: Journal article
pubs.elements-id: 610096
pubs.organisational-group: Other
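
The abstract mentions dividing pinyin into initials, vowels, and tones for phonetic feature extraction. The following is only a minimal sketch of what such a split could look like, not the authors' implementation. It assumes tone-numbered pinyin input (e.g. "zhong1"), which the record does not specify; the function name `split_pinyin` and the `INITIALS` table are hypothetical, and treating "y" and "w" as initials is a simplifying assumption.

```python
# Illustrative sketch only: split a tone-numbered pinyin syllable into
# (initial, final, tone). Not taken from the AIP paper.

# Standard Mandarin pinyin initials. Two-letter initials come first so they
# match before their one-letter prefixes ("zh" before "z").
# Assumption: "y" and "w" are treated as initials here.
INITIALS = ("zh", "ch", "sh", "b", "p", "m", "f", "d", "t", "n", "l",
            "g", "k", "h", "j", "q", "x", "r", "z", "c", "s", "y", "w")


def split_pinyin(syllable: str) -> tuple[str, str, int]:
    """Return (initial, final, tone) for a syllable such as "zhong1".

    The final is the part the abstract refers to as the vowel part.
    Tone 0 means no tone digit was given (neutral tone).
    """
    syllable = syllable.strip().lower()
    tone = 0
    # A trailing digit, if present, is read as the tone number.
    if syllable and syllable[-1].isdigit():
        tone = int(syllable[-1])
        syllable = syllable[:-1]
    for initial in INITIALS:
        if syllable.startswith(initial):
            return initial, syllable[len(initial):], tone
    # Zero-initial syllables such as "an" or "er" have no initial consonant.
    return "", syllable, tone


if __name__ == "__main__":
    for s in ("zhong1", "wen2", "an1", "de"):
        print(s, split_pinyin(s))
```

Each of the three parts could then be mapped to its own embedding index before being combined with glyph and character features; how AIP actually encodes and fuses them is described in the published version, not in this record.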

Files

Original bundle

Name: 610096 PDF.pdf
Size: 1.92 MB
Format: Adobe Portable Document Format
Description: Published version.pdf

License bundle

Name: license.txt
Size: 9.22 KB
Format: Plain Text