MFE-Transformer: Adaptive English Text Named Entity Recognition Method Based on Multi-feature Extraction and Transformer

Liuxin Gao1

  1. School of Foreign Languages, Zhengzhou University of Science and Technology
    450064 Zhengzhou, China
    publicgj@163.com

Abstract

English text named entity recognition aims to alleviate the problem of insufficient labeling data in the target domain. Existing methods usually use feature representation or model parameter sharing to realize cross-domain transfer of entity recognition capability, but there is still a lack of full utilization of structured knowledge in text sequences. Therefore, this paper proposes an adaptive English named text entity recognition method based on multi-feature extraction and transformer. Firstly, a bidirectional long term memory conditional random field entity recognition model based on BERT pre-trained language model is constructed on a generic domain dataset. In the training process, the weights of two character vectors of text words are dynamically calculated and combined, which makes the model make full use of the information in the character granularity, and the parts-of-speech information and block analysis are added as additional features. The word vectors, character-level features and additional features are spliced into the BiLSTM-CRF neural network model for training. Finally, experiments are carried out on five English datasets and specific cross-domain named entity recognition datasets respectively. The results show that the average performance of the proposed model is improved by 0.43% and 1.47% compared with the current cross-domain model, indicating that the structured knowledge in feature representation can effectively improve the entity recognition capability of the target domain.

Key words

Adaptive English named text entity recognition, multi-feature extraction, transformer, domain invariant knowledge

Digital Object Identifier (DOI)

https://doi.org/10.2298/CSIS240418061G

Publication information

Volume 21, Issue 4 (September 2024)
Year of Publication: 2024
ISSN: 2406-1018 (Online)
Publisher: ComSIS Consortium

Full text

DownloadAvailable in PDF
Portable Document Format

How to cite

Gao, L.: MFE-Transformer: Adaptive English Text Named Entity Recognition Method Based on Multi-feature Extraction and Transformer. Computer Science and Information Systems, Vol. 21, No. 4, 1865–1885. (2024), https://doi.org/10.2298/CSIS240418061G