An idiom cloze algorithm incorporating contrastive learning

doi:10.19907/j.0490-6756.2022.052003

Home > Archive>Volume 59, Issue 5, 2022 >052003. DOI:10.19907/j.0490-6756.2022.052003

An idiom cloze algorithm incorporating contrastive learning
DOI:
                        10.19907/j.0490-6756.2022.052003
                    
Author:
                        ZHANG Ben-Wen1ZHANG Ben-Wen
School of Science and Engineering， Sichuan Minzu College,
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
HUANG Fang-Yi2HUANG Fang-Yi
College of Computer Science， Sichuan University
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
JU Sheng-Gen2JU Sheng-Gen
College of Computer Science， Sichuan University
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:1.School of Science and Engineering， Sichuan Minzu College,;2.College of Computer Science， Sichuan University
Clc Number:TP391
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Abstract:

Idiom cloze test is a subtask in Machine Reading Comprehension (MRC), which aim to test the model's ability to understand and apply idioms in Chinese text. The existing idiom cloze algorithms ignore the fact that the idiom embeddings suffer from representational collapse, which leads to low accuracy and poor generalization performance on out-of-domain data. In this paper, the authors propose the NeZha-CLofTN, which consists of four parts: embedding layer, fusion coding layer, graph attention subnetwork, and prediction layer. The fusion coding layer uses contrastive learning to force the network to change the feature extraction that avoids the network outputting a constant embedding vector, thus preventing the representational collapse. The prediction layer combines the output of multiple synonym subgraphs to obtain better prediction than a single subgraph and to enhance the generalization performance of the model. NeZha-ClofTN is used in the ChID-Official and ChID-Competition datasets with accuracy of 80.3% and 85.3%, and the effectiveness of each module was demonstrated by ablation experiments.

Key words:Idiom cloze test; Pre-trained language model; Contrastive learning; Synonym idiom

Get Citation

Cite this article as: ZHANG Ben-Wen, HUANG Fang-Yi, JU Sheng-Gen. An idiom cloze algorithm incorporating contrastive learning [J]. J Sichuan Univ: Nat Sci Ed, 2022, 59: 052003.

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:May 09,2022
Revised:June 06,2022
Adopted:June 10,2022
Online: September 29,2022
Published:

Home

About journal

Authors

Referees

Editors

Readers

Contact us

Get Citation

Share

Article Metrics

History