Hypotheses re-ranking in translation models using human markup

K. V. Vorontsov; Воронцов К. В.; N. A. Skachkov; Скачков Н. А.

doi:10.31857/S0002338824040074

Hypotheses re-ranking in translation models using human markup

作者: Vorontsov K.V.¹, Skachkov N.A.¹
隶属关系:
1. CC FRC CSC RAS
期: 编号 4 (2024)
页面: 121-128
栏目: ARTIFICIAL INTELLIGENCE
URL: https://rjsvd.com/0002-3388/article/view/676401
DOI: https://doi.org/10.31857/S0002338824040074
EDN: https://elibrary.ru/UEFMST
ID: 676401

如何引用文章

全文:

开放存取

##reader.subscriptionAccessGranted##
受限制的访问

订阅或者付费存取

详细
全文:
作者简介
参考
补充文件
统计

详细

Modern machine translation systems are trained on large volumes of parallel data obtained using heuristic methods of the Internet bypassing. The poor quality of the data leads to systematic translation errors, which can be quite noticeable from the human point of view. To fix such errors a human based models hypotheses re-ranking is introduced in this work. In this paper the use of human markup is shown not only to increase the overall quality of translation, but also to significantly reduce the number of systematic translation errors. In addition, the relative simplicity of human markup and its integration in the model training process opens up new opportunities in the field of domain adaptation of translation models for new domains like online retail.

关键词

Machine translation, neural network, stochastic gradient descent, contrastive learning, model fine-tuning, negative examples

全文:

作者简介

K. Vorontsov

CC FRC CSC RAS

编辑信件的主要联系方式.
Email: vokov@forecsys.ru
俄罗斯联邦, Moscow

N. Skachkov

CC FRC CSC RAS

Email: nikolaj-skachkov@ya.ru
俄罗斯联邦, Moscow

参考

Bañón M., Chen P., Haddow B. et. al. ParaCrawl: Web-Scale Acquisition of Parallel Corpora // Proc. 58th Annual Meeting of the Association for Computational Linguistics. Seattle, 2020. P. 4555–4567.
Stahlberg F. Neural Machine Translation: A Review // J. Artific. Intelligence Res. 2020. № 69. P. 343–418.
Vaswani A., Shazeer N., Parmar N. et. al. Attention is All You Need // Proc. 31st Intern. Conf. on Neural Information Processing Systems (NIPS’17). Curran Associates Inc., Red Hook. N.Y., 2017. P. 6000–6010.
Yang Z., Cheng Y., Liu Y. et. al. Reducing Word Omission Errors in Neural Machine Translation: A Contrastive Learning Approach // Proc. 57th Annual Meeting of the Association for Computational Linguistics. Florence, 2019. P. 6191–6196.
Vijayakumar A.K., Cogswell M., Selvaraju R.R. et. al. Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models // ArXiv. 2016. abs/1610.02424.
Papineni K., Roukos S., Ward T. et. al. Bleu: a Method for Automatic Evaluation of Machine Translation // Proc. 40th Annual Meeting of the Association for Computational Linguistics. Philadelphia, 2002. P. 311–318.
Barrault L., Bojar O.R., Costa-jussà M. et al. Findings of the Conf. on Machine Translation (WMT19) // Proc. Fourth Conf. on Machine Translation. Florence, 2019. V. 2: Shared Task Papers.
Kingma D.P., Ba J. Adam: A Method for Stochastic Optimization // 3rd Intern. Conf. on Learning Representations (ICLR). San Diego, CA, 2015.

补充文件

附件文件

动作

1. JATS XML

下载

2. Fig. 1. Example of a task for human markers

下载 (138KB)

索引源数据

用户名
密码
记住我

忘记您的密码?	注册

用户名
密码
记住我

忘记您的密码?	注册

编号 5 (2025)

编号 5 (2025)

Hypotheses re-ranking in translation models using human markup

全文:

详细

关键词

全文:

作者简介

K. Vorontsov

N. Skachkov

参考

补充文件