TY - JOUR
T1 - What do post-editors correct?
T2 - A fine-grained analysis of SMT and NMT errors
AU - Alvarez-Vidal, Sergi
AU - Oliver, Antoni
AU - Badia, Toni
PY - 2021
Y1 - 2021
N2 - The recent improvements in neural MT (NMT) have driven a shift from statistical MT (SMT) to NMT. However, to assess the usefulness of MT models for post-editing (PE) and gain detailed insight into the output they produce, we need to analyse the most frequent errors and how they affect the task. We present a pilot study of a fine-grained analysis of MT errors based on post-editors' corrections for an English-to-Spanish medical text translated with SMT and NMT. We use the MQM taxonomy to compare the two MT models and obtain a categorized classification of the errors produced. Even though results show a great variation among post-editors' corrections, for this language combination fewer errors are corrected by post-editors in the NMT output. NMT also produces fewer accuracy errors and errors that are less critical.
AB - The recent improvements in neural MT (NMT) have driven a shift from statistical MT (SMT) to NMT. However, to assess the usefulness of MT models for post-editing (PE) and gain detailed insight into the output they produce, we need to analyse the most frequent errors and how they affect the task. We present a pilot study of a fine-grained analysis of MT errors based on post-editors' corrections for an English-to-Spanish medical text translated with SMT and NMT. We use the MQM taxonomy to compare the two MT models and obtain a categorized classification of the errors produced. Even though results show a great variation among post-editors' corrections, for this language combination fewer errors are corrected by post-editors in the NMT output. NMT also produces fewer accuracy errors and errors that are less critical.
KW - Machine translation
KW - MT
KW - NMT
KW - Post-editing
KW - Neural machine translation
KW - Error taxonomy
KW - Traducció automàtica
KW - TA
KW - TAN
KW - Postedició
KW - Traducció automàtica neuronal
KW - Taxonomia d'errors
KW - Traducción automática
KW - Posedición
KW - Traducción automática neuronal
KW - Taxonomía de errores
UR - https://www.scopus.com/pages/publications/85123619661
UR - https://dialnet.unirioja.es/servlet/articulo?codigo=8239318
U2 - 10.5565/rev/tradumatica.286
DO - 10.5565/rev/tradumatica.286
M3 - Article
SN - 1578-7559
SP - 131
EP - 147
JO - Revista Tradumatica
JF - Revista Tradumatica
IS - 19
ER -