Advancements in English-Persian hierarchical statistical machine translation
Mohaghegh, Mahsa
Date
2012Citation:
Mohaghegh, M. (2012). Advancements in English-Persian Hierarchical Statistical Machine Translation. NZCSRSC New Zealand Computer Science Research Student Conference. Dunedin, New Zealand. 13 April.Permanent link to Research Bank record:
https://hdl.handle.net/10652/2226Abstract
In this paper we show that a hierarchical phrase-based translation system will outperform a classical (non-hierarchical) phrase-based system in the English-to-Persian translation direction, yet for the Persian-to-English direction, the classical phrase-based system is preferable. We seek to explain why this is so, and detail a series of translation experiments with our SMT system using various bilingual corpora each with both toolkits Moses (non-hierarchical) and Joshua (hierarchical).