Introduction | To be able to translate a Chinese abbreviation that s unseen in available parallel corpora, one may an-lotate more parallel data . |
Unsupervised Translation Induction for Chinese Abbreviations | This is particularly interesting since we normally have enormous monolingual data, but a small amount of parallel data . |
Unsupervised Translation Induction for Chinese Abbreviations | For example, in the translation task between Chinese and English, both the Chinese and English Gigaword have billions of words, but the parallel data has only about 30 million words. |