The translation task uses the official NIST Chinese-to-English evaluation data.
Table 2 shows the corpora that we use for the translation task.
Table 2: Training corpora for the translation task.
We describe our experiments with training algorithms for tree-to-tree synchronous tree-substitution grammar (STSG) for monolingual translation tasks such as sentence compression and paraphrasing.
These translation tasks are characterized by the relative feasibility of committing to parallel parse trees and by the availability of word alignments, but also by the lack of large-scale data, which calls for a Bayesian tree-to-tree formalism.
Overall, we take these results as encouraging for STSG induction via Bayesian nonparametrics for monolingual translation tasks.