Index of papers in Proc. ACL that mention

statistical machine translation

Seen in text as:

statistical machine translation (84)
Statistical Machine Translation (16)
Statistical machine translation (9)

Seen in 108 sentences in 27 papers.

1. Statistical Machine Translation Improves Question Retrieval in Community Question Answering via Matrix Factorization

Zhou, Guangyou and Liu, Fang and Liu, Yang and He, Shizhu and Zhao, Jun

In Proc. ACL 2013, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	Our proposed method employs statistical machine translation to improve question retrieval and enriches the question representation with the translated words from other languages via matrix factorization.
Experiments	Row 8 and row 9 are our proposed method, which leverages statistical machine translation to improve question retrieval via matrix factorization.
Introduction	The idea of improving question retrieval with statistical machine translation is based on the following two observa-
Introduction	However, there are two problems with this enrichment: (1) enriching the original questions with the translated words from other languages increases the dimensionality and makes the question representation even more sparse; (2) statistical machine translation may introduce noise, which can harm the performance of question retrieval.
Introduction	To solve these two problems, we propose to leverage statistical machine translation to improve question retrieval via matrix factorization.
Our Approach	This paper aims to leverage statistical machine translation to enrich the question representation.
Our Approach	Statistical machine translation (e.g., Google Translate) can utilize contextual information during the question translation, so it can solve the word ambiguity and word mismatch problems to some extent.
Our Approach	However, there are two problems with this enrichment: (1) enriching the original questions with the translated words from other languages makes the question representation even more sparse; (2) statistical machine translation may introduce noise.5 To solve these two problems, we propose to leverage statistical machine translation to improve question retrieval via matrix factorization.

statistical machine translation is mentioned in 16 sentences in this paper.

Topics mentioned in this paper:

2. Syntax-to-Morphology Mapping in Factored Phrase-Based Statistical Machine Translation from English to Turkish

Yeniterzi, Reyyan and Oflazer, Kemal

In Proc. ACL 2010, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Introduction	Statistical machine translation into a morphologically complex language such as Turkish, Finnish or Arabic, involves the generation of target words with the proper morphology, in addition to properly ordering the target words.
Introduction	We assume that the reader is familiar with the basics of phrase-based statistical machine translation (Koehn et al., 2003) and factored statistical machine translation (Koehn and Hoang, 2007).
Related Work	Statistical Machine Translation into a morphologically rich language is a challenging problem in that, on the target side, the decoder needs to generate both the right sequence of constituents and the right sequence of morphemes for each word.
Related Work	Using morphology in statistical machine translation has been addressed by many researchers for translation from or into morphologically rich(er) languages.
Related Work	Goldwater and McClosky (2005) use morphological analysis on the Czech side to get improvements in Czech-to-English statistical machine translation .

statistical machine translation is mentioned in 6 sentences in this paper.

Topics mentioned in this paper:

3. Cross Language Dependency Parsing using a Bilingual Lexicon

Zhao, Hai and Song, Yan and Kit, Chunyu and Zhou, Guodong

In Proc. ACL 2009, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	A simple statistical machine translation method, word-by-word decoding, where not a parallel corpus but a bilingual lexicon is necessary, is adopted for the treebank translation.
Conclusion and Future Work	A simple statistical machine translation technique, word-by-word decoding, where only a bilingual lexicon is necessary, is used to translate the source treebank.
Introduction	In addition, a standard statistical machine translation method based on a parallel corpus will not work effectively if it is not able to find a parallel corpus that right covers source and target treebanks.
The Related Work	The second is that a parallel corpus is required for their work and a strict statistical machine translation procedure was performed, while our approach holds a merit of simplicity as only a bilingual lexicon is required.
Treebank Translation and Dependency Transformation	A word-by-word statistical machine translation strategy is adopted to translate words attached with the respective dependency information from the source language to the target one.

statistical machine translation is mentioned in 5 sentences in this paper.

Topics mentioned in this paper:

4. Bilingual Sense Similarity for Statistical Machine Translation

Chen, Boxing and Foster, George and Kuhn, Roland

In Proc. ACL 2010, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	We then apply the algorithms to statistical machine translation by computing the sense similarity between the source and target side of translation rule pairs.
Conclusions and Future Work	similarity for terms from parallel corpora and applied it to statistical machine translation .
Conclusions and Future Work	We have shown that the sense similarity computed between units from parallel corpora by means of our algorithm is helpful for at least one multilingual application: statistical machine translation .
Introduction	Is it useful for multilingual applications, such as statistical machine translation (SMT)?
Introduction	Second, we use the sense similarities between the source and target sides of a translation rule to improve statistical machine translation performance.

statistical machine translation is mentioned in 5 sentences in this paper.

Topics mentioned in this paper:

5. Advancements in Reordering Models for Statistical Machine Translation

Feng, Minwei and Peter, Jan-Thorsten and Ney, Hermann

In Proc. ACL 2013, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Introduction	The systematic word order difference between two languages poses a challenge for current statistical machine translation (SMT) systems.
Introduction	:15 for Statistical Machine Translation
Introduction	The remainder of this paper is organized as follows: Section 2 introduces the basement of this research: the principle of statistical machine translation .
Translation System Overview	In statistical machine translation , we are given a source language sentence fi] = fl... fj .

statistical machine translation is mentioned in 4 sentences in this paper.

Topics mentioned in this paper:

CRFs (19)
LM (14)
BLEU (8)

6. Distributed Word Clustering for Large Scale Class-Based Language Modeling in Machine Translation

Uszkoreit, Jakob and Brants, Thorsten

In Proc. ACL 2008, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	We show that combining them with word—based n—gram models in the log—linear model of a state—of—the—art statistical machine translation system leads to improvements in translation quality as indicated by the BLEU score.
Experiments	In the subsequent experiments, we use a phrase-based statistical machine translation system based on the log-linear formulation of the problem described in (Och and Ney, 2002):
Introduction	However, in the area of statistical machine translation , especially in the context of large training corpora, fewer experiments with class-based n-gram models have been performed with mixed success (Raab, 2006).
Introduction	We then show that using partially class-based language models trained using the resulting classifications together with word-based language models in a state-of-the-art statistical machine translation system yields improvements despite the very large size of the word-based models used.

statistical machine translation is mentioned in 4 sentences in this paper.

Topics mentioned in this paper:

7. Effective Selection of Translation Model Training Data

Liu, Le and Hong, Yu and Liu, Hao and Wang, Xing and Yao, Jianmin

In Proc. ACL 2014, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	Data selection has been demonstrated to be an effective approach to addressing the lack of high-quality bitext for statistical machine translation in the domain of interest.
Conclusion	our methods into domain adaptation task of statistical machine translation in model level.
Introduction	Statistical machine translation depends heavily on large scale parallel corpora.
Training Data Selection Methods	Translation model is a key component in statistical machine translation .

statistical machine translation is mentioned in 4 sentences in this paper.

Topics mentioned in this paper:

8. Improving Statistical Machine Translation with Monolingual Collocation

Liu, Zhanyi and Wang, Haifeng and Wu, Hua and Li, Sheng

In Proc. ACL 2010, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	This paper proposes to use monolingual collocations to improve Statistical Machine Translation (SMT).
Conclusion	Statistical Machine Translation .
Conclusion	The Mathematics of Statistical Machine Translation : Pa-
Conclusion	Moses: Open Source Toolkit for Statistical Machine Translation .

statistical machine translation is mentioned in 4 sentences in this paper.

Topics mentioned in this paper:

9. Learning a Phrase-based Translation Model from Monolingual Data with Application to Domain Adaptation

Zhang, Jiajun and Zong, Chengqing

In Proc. ACL 2013, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	Currently, almost all of the statistical machine translation (SMT) models are trained with the parallel corpora in some specific domains.
Introduction	During the last decade, statistical machine translation has made great progress.
Introduction	Recently, more and more researchers concentrated on taking full advantage of the monolingual corpora in both source and target languages, and proposed methods for bilingual lexicon induction from nonparallel data (Rapp, 1995, 1999; Koehn and Knight, 2002; Haghighi et al., 2008; Daume III and J agarlamudi, 2011) and proposed unsupervised statistical machine translation (bilingual lexicon is a byproduct) with only monolingual corpora (Ravi and Knight, 2011; Nuhn et al., 2012; Dou and Knight, 2012).
Introduction	The unsupervised statistical machine translation method (Ravi and Knight, 2011; Nuhn et al., 2012; Dou and Knight, 2012) viewed the translation task as a decipherment problem and designed a generative model with the objective function to maximize the likelihood of the source language monolingual data.

statistical machine translation is mentioned in 4 sentences in this paper.

Topics mentioned in this paper:

10. Incremental Syntactic Language Models for Phrase-based Translation

Schwartz, Lane and Callison-Burch, Chris and Schuler, William and Wu, Stephen

In Proc. ACL 2011, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Introduction	Early work in statistical machine translation Viewed translation as a noisy channel process comprised of a translation model, which functioned to posit adequate translations of source language words, and a target language model, which guided the fluency of generated target language strings (Brown et al.,
Introduction	Drawing on earlier successes in speech recognition, research in statistical machine translation has effectively used n-gram word sequence models as language models.
Related Work	speech recognition and statistical machine translation focus on the use of n-grams, which provide a simple finite-state model approximation of the target language.
Related Work	priate algorithmic fit for incorporating syntax into phrase-based statistical machine translation , since both process sentences in an incremental left-to-right fashion.

statistical machine translation is mentioned in 4 sentences in this paper.

Topics mentioned in this paper:

11. Enriching Morphologically Poor Languages for Statistical Machine Translation

Avramidis, Eleftherios and Koehn, Philipp

In Proc. ACL 2008, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Factored Model	The factored statistical machine translation model uses a log-linear approach, in order to combine the several components, including the language model, the reordering model, the translation models and the generation models.
Introduction	Traditional statistical machine translation methods are based on mapping on the lexical level, which takes place in a local window of a few words.
Introduction	Our method is based on factored phrase-based statistical machine translation models.
Introduction	Traditional statistical machine translation models deal with this problems in two ways:

statistical machine translation is mentioned in 4 sentences in this paper.

Topics mentioned in this paper:

12. Online Relative Margin Maximization for Statistical Machine Translation

Eidelman, Vladimir and Marton, Yuval and Resnik, Philip

In Proc. ACL 2013, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	However, these solutions are impractical in complex structured prediction problems such as statistical machine translation .
Conclusions and Future Work	Finally, although motivated by statistical machine translation , RM is a gradient-based method that can easily be applied to other problems.
Introduction	The desire to incorporate high-dimensional sparse feature representations into statistical machine translation (SMT) models has driven recent research away from Minimum Error Rate Training (MERT) (Och, 2003), and toward other discriminative methods that can optimize more features.

statistical machine translation is mentioned in 3 sentences in this paper.

Topics mentioned in this paper:

feature set (25)
BLEU (18)
TER (11)

13. Private Access to Phrase Tables for Statistical Machine Translation

Cancedda, Nicola

In Proc. ACL 2012, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	Some Statistical Machine Translation systems never see the light because the owner of the appropriate training data cannot release them, and the potential user of the system cannot disclose what should be translated.
Introduction	It is generally taken for granted that whoever is deploying a Statistical Machine Translation (SMT) system has unrestricted rights to access and use the parallel data required for its training.
Related work	private access to a phrase table or other resources for the purpose of performing statistical machine translation .

statistical machine translation is mentioned in 3 sentences in this paper.

Topics mentioned in this paper:

14. Scalable Decipherment for Machine Translation via Hash Sampling

Ravi, Sujith

In Proc. ACL 2013, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	In this paper, we propose a new Bayesian inference method to train statistical machine translation systems using only nonparallel corpora.
Introduction	Statistical machine translation (SMT) systems these days are built using large amounts of bilingual parallel corpora.
Introduction	Recently, this topic has been receiving increasing attention from researchers and new methods have been proposed to train statistical machine translation models using only monolingual data in the source and target language.

statistical machine translation is mentioned in 3 sentences in this paper.

Topics mentioned in this paper:

15. Polylingual Tree-Based Topic Models for Translation Domain Adaptation

Hu, Yuening and Zhai, Ke and Eidelman, Vladimir and Boyd-Graber, Jordan

In Proc. ACL 2014, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Introduction	In particular, we use topic models to aid statistical machine translation (Koehn, 2009, SMT).
Topic Models for Machine Translation	2.1 Statistical Machine Translation
Topic Models for Machine Translation	Statistical machine translation casts machine translation as a probabilistic process (Koehn, 2009).

statistical machine translation is mentioned in 3 sentences in this paper.

Topics mentioned in this paper:

16. A Sense-Based Translation Model for Statistical Machine Translation

Xiong, Deyi and Zhang, Min

In Proc. ACL 2014, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	In this paper, we propose a sense-based translation model to integrate word senses into statistical machine translation .
Introduction	Therefore a natural assumption is that word sense disambiguation (WSD) may contribute to statistical machine translation (SMT) by providing appropriate word senses for target translation selection with context features (Carpuat and Wu, 2005).
Introduction	'or Statistical Machine Translation

statistical machine translation is mentioned in 3 sentences in this paper.

Topics mentioned in this paper:

17. Constructing a Turkish-English Parallel TreeBank

Yıldız, Olcay Taner and Solak, Ercan and Görgün, Onur and Ehsani, Razieh

In Proc. ACL 2014, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	In this paper, we report our preliminary efforts in building an English-Turkish parallel treebank corpus for statistical machine translation .
Introduction	For example, EuroParl corpus (Koehn, 2002), one of the biggest parallel corpora in statistical machine translation , contains 22 languages (but not Turkish).
Introduction	In this study, we report our preliminary efforts in constructing an English-Turkish parallel treebank corpus for statistical machine translation .

statistical machine translation is mentioned in 3 sentences in this paper.

Topics mentioned in this paper:

18. Modeling the Translation of Predicate-Argument Structure for SMT

Xiong, Deyi and Zhang, Min and Li, Haizhou

In Proc. ACL 2012, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	Predicate-argument structure contains rich semantic information of which statistical machine translation hasn’t taken full advantage.
Abstract	In this paper, we propose two discriminative, feature-based models to exploit predicate-argument structures for statistical machine translation : 1) a predicate translation model and 2) an argument reordering model.
Introduction	Recent years have witnessed increasing efforts towards integrating predicate-argument structures into statistical machine translation (SMT) (Wu and Fung, 2009b; Liu and Gildea, 2010).

statistical machine translation is mentioned in 3 sentences in this paper.

Topics mentioned in this paper:

19. Mixing Multiple Translation Models in Statistical Machine Translation

Razmara, Majid and Foster, George and Sankaran, Baskaran and Sarkar, Anoop

In Proc. ACL 2012, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	Statistical machine translation is often faced with the problem of combining training data from many diverse sources into a single translation model which then has to translate sentences in a new domain.
Introduction	Statistical machine translation (SMT) systems require large parallel corpora in order to be able to obtain a reasonable translation quality.
Introduction	els in Statistical Machine Translation

statistical machine translation is mentioned in 3 sentences in this paper.

Topics mentioned in this paper:

20. Deciphering Foreign Language by Combining Language Models and Context Vectors

Nuhn, Malte and Mauser, Arne and Ney, Hermann

In Proc. ACL 2012, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	In this paper we show how to train statistical machine translation systems on real-life tasks using only nonparallel monolingual data from two languages.
Conclusion	We presented a method for learning statistical machine translation models from nonparallel data.
Conclusion	This work serves as a big step towards large-scale unsupervised training for statistical machine translation systems.

statistical machine translation is mentioned in 3 sentences in this paper.

Topics mentioned in this paper:

LM (27)
BLEU (16)
translation model (15)

21. Prediction of Learning Curves in Machine Translation

Kolachina, Prasanth and Cancedda, Nicola and Dymetman, Marc and Venkatapathy, Sriram

In Proc. ACL 2012, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	Parallel data in the domain of interest is the key resource when training a statistical machine translation (SMT) system for a specific purpose.
Inferring a learning curve from mostly monolingual data	The ability to predict the amount of parallel data required to achieve a given level of quality is very valuable in planning business deployments of statistical machine translation ; yet, we are not aware of any rigorous proposal for addressing this need.
Introduction	Parallel data in the domain of interest is the key resource when training a statistical machine translation (SMT) system for a specific business purpose.

statistical machine translation is mentioned in 3 sentences in this paper.

Topics mentioned in this paper:

22. Training Phrase Translation Models with Leaving-One-Out

Wuebker, Joern and Mauser, Arne and Ney, Hermann

In Proc. ACL 2010, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	Several attempts have been made to learn phrase translation probabilities for phrase-based statistical machine translation that go beyond pure counting of phrases in word-aligned training data.
Experimental Evaluation	We conducted our experiments on the German-English data published for the ACL 2008 Workshop on Statistical Machine Translation (WMT08).
Introduction	Europarl task from the ACL 2008 Workshop on Statistical Machine Translation (WMT08).

statistical machine translation is mentioned in 3 sentences in this paper.

Topics mentioned in this paper:

23. Efficient Minimum Error Rate Training and Minimum Bayes-Risk Decoding for Translation Hypergraphs and Lattices

Kumar, Shankar and Macherey, Wolfgang and Dyer, Chris and Och, Franz

In Proc. ACL 2009, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	Minimum Error Rate Training (MERT) and Minimum Bayes-Risk (MBR) decoding are used in most current state-of—the-art Statistical Machine Translation (SMT) systems.
Introduction	Statistical Machine Translation (SMT) systems have improved considerably by directly using the error criterion in both training and decoding.
Minimum Error Rate Training	In the context of statistical machine translation , the optimization procedure was first described in Och (2003) for N -best lists and later extended to phrase-lattices in Macherey et al.

statistical machine translation is mentioned in 3 sentences in this paper.

Topics mentioned in this paper:

n-gram (21)
BLEU (20)
phrase-based (11)

24. Active Learning for Multilingual Statistical Machine Translation

Haffari, Gholamreza and Sarkar, Anoop

In Proc. ACL 2009, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	Statistical machine translation (SMT) models require bilingual corpora for training, and these corpora are often multilingual with parallel text in multiple languages simultaneously.
Introduction	The main source of training data for statistical machine translation (SMT) models is a parallel corpus.
Introduction	11 Statistical Machine Translation*

statistical machine translation is mentioned in 3 sentences in this paper.

Topics mentioned in this paper:

25. Measure Word Generation for English-Chinese SMT Systems

Zhang, Dongdong and Li, Mu and Duan, Nan and Li, Chi-Ho and Zhou, Ming

In Proc. ACL 2008, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	Conventional statistical machine translation (SMT) systems do not perform well on measure word generation due to data sparseness and the potential long distance dependency between measure words and their corresponding head words.
Abstract	Our model works as a postprocessing procedure over output of statistical machine translation systems, and can work with any SMT system.
Introduction	In most statistical machine translation (SMT) models (Och et al., 2004; Koehn et al., 2003; Chiang, 2005), some of measure words can be generated without modification or additional processing.

statistical machine translation is mentioned in 3 sentences in this paper.

Topics mentioned in this paper:

26. A New String-to-Dependency Machine Translation Algorithm with a Target Dependency Language Model

Shen, Libin and Xu, Jinxi and Weischedel, Ralph

In Proc. ACL 2008, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	In this paper, we propose a novel string-to-dependency algorithm for statistical machine translation .
Conclusions and Future Work	In this paper, we propose a novel string-to-dependency algorithm for statistical machine translation .
Introduction	In recent years, hierarchical methods have been successfully applied to Statistical Machine Translation (Graehl and Knight, 2004; Chiang, 2005; Ding and Palmer, 2005; Quirk et al., 2005).

statistical machine translation is mentioned in 3 sentences in this paper.

Topics mentioned in this paper:

27. Name Translation in Statistical Machine Translation - Learning When to Transliterate

Hermjakob, Ulf and Knight, Kevin and Daumé III, Hal

In Proc. ACL 2008, part of Proceedings of the Annual Meeting of the Association for Computational Linguistics.

Abstract	We present a method to transliterate names in the framework of end-to-end statistical machine translation .
Discussion	We have shown that a state-of-the-art statistical machine translation system can benefit from a dedicated transliteration module to improve the transla-
Introduction	State-of-the-art statistical machine translation (SMT) is bad at translating names that are not very common, particularly across languages with different character sets and sound systems.

statistical machine translation is mentioned in 3 sentences in this paper.

Topics mentioned in this paper: