Index of papers in Proc. ACL that mention
  • statistical machine translation
Zhou, Guangyou and Liu, Fang and Liu, Yang and He, Shizhu and Zhao, Jun
Abstract
Our proposed method employs statistical machine translation to improve question retrieval and enriches the question representation with the translated words from other languages via matrix factorization.
Experiments
Row 8 and row 9 are our proposed method, which leverages statistical machine translation to improve question retrieval via matrix factorization.
Introduction
The idea of improving question retrieval with statistical machine translation is based on the following two observa-
Introduction
However, there are two problems with this enrichment: (1) enriching the original questions with the translated words from other languages increases the dimensionality and makes the question representation even more sparse; (2) statistical machine translation may introduce noise, which can harm the performance of question retrieval.
Introduction
To solve these two problems, we propose to leverage statistical machine translation to improve question retrieval via matrix factorization.
Our Approach
This paper aims to leverage statistical machine translation to enrich the question representation.
Our Approach
Statistical machine translation (e.g., Google Translate) can utilize contextual information during the question translation, so it can solve the word ambiguity and word mismatch problems to some extent.
Our Approach
However, there are two problems with this enrichment: (1) enriching the original questions with the translated words from other languages makes the question representation even more sparse; (2) statistical machine translation may introduce noise.5 To solve these two problems, we propose to leverage statistical machine translation to improve question retrieval via matrix factorization.
statistical machine translation is mentioned in 16 sentences in this paper.
Topics mentioned in this paper:
Yeniterzi, Reyyan and Oflazer, Kemal
Introduction
Statistical machine translation into a morphologically complex language such as Turkish, Finnish or Arabic, involves the generation of target words with the proper morphology, in addition to properly ordering the target words.
Introduction
We assume that the reader is familiar with the basics of phrase-based statistical machine translation (Koehn et al., 2003) and factored statistical machine translation (Koehn and Hoang, 2007).
Related Work
Statistical Machine Translation into a morphologically rich language is a challenging problem in that, on the target side, the decoder needs to generate both the right sequence of constituents and the right sequence of morphemes for each word.
Related Work
Using morphology in statistical machine translation has been addressed by many researchers for translation from or into morphologically rich(er) languages.
Related Work
Goldwater and McClosky (2005) use morphological analysis on the Czech side to get improvements in Czech-to-English statistical machine translation .
statistical machine translation is mentioned in 6 sentences in this paper.
Topics mentioned in this paper:
Zhao, Hai and Song, Yan and Kit, Chunyu and Zhou, Guodong
Abstract
A simple statistical machine translation method, word-by-word decoding, where not a parallel corpus but a bilingual lexicon is necessary, is adopted for the treebank translation.
Conclusion and Future Work
A simple statistical machine translation technique, word-by-word decoding, where only a bilingual lexicon is necessary, is used to translate the source treebank.
Introduction
In addition, a standard statistical machine translation method based on a parallel corpus will not work effectively if it is not able to find a parallel corpus that right covers source and target treebanks.
The Related Work
The second is that a parallel corpus is required for their work and a strict statistical machine translation procedure was performed, while our approach holds a merit of simplicity as only a bilingual lexicon is required.
Treebank Translation and Dependency Transformation
A word-by-word statistical machine translation strategy is adopted to translate words attached with the respective dependency information from the source language to the target one.
statistical machine translation is mentioned in 5 sentences in this paper.
Topics mentioned in this paper:
Chen, Boxing and Foster, George and Kuhn, Roland
Abstract
We then apply the algorithms to statistical machine translation by computing the sense similarity between the source and target side of translation rule pairs.
Conclusions and Future Work
similarity for terms from parallel corpora and applied it to statistical machine translation .
Conclusions and Future Work
We have shown that the sense similarity computed between units from parallel corpora by means of our algorithm is helpful for at least one multilingual application: statistical machine translation .
Introduction
Is it useful for multilingual applications, such as statistical machine translation (SMT)?
Introduction
Second, we use the sense similarities between the source and target sides of a translation rule to improve statistical machine translation performance.
statistical machine translation is mentioned in 5 sentences in this paper.
Topics mentioned in this paper:
Feng, Minwei and Peter, Jan-Thorsten and Ney, Hermann
Introduction
The systematic word order difference between two languages poses a challenge for current statistical machine translation (SMT) systems.
Introduction
:15 for Statistical Machine Translation
Introduction
The remainder of this paper is organized as follows: Section 2 introduces the basement of this research: the principle of statistical machine translation .
Translation System Overview
In statistical machine translation , we are given a source language sentence fi] = fl... fj .
statistical machine translation is mentioned in 4 sentences in this paper.
Topics mentioned in this paper:
Uszkoreit, Jakob and Brants, Thorsten
Abstract
We show that combining them with word—based n—gram models in the log—linear model of a state—of—the—art statistical machine translation system leads to improvements in translation quality as indicated by the BLEU score.
Experiments
In the subsequent experiments, we use a phrase-based statistical machine translation system based on the log-linear formulation of the problem described in (Och and Ney, 2002):
Introduction
However, in the area of statistical machine translation , especially in the context of large training corpora, fewer experiments with class-based n-gram models have been performed with mixed success (Raab, 2006).
Introduction
We then show that using partially class-based language models trained using the resulting classifications together with word-based language models in a state-of-the-art statistical machine translation system yields improvements despite the very large size of the word-based models used.
statistical machine translation is mentioned in 4 sentences in this paper.
Topics mentioned in this paper:
Liu, Le and Hong, Yu and Liu, Hao and Wang, Xing and Yao, Jianmin
Abstract
Data selection has been demonstrated to be an effective approach to addressing the lack of high-quality bitext for statistical machine translation in the domain of interest.
Conclusion
our methods into domain adaptation task of statistical machine translation in model level.
Introduction
Statistical machine translation depends heavily on large scale parallel corpora.
Training Data Selection Methods
Translation model is a key component in statistical machine translation .
statistical machine translation is mentioned in 4 sentences in this paper.
Topics mentioned in this paper:
Liu, Zhanyi and Wang, Haifeng and Wu, Hua and Li, Sheng
Abstract
This paper proposes to use monolingual collocations to improve Statistical Machine Translation (SMT).
Conclusion
Statistical Machine Translation .
Conclusion
The Mathematics of Statistical Machine Translation : Pa-
Conclusion
Moses: Open Source Toolkit for Statistical Machine Translation .
statistical machine translation is mentioned in 4 sentences in this paper.
Topics mentioned in this paper:
Zhang, Jiajun and Zong, Chengqing
Abstract
Currently, almost all of the statistical machine translation (SMT) models are trained with the parallel corpora in some specific domains.
Introduction
During the last decade, statistical machine translation has made great progress.
Introduction
Recently, more and more researchers concentrated on taking full advantage of the monolingual corpora in both source and target languages, and proposed methods for bilingual lexicon induction from nonparallel data (Rapp, 1995, 1999; Koehn and Knight, 2002; Haghighi et al., 2008; Daume III and J agarlamudi, 2011) and proposed unsupervised statistical machine translation (bilingual lexicon is a byproduct) with only monolingual corpora (Ravi and Knight, 2011; Nuhn et al., 2012; Dou and Knight, 2012).
Introduction
The unsupervised statistical machine translation method (Ravi and Knight, 2011; Nuhn et al., 2012; Dou and Knight, 2012) viewed the translation task as a decipherment problem and designed a generative model with the objective function to maximize the likelihood of the source language monolingual data.
statistical machine translation is mentioned in 4 sentences in this paper.
Topics mentioned in this paper:
Schwartz, Lane and Callison-Burch, Chris and Schuler, William and Wu, Stephen
Introduction
Early work in statistical machine translation Viewed translation as a noisy channel process comprised of a translation model, which functioned to posit adequate translations of source language words, and a target language model, which guided the fluency of generated target language strings (Brown et al.,
Introduction
Drawing on earlier successes in speech recognition, research in statistical machine translation has effectively used n-gram word sequence models as language models.
Related Work
speech recognition and statistical machine translation focus on the use of n-grams, which provide a simple finite-state model approximation of the target language.
Related Work
priate algorithmic fit for incorporating syntax into phrase-based statistical machine translation , since both process sentences in an incremental left-to-right fashion.
statistical machine translation is mentioned in 4 sentences in this paper.
Topics mentioned in this paper:
Avramidis, Eleftherios and Koehn, Philipp
Factored Model
The factored statistical machine translation model uses a log-linear approach, in order to combine the several components, including the language model, the reordering model, the translation models and the generation models.
Introduction
Traditional statistical machine translation methods are based on mapping on the lexical level, which takes place in a local window of a few words.
Introduction
Our method is based on factored phrase-based statistical machine translation models.
Introduction
Traditional statistical machine translation models deal with this problems in two ways:
statistical machine translation is mentioned in 4 sentences in this paper.
Topics mentioned in this paper:
Eidelman, Vladimir and Marton, Yuval and Resnik, Philip
Abstract
However, these solutions are impractical in complex structured prediction problems such as statistical machine translation .
Conclusions and Future Work
Finally, although motivated by statistical machine translation , RM is a gradient-based method that can easily be applied to other problems.
Introduction
The desire to incorporate high-dimensional sparse feature representations into statistical machine translation (SMT) models has driven recent research away from Minimum Error Rate Training (MERT) (Och, 2003), and toward other discriminative methods that can optimize more features.
statistical machine translation is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Cancedda, Nicola
Abstract
Some Statistical Machine Translation systems never see the light because the owner of the appropriate training data cannot release them, and the potential user of the system cannot disclose what should be translated.
Introduction
It is generally taken for granted that whoever is deploying a Statistical Machine Translation (SMT) system has unrestricted rights to access and use the parallel data required for its training.
Related work
private access to a phrase table or other resources for the purpose of performing statistical machine translation .
statistical machine translation is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Ravi, Sujith
Abstract
In this paper, we propose a new Bayesian inference method to train statistical machine translation systems using only nonparallel corpora.
Introduction
Statistical machine translation (SMT) systems these days are built using large amounts of bilingual parallel corpora.
Introduction
Recently, this topic has been receiving increasing attention from researchers and new methods have been proposed to train statistical machine translation models using only monolingual data in the source and target language.
statistical machine translation is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Hu, Yuening and Zhai, Ke and Eidelman, Vladimir and Boyd-Graber, Jordan
Introduction
In particular, we use topic models to aid statistical machine translation (Koehn, 2009, SMT).
Topic Models for Machine Translation
2.1 Statistical Machine Translation
Topic Models for Machine Translation
Statistical machine translation casts machine translation as a probabilistic process (Koehn, 2009).
statistical machine translation is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Xiong, Deyi and Zhang, Min
Abstract
In this paper, we propose a sense-based translation model to integrate word senses into statistical machine translation .
Introduction
Therefore a natural assumption is that word sense disambiguation (WSD) may contribute to statistical machine translation (SMT) by providing appropriate word senses for target translation selection with context features (Carpuat and Wu, 2005).
Introduction
'or Statistical Machine Translation
statistical machine translation is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Yıldız, Olcay Taner and Solak, Ercan and Görgün, Onur and Ehsani, Razieh
Abstract
In this paper, we report our preliminary efforts in building an English-Turkish parallel treebank corpus for statistical machine translation .
Introduction
For example, EuroParl corpus (Koehn, 2002), one of the biggest parallel corpora in statistical machine translation , contains 22 languages (but not Turkish).
Introduction
In this study, we report our preliminary efforts in constructing an English-Turkish parallel treebank corpus for statistical machine translation .
statistical machine translation is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Xiong, Deyi and Zhang, Min and Li, Haizhou
Abstract
Predicate-argument structure contains rich semantic information of which statistical machine translation hasn’t taken full advantage.
Abstract
In this paper, we propose two discriminative, feature-based models to exploit predicate-argument structures for statistical machine translation : 1) a predicate translation model and 2) an argument reordering model.
Introduction
Recent years have witnessed increasing efforts towards integrating predicate-argument structures into statistical machine translation (SMT) (Wu and Fung, 2009b; Liu and Gildea, 2010).
statistical machine translation is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Razmara, Majid and Foster, George and Sankaran, Baskaran and Sarkar, Anoop
Abstract
Statistical machine translation is often faced with the problem of combining training data from many diverse sources into a single translation model which then has to translate sentences in a new domain.
Introduction
Statistical machine translation (SMT) systems require large parallel corpora in order to be able to obtain a reasonable translation quality.
Introduction
els in Statistical Machine Translation
statistical machine translation is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Nuhn, Malte and Mauser, Arne and Ney, Hermann
Abstract
In this paper we show how to train statistical machine translation systems on real-life tasks using only nonparallel monolingual data from two languages.
Conclusion
We presented a method for learning statistical machine translation models from nonparallel data.
Conclusion
This work serves as a big step towards large-scale unsupervised training for statistical machine translation systems.
statistical machine translation is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Kolachina, Prasanth and Cancedda, Nicola and Dymetman, Marc and Venkatapathy, Sriram
Abstract
Parallel data in the domain of interest is the key resource when training a statistical machine translation (SMT) system for a specific purpose.
Inferring a learning curve from mostly monolingual data
The ability to predict the amount of parallel data required to achieve a given level of quality is very valuable in planning business deployments of statistical machine translation ; yet, we are not aware of any rigorous proposal for addressing this need.
Introduction
Parallel data in the domain of interest is the key resource when training a statistical machine translation (SMT) system for a specific business purpose.
statistical machine translation is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Wuebker, Joern and Mauser, Arne and Ney, Hermann
Abstract
Several attempts have been made to learn phrase translation probabilities for phrase-based statistical machine translation that go beyond pure counting of phrases in word-aligned training data.
Experimental Evaluation
We conducted our experiments on the German-English data published for the ACL 2008 Workshop on Statistical Machine Translation (WMT08).
Introduction
Europarl task from the ACL 2008 Workshop on Statistical Machine Translation (WMT08).
statistical machine translation is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Kumar, Shankar and Macherey, Wolfgang and Dyer, Chris and Och, Franz
Abstract
Minimum Error Rate Training (MERT) and Minimum Bayes-Risk (MBR) decoding are used in most current state-of—the-art Statistical Machine Translation (SMT) systems.
Introduction
Statistical Machine Translation (SMT) systems have improved considerably by directly using the error criterion in both training and decoding.
Minimum Error Rate Training
In the context of statistical machine translation , the optimization procedure was first described in Och (2003) for N -best lists and later extended to phrase-lattices in Macherey et al.
statistical machine translation is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Haffari, Gholamreza and Sarkar, Anoop
Abstract
Statistical machine translation (SMT) models require bilingual corpora for training, and these corpora are often multilingual with parallel text in multiple languages simultaneously.
Introduction
The main source of training data for statistical machine translation (SMT) models is a parallel corpus.
Introduction
11 Statistical Machine Translation*
statistical machine translation is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Zhang, Dongdong and Li, Mu and Duan, Nan and Li, Chi-Ho and Zhou, Ming
Abstract
Conventional statistical machine translation (SMT) systems do not perform well on measure word generation due to data sparseness and the potential long distance dependency between measure words and their corresponding head words.
Abstract
Our model works as a postprocessing procedure over output of statistical machine translation systems, and can work with any SMT system.
Introduction
In most statistical machine translation (SMT) models (Och et al., 2004; Koehn et al., 2003; Chiang, 2005), some of measure words can be generated without modification or additional processing.
statistical machine translation is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Shen, Libin and Xu, Jinxi and Weischedel, Ralph
Abstract
In this paper, we propose a novel string-to-dependency algorithm for statistical machine translation .
Conclusions and Future Work
In this paper, we propose a novel string-to-dependency algorithm for statistical machine translation .
Introduction
In recent years, hierarchical methods have been successfully applied to Statistical Machine Translation (Graehl and Knight, 2004; Chiang, 2005; Ding and Palmer, 2005; Quirk et al., 2005).
statistical machine translation is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Hermjakob, Ulf and Knight, Kevin and Daumé III, Hal
Abstract
We present a method to transliterate names in the framework of end-to-end statistical machine translation .
Discussion
We have shown that a state-of-the-art statistical machine translation system can benefit from a dedicated transliteration module to improve the transla-
Introduction
State-of-the-art statistical machine translation (SMT) is bad at translating names that are not very common, particularly across languages with different character sets and sound systems.
statistical machine translation is mentioned in 3 sentences in this paper.
Topics mentioned in this paper: