Index of papers in Proc. ACL that mention
  • binarization
Zhang, Hao and Fang, Licheng and Xu, Peng and Wu, Xiaoyun
Abstract
We propose an alternative approach to generating forests that is based on combining sub-trees within the first best parse through binarization .
Abstract
Provably, our binarization forest can cover any non-consitituent phrases in a sentence but maintains the desirable property that for each span there is at most one nonterminal so that the grammar constant for decoding is relatively small.
Abstract
For the purpose of reducing search errors, we apply the synchronous binarization technique to forest-to-string decoding.
Introduction
To focus on structural variants, we propose a family of binarization algorithms to expand one single constituent tree into a packed forest of binary trees containing combinations of adj acent tree nodes.
Introduction
0 Forests are not generated by a parser but by combining substructures using a tree binarizer .
Introduction
For the first time, we show that similar to string-to-tree decoding, synchronous binarization significantly reduces search errors and improves translation quality for forest-to-string decoding.
Source Tree Binarization
The motivation of tree binarization is to factorize large and rare structures into smaller but frequent ones to improve generalization.
Source Tree Binarization
If long sequences are binarized,
binarization is mentioned in 61 sentences in this paper.
Topics mentioned in this paper:
Zhao, Bing and Lee, Young-Suk and Luo, Xiaoqiang and Li, Liu
Abstract
In particular, we integrate synchronous binarizations , verb regrouping, removal of redundant parse nodes, and incorporate a few important features such as translation boundaries.
Decoding
E T as above-mentioned, such as binarizations , at different levels for constructing partial hypothesis.
Elementary Trees to String Grammar
We specified a few operators for transforming an elementary tree 7, including flattening tree operators such as removing interior nodes in vi, or grouping the children via binarizations .
Elementary Trees to String Grammar
Obvious systematic linguistic divergences between language-pairs could be handled by some simple operators such as using binarization to regroup contiguously aligned children.
Elementary Trees to String Grammar
3.4.1 Binarizations
binarization is mentioned in 18 sentences in this paper.
Topics mentioned in this paper:
Büchse, Matthias and Koller, Alexander and Vogler, Heiko
Abstract
Binarization of grammars is crucial for improving the complexity and performance of parsing and translation.
Abstract
We present a versatile binarization algorithm that can be tailored to a number of grammar formalisms by simply varying a formal parameter.
Abstract
We apply our algorithm to binarizing tree-to-string transducers used in syntax-based machine translation.
Introduction
Binarization amounts to transforming a given grammar into an equivalent grammar of rank 2, i.e., with at most two nonterminals on any right-hand side.
Introduction
The ability to binarize grammars is crucial for efficient parsing, because for many grammar formalisms the parsing complexity depends exponentially on the rank of the grammar.
Introduction
The classical approach to binarization , as known from the Chomsky normal form transformation for context-free grammars (CFGs), proceeds rule by rule.
binarization is mentioned in 88 sentences in this paper.
Topics mentioned in this paper:
Zhang, Yuan and Barzilay, Regina and Globerson, Amir
Features
Note that after binarization , grandparent and sibling information becomes very important in encoding the structure.
Implementation
This section introduces important implementation details, including supertagging, feature forest pruning and binarization methods.
Implementation
5.3 Binarization
Implementation
Since Penn Treebank trees are not binarized, we construct a simple procedure for binarizing them.
The Learning Problem
Note that the two derivations share the same ( binarized ) tree structure.
binarization is mentioned in 10 sentences in this paper.
Topics mentioned in this paper:
Crescenzi, Pierluigi and Gildea, Daniel and Marino, Andrea and Rossi, Gianluca and Satta, Giorgio
Introduction
Whenever it is possible, binarization of LCFRS rules, or reduction of rank to two, is therefore important for parsing, as it reduces the time complexity needed for dynamic programming.
Introduction
This has lead to a number of binarization algorithms for LCFRSs, as well as factorization algorithms that factor rules into new rules with smaller rank, without necessarily reducing rank all the way to two.
Introduction
Kuhlmann and Satta (2009) present an algorithm for binarizing certain LCFRS rules without increasing their fanout, and Sagot and Satta (2010) show how to reduce rank to the lowest value possible for LCFRS rules of fanout two, again without increasing fanout.
binarization is mentioned in 8 sentences in this paper.
Topics mentioned in this paper:
Heilman, Michael and Cahill, Aoife and Madnani, Nitin and Lopez, Melissa and Mulholland, Matthew and Tetreault, Joel
Experiments
We also trained and evaluated on binarized versions of the ordinal GUG labels: a sentence was labeled 1 if the average judgment was at least 3.5 (i.e., would round to 4), and 0 otherwise.
Experiments
To train our system on binarized data, we replaced the £2 -regularized linear regression model with an 62-regularized logistic regression and used Kendall’s 7' rank correlation between the predicted probabilities of the positive class and the binary gold standard labels as the grid search metric (§3.1) instead of Pearson’s 7“.
Experiments
We also evaluated the binary system for the ordinal task by computing correlations between its estimated probabilities and the averaged human scores, and we evaluated the ordinal system for the binary task by binarizing its predictions.12
System Description
From an initial prediction y, it produces the final prediction: A AM re not affect Pearson’s 'r correlations or rankings, but it would affect binarized predictions.
binarization is mentioned in 5 sentences in this paper.
Topics mentioned in this paper:
Zhang, Meishan and Zhang, Yue and Che, Wanxiang and Liu, Ting
Character-based Chinese Parsing
The system can provide bina-rzied CFG trees in Chomsky Norm Form, and they present a reversible conversion procedure to map arbitrary CFG trees into binarized trees.
Character-based Chinese Parsing
In this work, we remain consistent with their work, using the head-finding rules of Zhang and Clark (2008), and the same binarization algorithm.1 We apply the same beam-search algorithm for decoding, and employ the averaged perceptron with early-update to train our model.
Word Structures and Syntax Trees
Our annotations are binarized recursive word
Word Structures and Syntax Trees
(Los Angeles)”, have flat structures, and we use “coordination” for their left binarization .
binarization is mentioned in 4 sentences in this paper.
Topics mentioned in this paper:
Finkel, Jenny Rose and Kleeman, Alex and Manning, Christopher D.
Experiments
Lastly, for the WSJ40 runs we used a simple, right branching binarization where each active state is annotated with its previous sibling and first child.
The Model
Binarization of rules (Earley, 1970) is necessary to obtain cubic parsing time, and closure of unary chains is required for finding total probability mass (rather than just best parses) (Stolcke, 1995).
The Model
This was done by collapsing all allowed unary chains to single unary rules, and disallowing multiple unary rule applications over the same span.1 We give the details of our binarization scheme in Section 5.
binarization is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Cheung, Jackie Chi Kit and Penn, Gerald
Experiments
All productions in the corpus have also been binarized .
Experiments
Tuning the parameter settings on the development set, we found that parameterized categories, binarization , and including punctuation gave the best F1 performance.
Introduction
They also binarize the very flat topological tree structures, and prune rules that only occur once.
binarization is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Sagot, Beno^it and Satta, Giorgio
Introduction
Most of the literature has been focusing on binarization algorithms, which attempt to find a reduction to 7“ = 2 and return a failure if this is not possible.
Introduction
(2009) report a general binarization algorithm for LCFRS which, in the case of f = 2, works in time 0(lpl7), where |p| is the size of the input production.
Introduction
A more efficient binarization algorithm for the case f = 2 is presented in (Gomez-Rodriguez and Satta, 2009), working in time O(|p|).
binarization is mentioned in 3 sentences in this paper.
Topics mentioned in this paper:
Chiang, David and Andreas, Jacob and Bauer, Daniel and Hermann, Karl Moritz and Jones, Bevan and Knight, Kevin
Abstract
We present a more precise characterization of the algorithm’s complexity, an optimization analogous to binarization of context-free grammars, and some important implementation details, resulting in an algorithm that is practical for natural-language applications.
Introduction
We give a more precise complexity analysis in terms of the grammar and the size and maximum degree of the input graph, and we show how to optimize it by a process analogous to binarization of CFGs, following Gildea (2011).
Parsing
In this section, we present a refinement that makes the rule-matching procedure explicit, and because it matches rules little by little, similarly to binarization of CFG rules, it does so more efliciently than the original.
binarization is mentioned in 3 sentences in this paper.
Topics mentioned in this paper: