Backgrounds | Particles are suffixes or tokens in Japanese grammar that immediately follow modified content words or sentences. |
Composed Rule Extraction | A chunk contains roughly one content word (usually the head) and affixed function words, such as case markers (e.g., go) and verbal morphemes (e.g., sa re to, which indicate past tense and passive voice). |
Introduction | This indicates that the alignments of the function words are more easily to be mistaken than content words . |
Related Research | Specially, we observed that most incorrect or ambiguous word alignments are caused by function words rather than content words . |
Elementary Trees to String Grammar | (0/ , 7’ ) deletes a src content word |
Elementary Trees to String Grammar | (0/ , 7’ ) over generates a tgt content word ( v ) |
The Projectable Structures | The transformations could be as simple as merging two adjacent nonterminals into one bracket to accommodate non-contiguity on the target side, or lexicalizing those words which have fork-style, many—to—many alignment, or unaligned content words to enable the rest of the span to be generalized into nonterminals. |