Index of papers in Proc. ACL 2008 that mention
  • morphological analysis
Adler, Meni and Goldberg, Yoav and Gabay, David and Elhadad, Michael
Introduction
For the task of full morphological analysis, the lexicon must provide all possible morphological analyses for any given token.
Introduction
In this paper, we investigate the characteristics of Hebrew unknowns for full morphological analysis , and propose a new method for handling such unavoidable lack of information.
Introduction
In our evaluation, these learned distributions include the correct analysis for unknown words in 85% of the cases, contributing an error reduction of over 30% over a competitive baseline for the overall task of full morphological analysis in Hebrew.
Method
model application is a set of possible full morphological analyses for the token — in exactly the same format as the morphological analyzer provides.
Previous Work
Habash and Rambow (2006) used the root+pattern+features representation of Arabic tokens for morphological analysis and generation of Arabic dialects, which have no lexicon.
Previous Work
They report high recall (95%—98%) but low precision (37%—63%) for token types and token instances, against gold-standard morphological analysis .
Previous Work
Unlike Nakagawa, our model does not use any segmented text, and, on the other hand, it aims to select full morphological analysis for each token,
morphological analysis is mentioned in 11 sentences in this paper.
Topics mentioned in this paper:
Goldberg, Yoav and Tsarfaty, Reut
A Generative PCFG Model
The Input The set of analyses for a token is thus represented as a lattice in which every arc corresponds to a specific lexeme l, as shown in Figure l. A morphological analyzer M : W —> L is a function mapping sentences in Hebrew (W E W) to their corresponding lattices (M = L E L).
A Generative PCFG Model
212,, and a morphological analyzer , we look for the most probable parse tree 7r s.t.
A Generative PCFG Model
Since the lattice L for a given sentence W is determined by the morphological analyzer M we have
Experimental Setup
Morphological Analyzer Ideally, we would use an of-the-shelf morphological analyzer for mapping each input token to its possible analyses.
Experimental Setup
patible with the one of the Hebrew Treebank.8 For this reason, we use a data-driven morphological analyzer derived from the training data similar to (Cohen and Smith, 2007).
Experimental Setup
To control for the effect of the HSPELL-based pruning, we also experimented with a morphological analyzer that does not perform this pruning.
Model Preliminaries
We represent all morphological analyses of a given utterance using a lattice structure.
Previous Work on Hebrew Processing
Morphological analyzers for Hebrew that analyze a surface form in isolation have been proposed by Segal (2000), Yona and Wintner (2005), and recently by the knowledge center for processing Hebrew (Itai et al., 2006).
Previous Work on Hebrew Processing
Morphological dis-ambiguators that consider a token in context (an utterance) and propose the most likely morphological analysis of an utterance (including segmentation) were presented by Bar-Haim et a1.
Previous Work on Hebrew Processing
Tsarfaty (2006) used a morphological analyzer (Segal, 2000), a PoS tagger (Bar-Haim et al., 2005), and a general purpose parser (Schmid, 2000) in an integrated framework in which morphological and syntactic components interact to share information, leading to improved performance on the joint task.
morphological analysis is mentioned in 13 sentences in this paper.
Topics mentioned in this paper:
Toutanova, Kristina and Suzuki, Hisami and Ruopp, Achim
Inflection prediction models
Morphological analysis: returns the set of possible morphological analyses Aw = {a1, ..., a”} for w. A morphological analysis a is a vector of categorical values, where each dimension and its possible values are defined by L.
Inflection prediction models
For the morphological analysis operation, we used the same set of morphological features described in (Minkov et al., 2007), that is, seven features for Russian (POS, Person, Number, Gender, Tense, Mood and Case) and 12 for Arabic (POS, Person, Number, Gender, Tense, Mood, Negation, Determiner, Conjunction, Preposition, Object and Possessive pronouns).
Inflection prediction models
The same is true with the operation of morphological analysis .
Introduction
Work in this area is motivated by two advantages offered by morphological analysis : (1) it provides linguistically motivated clustering of words and makes the data less sparse; (2) it captures morphological constraints applicable on the target side, such as agreement phenomena.
morphological analysis is mentioned in 5 sentences in this paper.
Topics mentioned in this paper:
Snyder, Benjamin and Barzilay, Regina
Experimental SetUp
This Bible edition is augmented by gold standard morphological analysis (including segmentation) performed by biblical scholars.
Experimental SetUp
We obtained gold standard segmentations of the Arabic translation with a handcrafted Arabic morphological analyzer which utilizes manually constructed word lists and compatibility rules and is further trained on a large corpus of hand-annotated Arabic data (Habash and Ram-bow, 2005).
Experimental SetUp
The accuracy of this analyzer is reported to be 94% for full morphological analyses , and 98%-99% when part-of-speech tag accuracy is not included.
Multilingual Morphological Segmentation
The underlying assumption of our work is that structural commonality across different languages is a powerful source of information for morphological analysis .
morphological analysis is mentioned in 4 sentences in this paper.
Topics mentioned in this paper:
Kazama, Jun'ichi and Torisawa, Kentaro
Experiments
We used MeCab as a morphological analyzer and CaboCha14 (Kudo and Matsumoto, 2002) as the dependency parser to find the boundaries of the bunsetsu.
Gazetteer Induction 2.1 Induction by MN Clustering
After preprocessing the first sentence of an article using a morphological analyzer , MeCab9, we extracted the last noun after the appearance of Japanese postpo-sition “Oi (wa)” (% “is”).
Using Gazetteers as Features of NER
Asahara and Motsumoto (2003) proposed using characters instead of morphemes as the unit to alleviate the effect of segmentation errors in morphological analysis and we also used their character-based method.
morphological analysis is mentioned in 3 sentences in this paper.
Topics mentioned in this paper: