Conclusion | For future work, we want to expand our work to other dialects, while utilizing dialectal morphological analysis to improve conversion. |
Previous Work | Sawaf (2010) proposed a dialect to MSA normalization that used character-level rules and morphological analysis . |
Previous Work | We tokenized Egyptian and Arabic according to the ATB tokenization scheme using the MADA+TOKAN morphological analyzer and to-kenizer v3.1 (Roth et al., 2008). |
Proposed Methods 3.1 Egyptian to EG’ Conversion | Perhaps a morphological analyzer , or just a part-of-speech tagger, could enforce (or probabilistically encourage) a match in parts of speech. |
Proposed Methods 3.1 Egyptian to EG’ Conversion | In particular, using a morphological analyzer seeems like a promising possibility. |
Proposed Methods 3.1 Egyptian to EG’ Conversion | One approach could be to run a morphological analyzer for dialectal Arabic (e.g. |
Abstract | We also show that finite-state morphological analyzers are effective sources of type information when few labeled examples are available. |
Data | While we do not explore a rule-writing approach to POS-tagging, we do consider the impact of rule-based morphological analyzers as a component in our semi-supervised POS-tagging system. |
Introduction | We also did not consider morphological analyzers as a form of type supervision, as suggested by Merialdo (1994). |
Introduction | Also, morphological analyzers help for morphologically rich languages when there are few labeled types or tokens (and, it never hurts to use them). |
Morphological Transducers | We use FSTs for morphological analysis : the FST accepts a word type and produces a set of morphological features. |
Approach to Sentence-Level Dialect Identification | The aforementioned approach relies on language models (LM) and MSA and EDA Morphological Analyzer to decide whether each word is (a) MSA, (b) EDA, (c) Both (MSA & EDA) or (d) OOV. |
Approach to Sentence-Level Dialect Identification | Percentage of words in the sentence that is analyzable by an MSA morphological analyzer . |
Approach to Sentence-Level Dialect Identification | Percentage of words in the sentence that is analyzable by an EDA morphological analyzer . |