By Abdelhadi Soudi, Antal van den Bosch, Günter Neumann

The morphology of Arabic poses special challenges to computational natural language processing systems. The exceptional degree of ambiguity in the writing system, the rich morphology, and the highly complex word formation process of roots and patterns all contribute to making computational approaches to Arabic very challenging. Indeed, many computational linguists around the world have taken up this challenge over the years, and many of the researchers with a track record in this research area have contributed to this book.

The book’s subtitle aims to reflect that widely different computational approaches to the Arabic morphological system have been proposed. These accounts fall into two main paradigms: the knowledge-based and the empirical. Since morphological knowledge plays an essential role in any higher-level understanding and processing of Arabic text, the book also includes a part on the role of Arabic morphology in larger applications, i.e. Information Retrieval (IR) and Machine Translation (MT).

Additional info for Arabic Computational Morphology: Knowledge-based and Empirical Methods

Sample text

1. The default structure of the form katab strings.

The most frequent “run-on” words in Arabic are combinations of the high-frequency function words لا /lā/ and ما /mā/, which end in the non-connector alif, with following perfect or imperfect verbs, such as لايزال /lā-yazāl/, مايرام /māyazāl/, and مازال /mā-zāl/. The لا /lā/ of “absolute negation” concatenates freely with nouns, as in لابد /lā-budda/ and لاشك /lā-šakka/.

1-word and 2-word frequencies of run-on words (Google score)

| Run-on word | 4/2006, as 2 words | 4/2006, as 1 word | 4/2005, as 2 words | 4/2005, as 1 word |
|---|---|---|---|---|
| لايمكن | 3,540,000 | 717,000 | 412,000 | 44,300 |
| ماهو | 4,420,000 | 1,850,000 | 792,000 | 87,100 |
| لابد | 2,210,000 | 2,010,000 | 188,000 | 276,000 |
| لاتزال | 1,120,000 | 192,000 | 106,000 | 15,500 |
| ماحدث | 1,190,000 | 155,000 | 97,500 | 7,680 |
| مايتعلق | 1,160,000 | 210,000 | 96,500 | 8,470 |
| لاشك | 928,000 | 356,000 | 83,500 | 30,300 |
| مازالت | 910,000 | 749,000 | 83,400 | 67,800 |
| لاريب | 170,000 | 30,500 | 12,700 | 3,760 |
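To make the frequency figures easier to compare, the following minimal Python sketch computes, for each row of the 4/2006 counts, the share of hits in which the pair is written as a single word. The variable and function names are illustrative and not from the book, and the last row reads the extracted figure "170 000" as 170,000, which is an assumption.

```python
# Google hit counts (4/2006) copied from the excerpt, in table order:
# (hits when written as two words, hits when written as one word).
# The last row assumes the extracted "170 000" means 170,000.
COUNTS_2006 = [
    (3_540_000, 717_000),
    (4_420_000, 1_850_000),
    (2_210_000, 2_010_000),
    (1_120_000, 192_000),
    (1_190_000, 155_000),
    (1_160_000, 210_000),
    (928_000, 356_000),
    (910_000, 749_000),
    (170_000, 30_500),
]


def run_on_share(two_word_hits, one_word_hits):
    """Fraction of all hits in which the pair is written without a space."""
    return one_word_hits / (two_word_hits + one_word_hits)


shares = [run_on_share(two, one) for two, one in COUNTS_2006]

# Print each row with its run-on share as a percentage.
for (two, one), share in zip(COUNTS_2006, shares):
    print(f"{two:>10,} {one:>10,} {share:6.1%}")
```

Under these counts, the run-on spelling accounts for a substantial minority of hits in several rows, which is consistent with the excerpt's point that such forms are frequent.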

Abstract: Syllable-based morphology is an approach to morphology that considers syllables to be the primary concept in morphological description. The theory proposes that, other than simple affixation, morphological processes or operations are best defined in terms of the resulting syllabic structure, with syllable constituents (onset, peak, coda) being defined according to the morphosyntactic status of the form.

