GRAMPAL was originally developed as a morphological processor of Spanish for written texts (Moreno 1991, Moreno y Goñi 1995). To annotate C-ORAL-ROM, new modules were developed specifically for spoken Spanish: a tokenizer, disambiguation modules and a recognizer of unknown words (Moreno y Guirao 2003). These modules adapt the morphological processor to the features of the spoken language. The GRAMPAL lexicons comprise around 50,000 entries of stems, endings and multiwords. All dictionary entries, except dependent morphemes and multiwords, carry morphological and Part-of-Speech information, including lemmas. In addition, GRAMPAL contains 239 prefixes, which form new words without changing the category of the base.
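
The lexicon organization described above (stems, endings and category-preserving prefixes) can be illustrated with a minimal sketch. This is not GRAMPAL's implementation: the lexicon entries, the inflection-class labels, the `Analysis` record and the `analyze` function are all hypothetical and chosen only for illustration, and the toy endings cover only two present-indicative forms.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Analysis:
    form: str      # analyzed word form
    lemma: str     # dictionary lemma
    pos: str       # Part-of-Speech tag
    features: str  # morphological features


# Hypothetical toy lexicons, not GRAMPAL data.
# Each stem maps to (lemma, POS, inflection class).
STEMS = {
    "cant": ("cantar", "V", "ar"),
    "le": ("leer", "V", "er"),
}
# Each ending maps to the inflection classes it combines with and the features it adds.
ENDINGS = {
    "o": [("ar", "pres.ind.1sg"), ("er", "pres.ind.1sg")],
    "amos": [("ar", "pres.ind.1pl")],
    "emos": [("er", "pres.ind.1pl")],
}
# Prefixes derive a new word but keep the category of the base.
PREFIXES = ["re"]


def analyze(form, allow_prefix=True):
    """Return every stem + ending analysis of form, optionally stripping one prefix."""
    results = []
    # Try every split of the form into a candidate stem and ending.
    for i in range(1, len(form)):
        stem, ending = form[:i], form[i:]
        if stem in STEMS and ending in ENDINGS:
            lemma, pos, infl_class = STEMS[stem]
            for ending_class, features in ENDINGS[ending]:
                if ending_class == infl_class:
                    results.append(Analysis(form, lemma, pos, features))
    if allow_prefix:
        for prefix in PREFIXES:
            if form.startswith(prefix) and len(form) > len(prefix):
                # Analyze the base, then rebuild the lemma; the POS is unchanged.
                for base in analyze(form[len(prefix):], allow_prefix=False):
                    results.append(
                        Analysis(form, prefix + base.lemma, base.pos, base.features)
                    )
    return results


for word in ("cantamos", "releo"):
    print(word, analyze(word))
```

In this sketch, a form such as "releo" has no stem entry of its own; it is analyzed by stripping the prefix "re", analyzing "leo" as stem "le" plus ending "o", and restoring the prefix on the lemma while keeping the verb category.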


