
Laboratorio de Lingüística Informática

Linguistic Resources

Comparison of LLI-UAM's corpora


CORPORA
RESOURCE
ACCESS
DESCRIPTION

CORLEC. Reference Spoken Corpus of Contemporary Spanish

Free
Text database (spoken language corpus): 1,100,000 words
Reference Corpus of Spanish in Argentina
Free

Text database (written language corpus): more than 2,000,000 words

Reference Corpus of Spanish in Chile
Free

Text database (written language corpus): 2,000,000 words

Spanish Treebank Corpus
Free

1,500 sentences from newspapers, syntactically annotated

C-ORAL-ROM
Restricted

Multilingual spoken corpus (Spanish, French, Portuguese and Italian) with 300,000 words in each language

MAVIR Corpus
Free

Spoken corpus made up of the lectures from the MAVIR Conferences

MULTIMÉDICA
Restricted

Spanish, Japanese and Arabic medical corpus with nearly 8,000,000 words

C-ORAL JAPÓN
Restricted

Japanese spoken corpus with 50,000 words

CHIEDE. Spontaneous Child Language Corpus of Spanish
Restricted

Spoken child language corpus with 60,000 words

Spanish Learner Oral Corpus
Free

Interlanguage oral corpus of speech by learners of Spanish, with over 50,000 words

French Learner Oral Corpus
Free

Interlanguage oral corpus of speech by learners of French, with over 61,000 words

Arabic-Spanish Corpus
Free

Arabic-Spanish parallel corpus with 1,179 sentences

COREMAH
Free

Multimodal Spanish corpus of speech acts

C-ORAL CHINA
Restricted

Chinese spoken corpus

DIR-SI
Restricted

Bilingual (English and Italian) spoken corpus of speeches and their simultaneous translation

 
TOOLS
RESOURCE
ACCESS
DESCRIPTION
GRAMPAL
Free online

Morphosyntactic tagger

MULTIMÉDICA
Free online

Medical term extraction tool

LETRAS-WEB
Free online

Online tool developed by Prof. Hiroto Ueda for processing and studying corpora

NÚMEROS-WEB
Free online

Online tool developed by Prof. Hiroto Ueda for statistical analysis

Japanese dictionary
Free online

Dictionary of 800 basic Japanese words, with audio

JABALÍN
Free

Morphological analyser and generator of Arabic verbs

OJIME
Free

Modality tagger for Spanish and Japanese

Spanish-French Dictionary
Free

Dictionary of French prepositions

Acoustic database of questions
Restricted

Collection of spoken questions compiled after participation in CLEF



