A Processing Platform Relating Data and Tools for Romanian Language

Vasile Păiș, Radu Ion, Dan Tufiș


Abstract
This paper presents RELATE (http://relate.racai.ro), a high-performance natural language platform designed for Romanian language. It is meant both for demonstration of available services, from text-span annotations to syntactic dependency trees as well as playing or automatically synthesizing Romanian words, and for the development of new annotated corpora. It also incorporates the search engines for the large COROLA reference corpus of contemporary Romanian and the Romanian wordnet. It integrates multiple text and speech processing modules and exposes their functionality through a web interface designed for the linguist researcher. It makes use of a scheduler-runner architecture, allowing processing to be distributed across multiple computing nodes. A series of input/output converters allows large corpora to be loaded, processed and exported according to user preferences.
Anthology ID:
2020.iwltp-1.13
Volume:
Proceedings of the 1st International Workshop on Language Technology Platforms
Month:
May
Year:
2020
Address:
Marseille, France
Venues:
IWLTP | LREC | WS
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
81–88
URL:
https://www.aclweb.org/anthology/2020.iwltp-1.13
DOI:
Bib Export formats:
BibTeX MODS XML EndNote
PDF:
https://www.aclweb.org/anthology/2020.iwltp-1.13.pdf

You can write comments here (and agree to place them under CC-by). They are not guaranteed to stay and there is no e-mail functionality.