2020
pdf
bib
abs
Orchestrating NLP Services for the Legal Domain
Julian Moreno-Schneider
|
Georg Rehm
|
Elena Montiel-Ponsoda
|
Víctor Rodriguez-Doncel
|
Artem Revenko
|
Sotirios Karampatakis
|
Maria Khvalchik
|
Christian Sageder
|
Jorge Gracia
|
Filippo Maganza
Proceedings of The 12th Language Resources and Evaluation Conference
Legal technology is currently receiving a lot of attention from various angles. In this contribution we describe the main technical components of a system that is currently under development in the European innovation project Lynx, which includes partners from industry and research. The key contribution of this paper is a workflow manager that enables the flexible orchestration of workflows based on a portfolio of Natural Language Processing and Content Curation services as well as a Multilingual Legal Knowledge Graph that contains semantic information and meaningful references to legal documents. We also describe different use cases with which we experiment and develop prototypical solutions.
pdf
bib
abs
Defying Wikidata: Validation of Terminological Relations in the Web of Data
Patricia Martín-Chozas
|
Sina Ahmadi
|
Elena Montiel-Ponsoda
Proceedings of The 12th Language Resources and Evaluation Conference
In this paper we present an approach to validate terminological data retrieved from open encyclopaedic knowledge bases. This need arises from the enrichment of automatically extracted terms with information from existing resources in theLinguistic Linked Open Data cloud. Specifically, the resource employed for this enrichment is WIKIDATA, since it is one of the biggest knowledge bases freely available within the Semantic Web. During the experiment, we noticed that certain RDF properties in the Knowledge Base did not contain the data they are intended to represent, but a different type of information. In this paper we propose an approach to validate the retrieved data based on four axioms that rely on two linguistic theories: the x-bar theory and the multidimensional theory of terminology.The validation process is supported by a second knowledge base specialised in linguistic data; in this case, CONCEPTNET. In our experiment, we validate terms from the legal domain in four languages: Dutch, English, German and Spanish. The final aim is to generate a set of sound and reliable terminological resources in RDF to contribute to the population of the Linguistic Linked Open Data cloud.
pdf
bib
abs
Recent Developments for the Linguistic Linked Open Data Infrastructure
Thierry Declerck
|
John Philip McCrae
|
Matthias Hartung
|
Jorge Gracia
|
Christian Chiarcos
|
Elena Montiel-Ponsoda
|
Philipp Cimiano
|
Artem Revenko
|
Roser Saurí
|
Deirdre Lee
|
Stefania Racioppa
|
Jamal Abdul Nasir
|
Matthias Orlikowsk
|
Marta Lanau-Coronas
|
Christian Fäth
|
Mariano Rico
|
Mohammad Fazleh Elahi
|
Maria Khvalchik
|
Meritxell Gonzalez
|
Katharine Cooney
Proceedings of The 12th Language Resources and Evaluation Conference
In this paper we describe the contributions made by the European H2020 project “Prêt-à-LLOD” (‘Ready-to-use Multilingual Linked Language Data for Knowledge Services across Sectors’) to the further development of the Linguistic Linked Open Data (LLOD) infrastructure. Prêt-à-LLOD aims to develop a new methodology for building data value chains applicable to a wide range of sectors and applications and based around language resources and language technologies that can be integrated by means of semantic technologies. We describe the methods implemented for increasing the number of language data sets in the LLOD. We also present the approach for ensuring interoperability and for porting LLOD data sets and services to other infrastructures, as well as the contribution of the projects to existing standards.