Deriving a PropBank Corpus from Parallel FrameNet and UD Corpora
Normunds Gruzitis, Roberts Darģis, Laura Rituma, Gunta Nešpore-Bērzkalne, Baiba Saulite
Abstract
We propose an approach for generating an accurate and consistent PropBank-annotated corpus, given a FrameNet-annotated corpus which has an underlying dependency annotation layer, namely, a parallel Universal Dependencies (UD) treebank. The PropBank annotation layer of such a multi-layer corpus can be semi-automatically derived from the existing FrameNet and UD annotation layers, by providing a mapping configuration from lexical units in [a non-English language] FrameNet to [English language] PropBank predicates, and a mapping configuration from FrameNet frame elements to PropBank semantic arguments for the given pair of a FrameNet frame and a PropBank predicate. The latter mapping generally depends on the underlying UD syntactic relations. To demonstrate our approach, we use Latvian FrameNet, annotated on top of Latvian UD Treebank, for generating Latvian PropBank in compliance with the Universal Propositions approach.
- Anthology ID:
- 2020.framenet-1.9
- Volume:
- Proceedings of the International FrameNet Workshop 2020: Towards a Global, Multilingual FrameNet
- Month:
- May
- Year:
- 2020
- Address:
- Marseille, France
- Venues:
- Framenet | LREC | WS
- Publisher:
- European Language Resources Association
- Pages:
- 63–69
- URL:
- https://www.aclweb.org/anthology/2020.framenet-1.9
- PDF:
- https://www.aclweb.org/anthology/2020.framenet-1.9.pdf
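The two mapping configurations described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation or configuration format: the table names `LU_TO_PREDICATE` and `FE_TO_ARG`, the function `map_argument`, and the example entries (the Latvian lexical unit `dot.v` in the `Giving` frame mapped to the PropBank roleset `give.01`) are assumptions chosen only to show the idea that a frame element is mapped to a PropBank argument for a given frame and predicate pair, optionally conditioned on the UD relation.

```python
from typing import Dict, Optional, Tuple

# Hypothetical configuration 1: (FrameNet frame, lexical unit) -> PropBank roleset.
# The entry below is illustrative and is not taken from the paper.
LU_TO_PREDICATE: Dict[Tuple[str, str], str] = {
    ("Giving", "dot.v"): "give.01",
}

# Hypothetical configuration 2: for a (frame, roleset) pair, map
# (frame element, UD relation) -> PropBank argument.
# A UD relation of None acts as a wildcard: the rule applies to any relation.
FE_TO_ARG: Dict[Tuple[str, str], Dict[Tuple[str, Optional[str]], str]] = {
    ("Giving", "give.01"): {
        ("Donor", "nsubj"): "ARG0",
        ("Donor", "obl"): "ARG0",
        ("Theme", None): "ARG1",
        ("Recipient", None): "ARG2",
    },
}


def map_argument(
    frame: str, lexical_unit: str, frame_element: str, deprel: str
) -> Optional[Tuple[str, str]]:
    """Return (roleset, argument) for one annotated frame element.

    Returns None when no rule applies, so the instance can be left
    for manual review (the derivation is only semi-automatic).
    """
    roleset = LU_TO_PREDICATE.get((frame, lexical_unit))
    if roleset is None:
        return None
    rules = FE_TO_ARG.get((frame, roleset), {})
    # Prefer a rule specific to the UD relation, then fall back to the wildcard.
    argument = rules.get((frame_element, deprel), rules.get((frame_element, None)))
    if argument is None:
        return None
    return roleset, argument


if __name__ == "__main__":
    print(map_argument("Giving", "dot.v", "Donor", "nsubj"))
    print(map_argument("Giving", "dot.v", "Theme", "obj"))
    print(map_argument("Giving", "dot.v", "Donor", "xcomp"))
```

In this sketch, an unmapped lexical unit or an unexpected UD relation yields `None` rather than a guess, which mirrors the semi-automatic nature of the approach: whatever the configuration does not cover is left for a human annotator.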