A Fine-grained Sentiment Dataset for Norwegian
Lilja Øvrelid, Petter Mæhlum, Jeremy Barnes, Erik Velldal
Abstract
We here introduce NoReC_fine, a dataset for fine-grained sentiment analysis in Norwegian, annotated with respect to polar expressions, targets and holders of opinion. The underlying texts are taken from a corpus of professionally authored reviews from multiple news-sources and across a wide variety of domains, including literature, games, music, products, movies and more. We here present a detailed description of this annotation effort. We provide an overview of the developed annotation guidelines, illustrated with examples and present an analysis of inter-annotator agreement. We also report the first experimental results on the dataset, intended as a preliminary benchmark for further experiments.- Anthology ID:
- 2020.lrec-1.618
- Volume:
- Proceedings of The 12th Language Resources and Evaluation Conference
- Month:
- May
- Year:
- 2020
- Address:
- Marseille, France
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association
- Note:
- Pages:
- 5025–5033
- URL:
- https://www.aclweb.org/anthology/2020.lrec-1.618
- DOI:
- PDF:
- https://www.aclweb.org/anthology/2020.lrec-1.618.pdf
You can write comments here (and agree to place them under CC-by). They are not guaranteed to stay and there is no e-mail functionality.