Einar Freyr Sigurðsson
2020
Language Technology Programme for Icelandic 2019-2023
Anna Nikulásdóttir
|
Jón Guðnason
|
Anton Karl Ingason
|
Hrafn Loftsson
|
Eiríkur Rögnvaldsson
|
Einar Freyr Sigurðsson
|
Steinþór Steingrímsson
Proceedings of The 12th Language Resources and Evaluation Conference
In this paper, we describe a new national language technology programme for Icelandic. The programme, which spans a period of five years, aims at making Icelandic usable in communication and interactions in the digital world, by developing accessible, open-source language resources and software. The research and development work within the programme is carried out by a consortium of universities, institutions, and private companies, with a strong emphasis on cooperation between academia and industries. Five core projects will be the main content of the programme: language resources, speech recognition, speech synthesis, machine translation, and spell and grammar checking. We also describe other national language technology programmes and give an overview over the history of language technology in Iceland.
Parsing Icelandic Alþingi Transcripts: Parliamentary Speeches as a Genre
Kristján Rúnarsson
|
Einar Freyr Sigurðsson
Proceedings of the Second ParlaCLARIN Workshop
We introduce a corpus of transcripts from Alþingi, the Icelandic parliament. The corpus is syntactically parsed for phrase structure according to the annotation scheme of the Icelandic Parsed Historical Corpus (IcePaHC). This addition to IcePaHC makes it more diverse with respect to text types and we argue that having a syntactically parsed corpus facilitates research on differt types of texts. We furthermore argue that the speech corpus can be treated somewhat like spoken language even though the transcripts differ in various ways from daily spoken language. We also compare this text type to other types and argue that this genre can shed light on their properties. Finally, we exhibit how this addition to IcePaHC has helped us in identifying and solving issues with our parsing scheme.
Search
Co-authors
- Anna Nikulásdóttir 1
- Jón Guðnason 1
- Anton Karl Ingason 1
- Hrafn Loftsson 1
- Eirikur Rögnvaldsson 1
- show all...