Towards Understanding ASR Error Correction for Medical Conversations

Anirudh Mani, Shruti Palaskar, Sandeep Konam


Abstract
Domain Adaptation for Automatic Speech Recognition (ASR) error correction via machine translation is a useful technique for improving out-of-domain outputs of pre-trained ASR systems to obtain optimal results for specific in-domain tasks. We use this technique on our dataset of Doctor-Patient conversations using two off-the-shelf ASR systems: Google ASR (commercial) and the ASPIRE model (open-source). We train a Sequence-to-Sequence Machine Translation model and evaluate it on seven specific UMLS Semantic types, including Pharmacological Substance, Sign or Symptom, and Diagnostic Procedure to name a few. Lastly, we breakdown, analyze and discuss the 7% overall improvement in word error rate in view of each Semantic type.
Anthology ID:
2020.nlpmc-1.2
Volume:
Proceedings of the First Workshop on Natural Language Processing for Medical Conversations
Month:
July
Year:
2020
Address:
Online
Venues:
ACL | NLPMC | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
7–11
URL:
https://www.aclweb.org/anthology/2020.nlpmc-1.2
DOI:
Bib Export formats:
BibTeX MODS XML EndNote
PDF:
https://www.aclweb.org/anthology/2020.nlpmc-1.2.pdf

You can write comments here (and agree to place them under CC-by). They are not guaranteed to stay and there is no e-mail functionality.