Towards Understanding ASR Error Correction for Medical Conversations

Anirudh Mani; Shruti Palaskar; Sandeep Konam

Towards Understanding ASR Error Correction for Medical Conversations

Anirudh Mani, Shruti Palaskar, Sandeep Konam

Abstract

Domain Adaptation for Automatic Speech Recognition (ASR) error correction via machine translation is a useful technique for improving out-of-domain outputs of pre-trained ASR systems to obtain optimal results for specific in-domain tasks. We use this technique on our dataset of Doctor-Patient conversations using two off-the-shelf ASR systems: Google ASR (commercial) and the ASPIRE model (open-source). We train a Sequence-to-Sequence Machine Translation model and evaluate it on seven specific UMLS Semantic types, including Pharmacological Substance, Sign or Symptom, and Diagnostic Procedure to name a few. Lastly, we breakdown, analyze and discuss the 7% overall improvement in word error rate in view of each Semantic type.

Anthology ID:: 2020.nlpmc-1.2
Volume:: Proceedings of the First Workshop on Natural Language Processing for Medical Conversations
Month:: July
Year:: 2020
Address:: Online
Venues:: ACL | NLPMC | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 7–11
URL:: https://www.aclweb.org/anthology/2020.nlpmc-1.2
DOI:
Bib Export formats:: BibTeX MODS XML EndNote
PDF:: https://www.aclweb.org/anthology/2020.nlpmc-1.2.pdf

You can write comments here (and agree to place them under CC-by). They are not guaranteed to stay and there is no e-mail functionality.

PDF BibTeX Search