Karin Verspoor
2020
Evaluating the Utility of Model Configurations and Data Augmentation on Clinical Semantic Textual Similarity
Yuxia Wang
|
Fei Liu
|
Karin Verspoor
|
Timothy Baldwin
Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing
In this paper, we apply pre-trained language models to the Semantic Textual Similarity (STS) task, with a specific focus on the clinical domain. In low-resource setting of clinical STS, these large models tend to be impractical and prone to overfitting. Building on BERT, we study the impact of a number of model design choices, namely different fine-tuning and pooling strategies. We observe that the impact of domain-specific fine-tuning on clinical STS is much less than that in the general domain, likely due to the concept richness of the domain. Based on this, we propose two data augmentation techniques. Experimental results on N2C2-STS 1 demonstrate substantial improvements, validating the utility of the proposed methods.
Domain Adaptation and Instance Selection for Disease Syndrome Classification over Veterinary Clinical Notes
Brian Hur
|
Timothy Baldwin
|
Karin Verspoor
|
Laura Hardefeldt
|
James Gilkerson
Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing
Identifying the reasons for antibiotic administration in veterinary records is a critical component of understanding antimicrobial usage patterns. This informs antimicrobial stewardship programs designed to fight antimicrobial resistance, a major health crisis affecting both humans and animals in which veterinarians have an important role to play. We propose a document classification approach to determine the reason for administration of a given drug, with particular focus on domain adaptation from one drug to another, and instance selection to minimize annotation effort.
Search
Co-authors
- Timothy Baldwin 2
- Yuxia Wang 1
- Fei Liu 1
- Brian Hur 1
- Laura Hardefeldt 1
- show all...