Augmenting Data for Sarcasm Detection with Unlabeled Conversation Context
Hankyol Lee, Youngjae Yu, Gunhee Kim
Abstract
We present a novel data augmentation technique, CRA (Contextual Response Augmentation), which utilizes conversational context to generate meaningful samples for training. We also mitigate the issues regarding unbalanced context lengths by changing the input output format of the model such that it can deal with varying context lengths effectively. Specifically, our proposed model, trained with the proposed data augmentation technique, participated in the sarcasm detection task of FigLang2020, have won and achieves the best performance in both Reddit and Twitter datasets.- Anthology ID:
- 2020.figlang-1.2
- Volume:
- Proceedings of the Second Workshop on Figurative Language Processing
- Month:
- July
- Year:
- 2020
- Address:
- Online
- Venues:
- ACL | Fig-Lang | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 12–17
- URL:
- https://www.aclweb.org/anthology/2020.figlang-1.2
- DOI:
- PDF:
- https://www.aclweb.org/anthology/2020.figlang-1.2.pdf
You can write comments here (and agree to place them under CC-by). They are not guaranteed to stay and there is no e-mail functionality.