Anjalie Field
2020
A Generative Approach to Titling and Clustering Wikipedia Sections
Anjalie Field
|
Sascha Rothe
|
Simon Baumgartner
|
Cong Yu
|
Abe Ittycheriah
Proceedings of the Fourth Workshop on Neural Generation and Translation
We evaluate the performance of transformer encoders with various decoders for information organization through a new task: generation of section headings for Wikipedia articles. Our analysis shows that decoders containing attention mechanisms over the encoder output achieve high-scoring results by generating extractive text. In contrast, a decoder without attention better facilitates semantic encoding and can be used to generate section embeddings. We additionally introduce a new loss function, which further encourages the decoder to generate high-quality embeddings.
Demoting Racial Bias in Hate Speech Detection
Mengzhou Xia
|
Anjalie Field
|
Yulia Tsvetkov
Proceedings of the Eighth International Workshop on Natural Language Processing for Social Media
In the task of hate speech detection, there exists a high correlation between African American English (AAE) and annotators’ perceptions of toxicity in current datasets. This bias in annotated training data and the tendency of machine learning models to amplify it cause AAE text to often be mislabeled as abusive/offensive/hate speech (high false positive rate) by current hate speech classifiers. Here, we use adversarial training to mitigate this bias. Experimental results on one hate speech dataset and one AAE dataset suggest that our method is able to reduce the false positive rate for AAE text with only a minimal compromise on the performance of hate speech classification.
Search
Co-authors
- Sascha Rothe 1
- Simon Baumgartner 1
- Cong Yu 1
- Abe Ittycheriah 1
- Mengzhou Xia 1
- show all...