Sophia Ananiadou
2020
Revisiting Unsupervised Relation Extraction
Thy Thy Tran
|
Phong Le
|
Sophia Ananiadou
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
Unsupervised relation extraction (URE) extracts relations between named entities from raw text without manually-labelled data and existing knowledge bases (KBs). URE methods can be categorised into generative and discriminative approaches, which rely either on hand-crafted features or surface form. However, we demonstrate that by using only named entities to induce relation types, we can outperform existing methods on two popular datasets. We conduct a comparison and evaluation of our findings with other URE techniques, to ascertain the important features in URE. We conclude that entity types provide a strong inductive bias for URE.
Semantic Annotation for Improved Safety in Construction Work
Paul Thompson
|
Tim Yates
|
Emrah Inan
|
Sophia Ananiadou
Proceedings of The 12th Language Resources and Evaluation Conference
Risk management is a vital activity to ensure employee safety in construction projects. Various documents provide important supporting evidence, including details of previous incidents, consequences and mitigation strategies. Potential hazards may depend on a complex set of project-specific attributes, including activities undertaken, location, equipment used, etc. However, finding evidence about previous projects with similar attributes can be problematic, since information about risks and mitigations is usually hidden within and may be dispersed across a range of different free text documents. Automatic named entity recognition (NER), which identifies mentions of concepts in free text documents, is the first stage in structuring knowledge contained within them. While developing NER methods generally relies on annotated corpora, we are not aware of any such corpus targeted at concepts relevant to construction safety. In response, we have designed a novel named entity annotation scheme and associated guidelines for this domain, which covers hazards, consequences, mitigation strategies and project attributes. Four health and safety experts used the guidelines to annotate a total of 600 sentences from accident reports; an average inter-annotator agreement rate of 0.79 F-Score shows that our work constitutes an important first step towards developing tools for detailed semantic analysis of construction safety documents.
Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing
Dina Demner-Fushman
|
Kevin Bretonnel Cohen
|
Sophia Ananiadou
|
Junichi Tsujii
Proceedings of the 19th SIGBioMed Workshop on Biomedical Language Processing
Search
Co-authors
- Thy Thy Tran 1
- Phong Le 1
- Paul Thompson 1
- Tim Yates 1
- Emrah Inan 1
- show all...