Book Title: Proceedings of the First International Workshop on Formalisms and Methodology for Learning by Reading
Date: June 6, 2010
Abstract: This paper describes a hybrid approach to unsupervised, unrestricted relation discovery between entities, using output from linguistic analysis and semantic typing information from a knowledge base. We use Factz (encoded as subject, predicate, and object triples) produced by Powerset as a result of linguistic analysis. A particular relation may be expressed in a variety of ways in text and hence have multiple facts associated with it. We present an unsupervised approach for collapsing multiple facts that represent the same kind of semantic relation between entities. A label is then selected for the relation based on the input facts and an entropy-based ranking of context words. Finally, we demonstrate relation discovery between entities at different levels of abstraction by leveraging semantic typing information from a knowledge base.
Type: InProceedings
Tags: information extraction, learning, natural language processing, powerset
Attachments: 481.pdf (downloads: 1147)