Using distant supervision to build a proposition bank
Författare
Summary, in English
Semantic role labeling has become a key module of many language processing applications. To build an unrestricted semantic
role labeler, the first step is to develop a comprehensive proposition bank. However, building such a bank is a costly enterprise,
which has only been achieved for a handful of languages. In this paper, we describe a technique to build proposition banks
for new languages using distant supervision. Starting from PropBank in English and loosely parallel corpora such as versions
of Wikipedia in different languages, we carried out a mapping of semantic propositions we extracted from English to syntactic
structures in Swedish using named entities. We could identify 2,333 predicate–argument frames in Swedish.
role labeler, the first step is to develop a comprehensive proposition bank. However, building such a bank is a costly enterprise,
which has only been achieved for a handful of languages. In this paper, we describe a technique to build proposition banks
for new languages using distant supervision. Starting from PropBank in English and loosely parallel corpora such as versions
of Wikipedia in different languages, we carried out a mapping of semantic propositions we extracted from English to syntactic
structures in Swedish using named entities. We could identify 2,333 predicate–argument frames in Swedish.
Avdelning/ar
Publiceringsår
2014
Språk
Engelska
Länkar
Dokumenttyp
Konferensbidrag
Ämne
- Computer Science
Conference name
The Fifth Swedish Language Technology Conference (SLTC 2014)
Conference date
2014-11-13 - 2014-11-14
Conference place
Uppsala, Sweden
Status
Published