Du är här

Constructing large proposition databases

Publiceringsår: 2012
Språk: Engelska
Sidor: 3836-3839
Publikation/Tidskrift/Serie: Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)
Dokumenttyp: Konferensbidrag
Förlag: European Language Resources Association (ELRA)

Sammanfattning

With the advent of massive online encyclopedic corpora such as Wikipedia, it has become possible to apply a systematic analysis to a wide range of documents covering a significant part of human knowledge. Using semantic parsers, it has become possible to extract such knowledge in the form of propositions (predicate―argument structures) and build large proposition databases from these documents. This paper describes the creation of multilingual proposition databases using generic semantic dependency parsing. Using Wikipedia, we extracted, processed, clustered, and evaluated a large number of propositions. We built an architecture to provide a complete pipeline dealing with the input of text, extraction of knowledge, storage, and presentation of the resulting propositions

Disputation

Nyckelord

  • Technology and Engineering
  • Knowledge Discovery/Representation
  • Information Extraction
  • Information Retrieval
  • Semantics

Övrigt

The eighth international conference on Language Resources and Evaluation (LREC 2012)
21-27 May 2012
Istanbul, Turkey
Published
Yes

Box 117, 221 00 LUND
Telefon 046-222 00 00 (växel)
Telefax 046-222 47 20
lu [at] lu [dot] se

Fakturaadress: Box 188, 221 00 LUND
Organisationsnummer: 202100-3211
Om webbplatsen

LERU logo U21 logo