Entity extraction: From unstructured text to DBpedia RDF triples

Publication year: 2012
Language: English
Document type: Conference paper

Abstract

In this paper, we describe an end-to-end system that automatically extracts RDF triples describing entity relations and properties from unstructured text. This system is based on a pipeline of text processing modules that includes a semantic parser and a coreference solver. By using coreference chains, we group entity actions and properties described in different sentences and convert them into entity triples. We applied our system to over 114,000 Wikipedia articles and extracted more than 1,000,000 triples. Using an ontology-mapping system that we bootstrapped using existing DBpedia triples, we mapped 189,000 extracted triples onto the DBpedia namespace. These extracted entities are available online in the N-Triple format [1].

[1] http://semantica.cs.lth.se/
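
The grouping step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy coreference chain, and the hand-written relations are all hypothetical, and the semantic parser, coreference solver, and ontology mapper are assumed to have already produced their output. The sketch only shows how mentions in one chain might be resolved to a single entity and how the resulting triples could be written as N-Triples lines in the DBpedia namespace.

```python
# Illustrative sketch only: every name and all toy data below are
# hypothetical and are not taken from the paper's system.

DBPEDIA_RESOURCE = "http://dbpedia.org/resource/"
DBPEDIA_ONTOLOGY = "http://dbpedia.org/ontology/"


def resolve_mentions(chains):
    """Map every mention in a coreference chain to the chain's first mention."""
    canonical = {}
    for chain in chains:
        head = chain[0]
        for mention in chain:
            canonical[mention] = head
    return canonical


def to_ntriple(subject, predicate, obj):
    """Serialize one triple of IRIs as a single N-Triples line."""
    return (
        f"<{DBPEDIA_RESOURCE}{subject}> "
        f"<{DBPEDIA_ONTOLOGY}{predicate}> "
        f"<{DBPEDIA_RESOURCE}{obj}> ."
    )


def extract_triples(relations, chains):
    """Replace mentions by their canonical entity, then serialize each relation."""
    canonical = resolve_mentions(chains)
    lines = []
    for subj, pred, obj in relations:
        subj = canonical.get(subj, subj).replace(" ", "_")
        obj = canonical.get(obj, obj).replace(" ", "_")
        lines.append(to_ntriple(subj, pred, obj))
    return lines


# Assumed output of the upstream modules for two sentences about one entity.
chains = [["Ada Lovelace", "She"]]
relations = [
    ("Ada Lovelace", "birthPlace", "London"),
    ("She", "knownFor", "Analytical Engine"),
]

triples = extract_triples(relations, chains)
for line in triples:
    print(line)
```

Because "She" belongs to the same chain as "Ada Lovelace", both relations end up with the same subject IRI, which is the effect the abstract attributes to coreference chains.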

Keywords

  • Technology and Engineering

Other

The Web of Linked Entities Workshop (WoLE 2012)
2012-11-11
Boston, USA
Published
Yes
  • ISSN: 1613-0073
