Webbläsaren som du använder stöds inte av denna webbplats. Alla versioner av Internet Explorer stöds inte längre, av oss eller Microsoft (läs mer här: * https://www.microsoft.com/en-us/microsoft-365/windows/end-of-ie-support).

Var god och använd en modern webbläsare för att ta del av denna webbplats, som t.ex. nyaste versioner av Edge, Chrome, Firefox eller Safari osv.

REFRACTIVE: An Open Source Tool to Extract Knowledge from Syntactic and Semantic Relations

Författare

Summary, in English

The extraction of semantic propositions has proven instrumental in applications like IBM Watson (Ferrucci, 2012) and in Google’s knowledge graph (Singhal, 2012). One of the core components of IBM Watson is the PRISMATIC knowledge base consisting of one billion propositions extracted from the English version of Wikipedia and the New York Times (Fan et al., 2010). However, extracting the propositions from the English version of Wikipedia is a time-consuming process. In practice, this task requires multiple machines and a computation distribution involving a good deal of system technicalities. In this paper, we describe REFRACTIVE, an open-source tool to extract propositions from a parsed corpus based on the Hadoop variant of MapReduce. While the complete process consists of a parsing part and an extraction part, we focus here on the extraction from the parsed corpus and we hope this tool will help computational linguists speed up the development of applications.

Publiceringsår

2014

Språk

Engelska

Sidor

2584-2589

Publikation/Tidskrift/Serie

Proceedings of LREC 2014, the 9th edition of the Language Resource and Evaluation Conference

Dokumenttyp

Konferensbidrag

Förlag

European Language Resources Association

Ämne

  • Computer Science

Conference name

LREC, The 9th edition of the Language Resources and Evaluation Conference

Conference date

2014-05-28 - 2014-05-30

Status

Published

ISBN/ISSN/Övrigt

  • ISBN: 978-2-9517408-8-4