Publikationer
Building Frame Semantics Resources for Scandinavian and Baltic Languages
Redaktör:
- Pierre Nugues (Docent)
- Richard Johansson
Avdelning/ar:
Publiceringsår: 2007
Språk: Engelska
Publikation/Tidskrift/Serie: LU-CS-TR: 2007-240
Fulltext:
Dokumenttyp: Konferenstryck: redaktör
Förlag: Department of Computer Science, Lund University
Sammanfattning
Annotated data with role-semantic information are becoming an ever more important resource for
many semantic systems. They form the core element to develop large coverage, high-performance,
and reusable semantic parsers, classifiers as well as applications that include lexicography, term and information
extraction, semantic processing of the web, text-to-scene conversion systems, etc.
Existing examples of role annotated corpora/resources include for English: FrameNet, PropBank, and
VerbNet, for German: Salsa, and for Spanish: Spanish FrameNet. However, the two main initiatives
outside English take FrameNet as a semantic pivot and attempt to derive or adapt frames to the target
language using manual work or semiautomatic systems.
As frequently observed, the itemization of frames and lexical units and their manual annotation in a
corpus is an expensive task that requires a relatively long-term and dedicated commitment. Such an
effort is currently beyond the reach of most research teams in the Nordic/Baltic area, which could impair
the quality, and possibly the existence, of future semantic applications in these languages. This
makes the construction of a role-semantic annotated corpus and the design of automatic or semiautomatic
transfer methods a challenge as well as an opportunity.
These proceedings contain the seven papers of the FRAME 2007 workshop, which was held on May
24, 2007 in Tartu, Estonia. They provide perspectives from various areas on current research in frame
semantics and we hope they will foster new ideas to start the construction of role-annotated corpora
in the Nordic/Baltic region and possibly share it across families of related languages.
many semantic systems. They form the core element to develop large coverage, high-performance,
and reusable semantic parsers, classifiers as well as applications that include lexicography, term and information
extraction, semantic processing of the web, text-to-scene conversion systems, etc.
Existing examples of role annotated corpora/resources include for English: FrameNet, PropBank, and
VerbNet, for German: Salsa, and for Spanish: Spanish FrameNet. However, the two main initiatives
outside English take FrameNet as a semantic pivot and attempt to derive or adapt frames to the target
language using manual work or semiautomatic systems.
As frequently observed, the itemization of frames and lexical units and their manual annotation in a
corpus is an expensive task that requires a relatively long-term and dedicated commitment. Such an
effort is currently beyond the reach of most research teams in the Nordic/Baltic area, which could impair
the quality, and possibly the existence, of future semantic applications in these languages. This
makes the construction of a role-semantic annotated corpus and the design of automatic or semiautomatic
transfer methods a challenge as well as an opportunity.
These proceedings contain the seven papers of the FRAME 2007 workshop, which was held on May
24, 2007 in Tartu, Estonia. They provide perspectives from various areas on current research in frame
semantics and we hope they will foster new ideas to start the construction of role-annotated corpora
in the Nordic/Baltic region and possibly share it across families of related languages.
Disputation
Nyckelord
- Technology and Engineering
- Frame semantics
- natural language processing
- Semantic analysis
Övrigt
Building Frame Semantics Resources for Scandinavian and Baltic Languages
2007-05-24
Tartu, Estonia
Yes
- ISBN: 978-91-976939-0-5

