
Statistical Identification of Pleonastic Pronouns

Publication year: 2012
Language: English
Pages: 67-68
Publication/Journal/Series: SLTC 2012
Document type: Conference paper
Publisher: SLTC


This paper describes an algorithm to identify pleonastic pronouns using statistical techniques. The training step uses a
coreference-annotated corpus of English and focuses on a set of pronouns such as it. As far as we know, there is no corpus
with a pleonastic annotation. The main idea of the algorithm is therefore to recast the definition of pleonastic pronouns as
pronouns that never occur in a coreference chain. We integrated this algorithm into an existing coreference solver
(Björkelund and Nugues, 2011) and measured the overall performance gain brought by the removal of pleonastic it. We observed
an improvement of 0.42 points in the CoNLL score, from 59.15. The complete system (Stamborg et al., 2012) participated in the
CoNLL 2012 shared task (Pradhan et al., 2012), where it ranked fourth.
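
The recasting described in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. It assumes a document is a list of tokens and that the coreference annotation is a list of chains, each chain being a list of inclusive (start, end) token spans. The function name `label_pleonastic_candidates`, the span format, and the example sentence are all hypothetical. The sketch only derives training labels ("in a chain" versus "never in a chain"); the statistical classifier trained on such labels is not shown.

```python
from typing import FrozenSet, List, Tuple

Span = Tuple[int, int]  # hypothetical format: (start, end) token indices, end inclusive


def label_pleonastic_candidates(
    tokens: List[str],
    chains: List[List[Span]],
    pronouns: FrozenSet[str] = frozenset({"it"}),
) -> List[Tuple[int, bool]]:
    """Return (token index, is_pleonastic) for each target pronoun.

    A pronoun is labeled pleonastic when it is not a mention in any
    coreference chain, following the recasting given in the abstract.
    """
    # Every mention span that belongs to some chain.
    mentions = {span for chain in chains for span in chain}
    labels = []
    for i, token in enumerate(tokens):
        if token.lower() in pronouns:
            # The pronoun itself must be the mention, i.e. a one-token span.
            labels.append((i, (i, i) not in mentions))
    return labels


# Assumed toy example: the first "It" is in no chain, the second "it"
# corefers with "his umbrella".
tokens = "It is raining , but John took his umbrella because it was new .".split()
chains = [
    [(5, 5), (7, 7)],     # John ... his
    [(7, 8), (10, 10)],   # his umbrella ... it
]
print(label_pleonastic_candidates(tokens, chains))
# → [(0, True), (10, False)]
```

Labels obtained this way inherit any gaps in the coreference annotation: a referential pronoun whose chain was not annotated would be counted as pleonastic.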


  • Computer Science


The Fourth Swedish Language Technology Conference
