Memory-based Named Entity Recognition in Tweets

Antal Van den Bosch, Toine Bogers

Publication: Conference article in proceedings or book/report chapter; Conference contribution in proceedings; Research; Peer-reviewed

Abstract

We present a memory-based named entity recognition system that participated in the MSM-2013 Concept Extraction Challenge. The system expands the training set of annotated tweets with part-of-speech tags and seedlist information, and then generates a sequential memory-based tagger comprised of separate modules for known and unknown words. Two taggers are trained: one on the original capitalized data, and one on a lowercased version of the training data. The intersection of named entities in the predictions of the two taggers is kept as the final output.
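The final combination step described in the abstract can be sketched as follows. This is only an illustration, not the authors' implementation. It assumes that both taggers emit BIO-style tags over the same tokens, and that "intersection" means an entity is kept only when both taggers agree on its exact boundaries and type; the abstract does not specify either point. The function names `bio_to_spans` and `intersect_predictions` and the entity types in the example are hypothetical.

```python
from typing import List, Set, Tuple

# (start token index, end token index exclusive, entity type)
Span = Tuple[int, int, str]


def bio_to_spans(tags: List[str]) -> Set[Span]:
    """Collect entity spans from a BIO tag sequence (assumed tag format)."""
    spans: Set[Span] = set()
    start = None
    etype = None
    # A trailing "O" sentinel closes an entity that runs to the last token.
    for i, tag in enumerate(tags + ["O"]):
        continues = tag.startswith("I-") and etype == tag[2:]
        if start is not None and not continues:
            spans.add((start, i, etype))
            start, etype = None, None
        # A stray "I-" tag with no open entity is leniently treated as a start.
        if tag.startswith("B-") or (tag.startswith("I-") and start is None):
            start, etype = i, tag[2:]
    return spans


def intersect_predictions(cased_tags: List[str],
                          lowercased_tags: List[str]) -> List[str]:
    """Keep only entities predicted identically by both taggers.

    Assumption: exact match on boundaries and type defines the intersection.
    """
    if len(cased_tags) != len(lowercased_tags):
        raise ValueError("both taggers must label the same token sequence")
    kept = bio_to_spans(cased_tags) & bio_to_spans(lowercased_tags)
    merged = ["O"] * len(cased_tags)
    for start, end, etype in kept:
        merged[start] = "B-" + etype
        for i in range(start + 1, end):
            merged[i] = "I-" + etype
    return merged


# Illustrative predictions for a five-token tweet (made-up labels).
cased = ["B-PER", "I-PER", "O", "B-LOC", "O"]
lowercased = ["B-PER", "I-PER", "O", "O", "B-ORG"]
final_tags = intersect_predictions(cased, lowercased)
print(final_tags)
```

In this example only the two-token PER entity survives, because it is the only span on which both tag sequences agree. Requiring agreement trades recall for precision: an entity found by just one tagger is discarded.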
Original language: English
Title: MSM 2013: Proceedings of the 3rd WWW Workshop on Making Sense of Microposts
Editors: Amparo Cano, Matthew Rowe, Milan Stankovic, Aba-Sah Dadzie
Number of pages: 4
Publisher: CEUR Workshop Proceedings
Publication date: 13 May 2013
Pages: 40-43
Status: Published - 13 May 2013
Published externally: Yes
Event: 3rd workshop on 'Making Sense of Microposts' - Rio de Janeiro, Brazil
Duration: 13 May 2013 → …
http://oak.dcs.shef.ac.uk/msm2013/

Conference

Conference: 3rd workshop on 'Making Sense of Microposts'
Country/Territory: Brazil
City: Rio de Janeiro
Period: 13/05/2013 → …
Series name: CEUR Workshop Proceedings
Volume: 1019
ISSN: 1613-0073
