Skip to main navigation Skip to search Skip to main content

DaNLP: An open-source toolkit for Danish Natural Language Processing

  • Amalie Brogaard Pauli
  • , Maria Jung Barrett
  • , Ophélie Lacroix
  • , Rasmus Hvingelby

Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

Abstract

We present an open-source toolkit for Danish natural language processing (NLP), enabling easy access to Danish NLP’s latest advancements. The toolkit features wrapper functions for loading models and datasets in a unified way using third-party NLP frameworks. The toolkit is developed to enhance community building, understanding the need from industry and knowledge sharing. As an example of this, we present Angry Tweets: An Annotation Game to increase Danish NLP awareness and create a new sentiment-annotated dataset.
Original languageEnglish
Title of host publicationProceedings of the 23rd Nordic Conference on Computational Linguistics
PublisherLinköping University Press
Publication date2021
Pages460-466
Publication statusPublished - 2021
EventNordic Conference on Computational Linguistics - Rejkjavik, Iceland
Duration: 31 May 20212 Jun 2021
Conference number: 23

Conference

ConferenceNordic Conference on Computational Linguistics
Number23
Country/TerritoryIceland
CityRejkjavik
Period31/05/202102/06/2021

Keywords

  • Danish natural language processing
  • open-source toolkit
  • model and dataset wrappers
  • community building in NLP
  • sentiment-annotated dataset

Fingerprint

Dive into the research topics of 'DaNLP: An open-source toolkit for Danish Natural Language Processing'. Together they form a unique fingerprint.

Cite this