Empirically sampling Universal Dependencies

Natalie Schluter, Zeljko Agic

Research output: Journal Article or Conference Article in Journal; Conference article; Research; peer-review

Abstract

Universal Dependencies incur a high cost in computation for unbiased system development. We propose a small, entirely empirically chosen subset of UD languages for efficient parsing system development. The technique is based on global measurements of model capacity. We show that the diversity of the resulting representative language set is superior to that of the set produced by the requirements-based procedure.
Original language: English
Journal: NEALT (Northern European Association of Language Technology) Proceedings Series
Volume: 31
Pages (from-to): 117-122
ISSN: 1736-6305
Publication status: Published - 2017
Event: NoDaLiDa 2017 Workshop on Universal Dependencies - Gothenburg, Sweden
Duration: 22 May 2017 → …

Workshop

Workshop: NoDaLiDa 2017 Workshop on Universal Dependencies
Country/Territory: Sweden
City: Gothenburg
Period: 22/05/2017 → …

Keywords

  • Universal Dependencies
  • Parsers
  • Language subset selection
  • Model capacity
  • Empirical methods
