ITU

Natural Language Processing

Organisational unit: Research Group

IT University of Copenhagen
Rued Langgaards Vej 7
DK-2300 Copenhagen S
Denmark

Contact information

Organisation profile

Natural Language Processing (NLP) uses machine learning and other techniques to parse, analyse, translate and understand texts in human languages such as English or Danish. The work of ITU NLP researchers include transfer learning, representation learning, analysis of clinical patient records, automatic summarization, corpora building, stance detection, fake news analysis, and much more. 

  1. Published

    What do You Mean by Relation Extraction?
 A Survey on Datasets and Study on Scientific Relation Classification

    Bassignana, E. & Plank, B., 2022, The 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop. Dublin, Ireland: Association for Computational Linguistics, Vol. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop. p. 67–83 17 p.

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  2. Published

    SkillSpan: Hard and Soft Skill Extraction from English Job Postings

    Zhang, M., Jensen, K. N., Sonniks, S. D. & Plank, B., 9 Jul 2022, 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics. Association for Computational Linguistics

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  3. Published

    Kompetencer: Fine-grained Skill Classification in Danish Job Postings via Distant Supervision and Transfer Learning

    Zhang, M., Jensen, K. N. & Plank, B., 16 Jun 2022, 13th International Conference on Language Resources and Evaluation. European Language Resources Association (ELRA), p. 436-447 11 p.

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  4. Published

    Hyperparameter Power Impact in Transformer Language Model Training

    Puvis de Chavannes, L. H., Kongsbak, M. G. K., Rantzau, T. & Derczynski, L., 1 Nov 2021, Proceedings of the Second Workshop on Simple and Efficient Natural Language Processing. Association for Computational Linguistics

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  5. Published

    DanFEVER: claim verification dataset for Danish

    Nørregaard, J. & Derczynski, L., 1 Jun 2021, Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa). Northern European Association for Language Technology (NEALT)

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  6. Published

    The Danish Gigaword Corpus

    Derczynski, L., Ciosici, M. R., Baglini, R., Christiansen, M., Dalsgaard, J. A., Fusaroli, R., Henrichsen, P. J., Hvingelby, R., Kirkedal, A. S., Kjeldsen, A. S., Ladefoged, C., Nielsen, F. Å., Madsen, J., Petersen, M. L., Rystrøm, J. H. & Varab, D., 1 Jun 2021, Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa). Northern European Association for Language Technology (NEALT)

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  7. Published

    How Universal is Genre in Universal Dependencies?

    Müller-Eberstein, M., van der Goot, R. & Plank, B., Dec 2021, Proceedings of the 20th International Workshop on Treebanks and Linguistic Theories (TLT, SyntaxFest 2021). Sofia, Bulgaria: Association for Computational Linguistics, p. 69-85

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  8. Published

    Unsupervised discovery of unaccusative and unergative verbs

    Loáiciga, S., Bevacqua, L. & Hardmeier, C., 2021.

    Research output: Working paperPreprintResearch

  9. Published

    How to write a bias statement: Recommendations for submissions to the Workshop on Gender Bias in NLP

    Hardmeier, C., Costa-jussà, M. R., Webster, K., Radford, W. & Blodgett, S. L., 2021.

    Research output: Working paperPreprintResearch

  10. Published

    Proceedings of the Third Workshop on Gender Bias in Natural Language Processing

    Costa-jussà, M. R. (ed.), Gonen, H. (ed.), Hardmeier, C. (ed.) & Webster, K. (ed.), 2021, Association for Computational Linguistics.

    Research output: Book / Anthology / Report / Ph.D. thesisAnthologyResearchpeer-review

  11. Published

    Proceedings of the Second Workshop on Computational Approaches to Discourse (CODI)

    Braud, C. (ed.), Hardmeier, C. (ed.), Li, J. J. (ed.), Louis, A. (ed.), Strube, M. (ed.) & Zeldes, A. (ed.), 2021, Association for Computational Linguistics.

    Research output: Book / Anthology / Report / Ph.D. thesisAnthologyResearchpeer-review

  12. Published

    Event and entity coreference across five European languages: Effects of context and referring expression

    Bevacqua, L., Loáiciga, S., Rohde, H. & Hardmeier, C., 18 Dec 2021, In: Dialogue and Discourse.

    Research output: Journal Article or Conference Article in JournalJournal articleResearchpeer-review

  13. Published

    Exploring the importance of source text in automatic post-editing for context-aware machine translation

    Wang, C., Hardmeier, C. & Sennrich, R., 2021, Proceedings of the 23rd Nordic Conference on Computational Linguistics (NODALIDA). Linköping University Electronic Press, p. 326-335

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  14. Published

    A mention-based system for revision requirements detection

    Ruby, A., Hardmeier, C. & Stymne, S., 2021, Proceedings of the First Workshop on Understanding Implicit and Underspecified Language (UnImplicit). Association for Computational Linguistics, p. 58-63

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  15. Published

    MultiLexNorm: A Shared Task on Multilingual Lexical Normalization

    van der Goot, R., Ramponi, A., Zubiaga, A., Plank, B., Muller, B., San Vicente Roncal, I., Ljubešic´, N., Çetinoğlu, Ö., Mahendra, R., Çolakoglu, T., Baldwin, T., Caselli, T. & Sidorenko, W., Nov 2021, Proceedings of the Seventh Workshop on Noisy User-generated Text (W-NUT 2021). Association for Computational Linguistics, p. 493–509 16 p.

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  16. Published

    CL-MoNoise: Cross-lingual Lexical Normalization

    van der Goot, R., Oct 2021, Proceedings of the Seventh Workshop on Noisy User-generated Text (W-NUT 2021). Association for Computational Linguistics, p. 510 4 p.

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  17. Published

    We Need to Talk About train-dev-test Splits

    van der Goot, R., Oct 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, p. 4485 9 p.

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  18. Published

    Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get?

    Iliescu, D-M., Grand, R., van der Goot, R. & Qirko, S., Jun 2021, Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching. Association for Computational Linguistics, p. 65 6 p. (Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching).

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  19. Published

    Resources and Evaluations for Danish Entity Resolution

    Barrett, M. J., Lam, H., Wu, M., Lacroix, O., Plank, B. & Søgaard, A., 2021, Fourth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC). Association for Computational Linguistics, p. 63–69

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  20. Published

    DaNLP: An open-source toolkit for Danish Natural Language Processing

    Pauli, A. B., Barrett, M. J., Lacroix, O. & Hvingelby, R., 2021, Proceedings of the 23rd Nordic Conference on Computational Linguistics. Linköping Electronic Conference Proceedings, p. 460-466

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  21. Published

    Decoding EEG brain activity for multi-modal natural language processing

    Hollenstein, N., Renggli, C., Glaus, B., Barrett, M. J., Troendle, M., Langer, N. & Zhang, C., 2021, In: Frontiers in Human Neuroscience. 15, 659410.

    Research output: Journal Article or Conference Article in JournalJournal articleResearchpeer-review

  22. Published

    Spurious Correlations in Cross-Topic Argument Mining

    Jakobsen, T. S. T., Barrett, M. J. & Søgaard, A., 2021, Proceedings of *SEM 2021: The Tenth Joint Conference on Lexical and Computational Semantics. Association for Computational Linguistics, p. 263-277 9 p.

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  23. Published

    PROCAT: Product Catalogue Dataset for Implicit Clustering, Permutation Learning and Structure Prediction

    Jurewicz, M. & Derczynski, L., 1 Dec 2021, Thirty-fifth Conference on Neural Information Processing Systems: Datasets and Benchmarks Track. 2021 ed. Vol. 1.

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  24. Published

    Genre as Weak Supervision for Cross-lingual Dependency Parsing

    Müller-Eberstein, M., van der Goot, R. & Plank, B., Nov 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Online and Punta Cana, Dominican Republic: Association for Computational Linguistics, p. 4786-4802

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  25. Published

    "I’ll be there for you": The One with Understanding Indirect Answers

    Damgaard, C., Toborek, P., Eriksen, T. & Plank, B., 2021, The Second Workshop on Computational Approaches to Discourse. Association for Computational Linguistics

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  26. Published

    Finding the needle in a haystack: Extraction of Informative COVID-19 Danish Tweets

    Olsen, B. A. & Plank, B., 2021, Proceedings of the 2021 EMNLP Workshop W-NUT: The Seventh Workshop on Noisy User-generated Text. Association for Computational Linguistics, p. 11–19

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  27. Published

    Cartography Active Learning

    Zhang, M. & Plank, B., 8 Nov 2021, Findings of the Association for Computational Linguistics: EMNLP 2021. Association for Computational Linguistics, p. 395–406

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  28. Published

    Cross-Lingual Cross-Domain Nested Named Entity Evaluation on English Web Texts

    Plank, B., 2021, Findings of ACL 2021. Association for Computational Linguistics, p. 1808 1815 p.

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  29. Published

    Set-to-Sequence Methods in Machine Learning: A Review

    Jurewicz, M. & Derczynski, L., 12 Aug 2021, In: The Journal of Artificial Intelligence Research. 71, p. 885-924

    Research output: Journal Article or Conference Article in JournalJournal articleResearchpeer-review

  30. Published

    Annotating Online Misogyny

    Zeinert, P., Inie, N. & Derczynski, L., 3 Aug 2021, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Association for Computational Linguistics, p. 3181–3197

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  31. Published

    We Need to Consider Disagreement in Evaluation

    Basile, V., Fell, M., Fornaciari, T., Hovy, D., Paun, S., Plank, B., Poesio, M. & Uma, A., 2021, ACL-IJCNLP2021 Workshop on Benchmarking: Past, Present and Future. Association for Computational Linguistics, p. 15-21

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  32. Published

    Challenges in Annotating and Parsing Spoken, Code-switched, Frisian-Dutch Data

    Braggaar, A. & van der Goot, R., Apr 2021, Proceedings of the Second Workshop on Domain Adaptation for NLP. Association for Computational Linguistics, p. 50-58

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  33. Published

    Lexical Normalization for Code-switched Data and its Effect on POS Tagging

    van der Goot, R. & Çetinoğlu, Ö., Apr 2021, Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. Association for Computational Linguistics, p. 2352-2365 13 p.

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  34. Published

    Cross-lingual Multi-task Transfer for Zero-shot Task-oriented Dialog

    van der Goot, R., Stepanovic, M., Ramponi, A., Sharaf, I., Üstün, A., Imankulova, A., Khairunnisa, S. O., Komachi, M. & Plank, B., 25 Sep 2021.

    Research output: Contribution to conference - NOT published in proceeding or journalConference abstract for conferenceResearchpeer-review

  35. Published

    Creating a Universal Dependencies Treebank of Spoken Frisian-Dutch Code-switched Data

    Braggaar, A. & van der Goot, R., 25 Sep 2021.

    Research output: Contribution to conference - NOT published in proceeding or journalConference abstract for conferenceResearchpeer-review

  36. Published

    From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding

    van der Goot, R., Sharaf, I., Imankulova, A., Üstün, A., Stepanovic, M., Ramponi, A., Khairunnisa, S. O., Komachi, M. & Plank, B., 2021, Proceedings of NAACL. Association for Computational Linguistics

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  37. Published

    Massive Choice, Ample Tasks (MaChAmp): A Toolkit for Multi-task Learning in NLP

    van der Goot, R., Üstün, A., Ramponi, A., Sharaf, I. & Plank, B., 2021, Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations. Association for Computational Linguistics, p. 176-197

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  38. Published

    On the Effectiveness of Dataset Embeddings in Mono-lingual, Multi-lingual and Zero-shot Conditions

    van der Goot, R., Üstün, A. & Plank, B., Apr 2021, Proceedings of the Second Workshop on Domain Adaptation for NLP: EACL 2021 workshop. Association for Computational Linguistics, p. 183–194

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  39. Published

    Abusive Language Recognition in Russian

    Saitov, K. & Derczynski, L., 20 Apr 2021, Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing. Association for Computational Linguistics, p. 20-25

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  40. Published

    An IDR Framework of Opportunities and Barriers between HCI and NLP

    Inie, N. & Derczynski, L., 20 Apr 2021, Proceedings of the First Workshop on Bridging Human–Computer Interaction and Natural Language Processing: HCINLP. Association for Computational Linguistics, p. 101-108

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  41. Published

    De-identification of Privacy-related Entities in Job Postings

    Jensen, K. N., Zhang, M. & Plank, B., 21 May 2021, Proceedings of the 23rd Nordic Conference on Computational Linguistics. Association for Computational Linguistics, p. 210-221 (Linköping Electronic Conference Proceedings; No. 21, Vol. 178).

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  42. Published

    Discriminating Between Similar Nordic Languages

    Haas, R. & Derczynski, L., 20 Apr 2021, Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects. Association for Computational Linguistics, p. 67–75

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  43. Published

    Detection and Resolution of Rumors and Misinformation with NLP

    Derczynski, L. & Zubiaga, A., Dec 2020, Proceedings of the 28th International Conference on Computational Linguistics: Tutorial Abstracts. Barcelona, Spain (Online): Association for Computational Linguistics, p. 22-26

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  44. Published

    SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020)

    Zampieri, M., Nakov, P., Rosenthal, S., Atanasova, P., Karadzhov, G., Mubarak, H., Derczynski, L., Pitenis, Z. & Coltekin, C., Dec 2020, Proceedings of the Fourteenth Workshop on Semantic Evaluation. Barcelona (online): Association for Computational Linguistics, p. 1425-1447 23 p.

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  45. Published

    Directions in abusive language training data, a systematic review: Garbage in, garbage out

    Vidgen, B. & Derczynski, L., 28 Dec 2020, In: PLOS ONE. 15, 12, e0243300.

    Research output: Journal Article or Conference Article in JournalJournal articleResearchpeer-review

  46. Published

    DaNewsroom: A Large-scale Danish Summarisation Dataset

    Varab, D. & Schluter, N., Apr 2020, Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020). European Language Resources Association, p. 6731–6739

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  47. Published

    Sequence labelling and sequence classification with gaze: Novel uses of eye‐tracking data for Natural Language Processing

    Barrett, M. J. & Hollenstein, N., 5 Nov 2020, In: Language and Linguistics Compass. 14, 11, p. 1-16 16 p.

    Research output: Journal Article or Conference Article in JournalJournal articleResearchpeer-review

  48. Published

    Matching Theory and Data with Personal-ITY: What a Corpus of Italian YouTube Comments Reveals About Personality

    Bassignana, E., Nissim, M. & Patti, V., 2020, Proceedings of the Third Workshop on Computational Modeling of People's Opinions, Personality, and Emotion's in Social Media. Association for Computational Linguistics, p. 11-22

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  49. Published

    NLP North at WNUT-2020 Task 2: Pre-training versus Ensembling for Detection of Informative COVID-19 English Tweets

    Møller, A. G., van der Goot, R. & Plank, B., Nov 2020, Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020). Association for Computational Linguistics, p. 331-336

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  50. Published

    Neural Unsupervised Domain Adaptation in NLP—A Survey

    Ramponi, A. & Plank, B., Dec 2020, The 28th International Conference on Computational Linguistics. Association for Computational Linguistics

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

Previous 1 2 Next