ITU

Natural Language Processing

Organisational unit: Research Group

IT University of Copenhagen
Rued Langgaards Vej 7
DK-2300 Copenhagen S
Denmark

Organisation profile

Natural Language Processing (NLP) uses machine learning and other techniques to parse, analyse, translate and understand texts in human languages such as English or Danish. The work of ITU NLP researchers includes transfer learning, representation learning, analysis of clinical patient records, automatic summarization, corpus building, stance detection, and fake news analysis, among other topics.

  1. 2021
  2. Published

    From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding

    van der Goot, R., Sharaf, I., Imankulova, A., Üstün, A., Stepanovic, M., Ramponi, A., Khairunnisa, S. O., Komachi, M. & Plank, B., 2021, Proceedings of NAACL. Association for Computational Linguistics

    Research output: Conference Article in Proceeding or Book/Report chapter; Article in proceedings; Research; peer-review

  3. Published

    Lexical Normalization for Code-switched Data and its Effect on POS Tagging

    van der Goot, R. & Çetinoğlu, Ö., Apr 2021, Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. Association for Computational Linguistics, p. 2352-2365 13 p.

    Research output: Conference Article in Proceeding or Book/Report chapter; Article in proceedings; Research; peer-review

  4. Published

    Massive Choice, Ample Tasks (MaChAmp): A Toolkit for Multi-task Learning in NLP

    van der Goot, R., Üstün, A., Ramponi, A., Sharaf, I. & Plank, B., 2021, Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations. Association for Computational Linguistics, p. 176-197

    Research output: Conference Article in Proceeding or Book/Report chapter; Article in proceedings; Research; peer-review

  5. Published

    On the Effectiveness of Dataset Embeddings in Mono-lingual, Multi-lingual and Zero-shot Conditions

    van der Goot, R., Üstün, A. & Plank, B., Apr 2021, Proceedings of the Second Workshop on Domain Adaptation for NLP: EACL 2021 workshop. Association for Computational Linguistics, p. 183–194

    Research output: Conference Article in Proceeding or Book/Report chapter; Article in proceedings; Research; peer-review

  6. Published

    Annotating Online Misogyny

    Zeinert, P., Inie, N. & Derczynski, L., 3 Aug 2021, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Association for Computational Linguistics, p. 3181–3197

    Research output: Conference Article in Proceeding or Book/Report chapter; Article in proceedings; Research; peer-review

  7. 2020
  8. Published

    Sequence labelling and sequence classification with gaze: Novel uses of eye‐tracking data for Natural Language Processing

    Barrett, M. J. & Hollenstein, N., 5 Nov 2020, In: Language and Linguistics Compass. 14, 11, p. 1-16 16 p.

    Research output: Journal Article or Conference Article in Journal; Journal article; Research; peer-review

  9. Published

    Matching Theory and Data with Personal-ITY: What a Corpus of Italian YouTube Comments Reveals About Personality

    Bassignana, E., Nissim, M. & Patti, V., 2020, Proceedings of the Third Workshop on Computational Modeling of People's Opinions, Personality, and Emotion's in Social Media. Association for Computational Linguistics, p. 11-22

    Research output: Conference Article in Proceeding or Book/Report chapter; Article in proceedings; Research; peer-review

  10. Published

    One of these words is not like the other: a reproduction of outlier identification using non-contextual word representations

    Brink Andersen, J., Bak Bertelsen, M., Hørby Schou, M., Ciosici, M. R. & Assent, I., Nov 2020, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing and the 10th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Association for Computational Linguistics, 11 p.

    Research output: Conference Article in Proceeding or Book/Report chapter; Article in proceedings; Research; peer-review

  11. Published

    Accelerated High-Quality Mutual-Information Based Word Clustering

    Ciosici, M. R., Assent, I. & Derczynski, L., 1 May 2020, Proceedings of The 12th Language Resources and Evaluation Conference. Marseille, France: European Language Resources Association, p. 2484-2489 6 p.

    Research output: Conference Article in Proceeding or Book/Report chapter; Article in proceedings; Research; peer-review

  12. Published

    Synthetic Data for English Lexical Normalization: How Close Can We Get to Manually Annotated Data?

    Dekker, K. & van der Goot, R., May 2020, Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC 2020). European Language Resources Association, p. 6300-6309

    Research output: Conference Article in Proceeding or Book/Report chapter; Article in proceedings; Research; peer-review