ITU

Natural Language Processing

Organisational unit: Research Group

IT University of Copenhagen
Rued Langgaards Vej 7
DK-2300 Copenhagen S
Denmark

Contact information

Organisation profile

Natural Language Processing (NLP) uses machine learning and other techniques to parse, analyse, translate and understand texts in human languages such as English or Danish. The work of ITU NLP researchers include transfer learning, representation learning, analysis of clinical patient records, automatic summarization, corpora building, stance detection, fake news analysis, and much more. 

  1. 2020
  2. Published

    Matching Theory and Data with Personal-ITY: What a Corpus of Italian YouTube Comments Reveals About Personality

    Bassignana, E., Nissim, M. & Patti, V., 2020, Proceedings of the Third Workshop on Computational Modeling of People's Opinions, Personality, and Emotion's in Social Media. Association for Computational Linguistics, p. 11-22

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  3. Published

    One of these words is not like the other: a reproduction of outlier identification using non-contextual word representations

    Brink Andersen, J., Bak Bertelsen, M., Hørby Schou, M., Ciosici, M. R. & Assent, I., Nov 2020, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing and the 10th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) . Association for Computational Linguistics, 11 p.

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  4. Published

    Accelerated High-Quality Mutual-Information Based Word Clustering

    Ciosici, M. R., Assent, I. & Derczynski, L., 1 May 2020, Proceedings of The 12th Language Resources and Evaluation Conference. Marseille, France: European Language Resources Association, p. 2484-2489 6 p.

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  5. Published

    Synthetic Data for English Lexical Normalization: How Close Can We Get to Manually Annotated Data?

    Dekker, K. & van der Goot, R., May 2020, Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC 2020). European Language Resources Association, p. 6300-6309

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  6. Published

    Detection and Resolution of Rumors and Misinformation with NLP

    Derczynski, L. & Zubiaga, A., Dec 2020, Proceedings of the 28th International Conference on Computational Linguistics: Tutorial Abstracts. Barcelona, Spain (Online): Association for Computational Linguistics, p. 22-26

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  7. Published

    The Rumour Mill: Making the Spread of Misinformation Explicit and Tangible

    Inie, N., Falk Olesen, J. & Derczynski, L., Apr 2020, The ACM CHI Conference on Human Factors in Computing Systems. Association for Computing Machinery

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  8. Published

    Buhscitu at SemEvaL-2020 Task 7: Assessing Humour in Edited News Headlines using Hand-Crafted Features and Online Knowledge Bases

    Jensen, K. N., Filrup Rasmussen, N., Wang, T., Placenti, M. & Plank, B., 2020, SemEval. Association for Computational Linguistics

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  9. Published

    FT Speech: Danish Parliament Speech Corpus

    Kirkedal, A. S., Stepanovic, M. & Plank, B., 2020, INTERSPEECH 2020. International Speech Communication Association (ISCA), (Annual Conference of the International Speech Communication Association).

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  10. Published

    SHR++: An Interface for Morpho-syntactic annotation of Sanskrit Corpora

    Krishna, A., Vidhyut, S., Chawla, D., Sambhavi, S. & Goyal, P., Feb 2020, Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020),. Association for Computational Linguistics, p. 7069–7076

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  11. Published

    NLP North at WNUT-2020 Task 2: Pre-training versus Ensembling for Detection of Informative COVID-19 English Tweets

    Møller, A. G., van der Goot, R. & Plank, B., Nov 2020, Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020). Association for Computational Linguistics, p. 331-336

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  12. Published

    DAN+: Danish Nested Named Entities and Lexical Normalization

    Plank, B., Jensen, K. N. & van der Goot, R., Dec 2020, The 28th International Conference on Computational Linguistics. Association for Computational Linguistics, p. 6649–6662

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  13. Published

    Biomedical Event Extraction as Sequence Labeling

    Ramponi, A., van der Goot, R., Lombardo, R. & Plank, B., Nov 2020, Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  14. Published

    Cross-Domain Evaluation of Edge Detection for Biomedical Event Extraction

    Ramponi, A., Plank, B. & Lombardo, R., May 2020, Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020). European Language Resources Association, p. 1975 1982 p.

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  15. Published

    Neural Unsupervised Domain Adaptation in NLP—A Survey

    Ramponi, A. & Plank, B., Dec 2020, The 28th International Conference on Computational Linguistics. Association for Computational Linguistics

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  16. Published

    Maintaining quality in FEVER annotation

    Schulte, H., Binau, J. & Derczynski, L., 9 Jul 2020, Proceedings of the Third Workshop on Fact Extraction and VERification (FEVER): Association for Computational Linguistics. Association for Computational Linguistics, p. 42-46

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  17. Published

    Offensive Language and Hate Speech Detection for Danish

    Sigurbergsson, G. & Derczynski, L., 1 May 2020, Proceedings of the International Conference on Language Resources and Evaluation: LREC 2020. European Language Resources Association, p. 3498–3508

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  18. Published

    Norm It! Lexical Normalization for Italian and Its Downstream Effects forDependency Parsing

    van der Goot, R., Ramponi, A., Caselli, T., Cafagna, M. & De Mattei, L., May 2020, Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC 2020). France: European Language Resources Association (ELRA), p. 6272–6278

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  19. Published

    DaNewsroom: A Large-scale Danish Summarisation Dataset

    Varab, D. & Schluter, N., Apr 2020, Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020). European Language Resources Association, p. 6731–6739

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  20. Published

    SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020)

    Zampieri, M., Nakov, P., Rosenthal, S., Atanasova, P., Karadzhov, G., Mubarak, H., Derczynski, L., Pitenis, Z. & Coltekin, C., Dec 2020, Proceedings of the Fourteenth Workshop on Semantic Evaluation. Barcelona (online): Association for Computational Linguistics, p. 1425-1447 23 p.

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

  21. 2021
  22. Published

    Event and entity coreference across five European languages: Effects of context and referring expression

    Bevacqua, L., Loáiciga, S., Rohde, H. & Hardmeier, C., 18 Dec 2021, In: Dialogue and Discourse.

    Research output: Journal Article or Conference Article in JournalJournal articleResearchpeer-review

  23. Published

    Decoding EEG brain activity for multi-modal natural language processing

    Hollenstein, N., Renggli, C., Glaus, B., Barrett, M. J., Troendle, M., Langer, N. & Zhang, C., 2021, In: Frontiers in Human Neuroscience. 15, 659410.

    Research output: Journal Article or Conference Article in JournalJournal articleResearchpeer-review

  24. Published

    Set-to-Sequence Methods in Machine Learning: A Review

    Jurewicz, M. & Derczynski, L., 12 Aug 2021, In: The Journal of Artificial Intelligence Research. 71, p. 885-924

    Research output: Journal Article or Conference Article in JournalJournal articleResearchpeer-review

  25. Published

    Proceedings of the Second Workshop on Computational Approaches to Discourse (CODI)

    Braud, C. (ed.), Hardmeier, C. (ed.), Li, J. J. (ed.), Louis, A. (ed.), Strube, M. (ed.) & Zeldes, A. (ed.), 2021, Association for Computational Linguistics.

    Research output: Book / Anthology / Report / Ph.D. thesisAnthologyResearchpeer-review

  26. Published

    Proceedings of the Third Workshop on Gender Bias in Natural Language Processing

    Costa-jussà, M. R. (ed.), Gonen, H. (ed.), Hardmeier, C. (ed.) & Webster, K. (ed.), 2021, Association for Computational Linguistics.

    Research output: Book / Anthology / Report / Ph.D. thesisAnthologyResearchpeer-review

  27. Published

    Resources and Evaluations for Danish Entity Resolution

    Barrett, M. J., Lam, H., Wu, M., Lacroix, O., Plank, B. & Søgaard, A., 2021, Fourth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC). Association for Computational Linguistics, p. 63–69

    Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review