Natural Language Processing
Organisational unit: Research Group
IT University of Copenhagen
Rued Langgaards Vej 7
DK-2300 Copenhagen S
Denmark
Contact information
- Web: http://nlp.itu.dk
Organisation profile
Natural Language Processing (NLP) uses machine learning and other techniques to parse, analyse, translate and understand texts in human languages such as English or Danish. The work of ITU NLP researchers include transfer learning, representation learning, analysis of clinical patient records, automatic summarization, corpora building, stance detection, fake news analysis, and much more.
- 2022
- Published
What do You Mean by Relation Extraction? A Survey on Datasets and Study on Scientific Relation Classification
Bassignana, E. & Plank, B., 2022, The 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop. Dublin, Ireland: Association for Computational Linguistics, Vol. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop. p. 67–83 17 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Kompetencer: Fine-grained Skill Classification in Danish Job Postings via Distant Supervision and Transfer Learning
Zhang, M., Jensen, K. N. & Plank, B., 16 Jun 2022, 13th International Conference on Language Resources and Evaluation. European Language Resources Association (ELRA), p. 436-447 11 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
SkillSpan: Hard and Soft Skill Extraction from English Job Postings
Zhang, M., Jensen, K. N., Sonniks, S. D. & Plank, B., 9 Jul 2022, 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics. Association for Computational LinguisticsResearch output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- 2021
- Published
Resources and Evaluations for Danish Entity Resolution
Barrett, M. J., Lam, H., Wu, M., Lacroix, O., Plank, B. & Søgaard, A., 2021, Fourth Workshop on Computational Models of Reference, Anaphora and Coreference (CRAC). Association for Computational Linguistics, p. 63–69Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
We Need to Consider Disagreement in Evaluation
Basile, V., Fell, M., Fornaciari, T., Hovy, D., Paun, S., Plank, B., Poesio, M. & Uma, A., 2021, ACL-IJCNLP2021 Workshop on Benchmarking: Past, Present and Future. Association for Computational Linguistics, p. 15-21Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Event and entity coreference across five European languages: Effects of context and referring expression
Bevacqua, L., Loáiciga, S., Rohde, H. & Hardmeier, C., 18 Dec 2021, In: Dialogue and Discourse.Research output: Journal Article or Conference Article in Journal › Journal article › Research › peer-review
- Published
Challenges in Annotating and Parsing Spoken, Code-switched, Frisian-Dutch Data
Braggaar, A. & van der Goot, R., Apr 2021, Proceedings of the Second Workshop on Domain Adaptation for NLP. Association for Computational Linguistics, p. 50-58Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Creating a Universal Dependencies Treebank of Spoken Frisian-Dutch Code-switched Data
Braggaar, A. & van der Goot, R., 25 Sep 2021.Research output: Contribution to conference - NOT published in proceeding or journal › Conference abstract for conference › Research › peer-review
- Published
Proceedings of the Second Workshop on Computational Approaches to Discourse (CODI)
Braud, C. (ed.), Hardmeier, C. (ed.), Li, J. J. (ed.), Louis, A. (ed.), Strube, M. (ed.) & Zeldes, A. (ed.), 2021, Association for Computational Linguistics.Research output: Book / Anthology / Report / Ph.D. thesis › Anthology › Research › peer-review
- Published
Proceedings of the Third Workshop on Gender Bias in Natural Language Processing
Costa-jussà, M. R. (ed.), Gonen, H. (ed.), Hardmeier, C. (ed.) & Webster, K. (ed.), 2021, Association for Computational Linguistics.Research output: Book / Anthology / Report / Ph.D. thesis › Anthology › Research › peer-review
- Published
"I’ll be there for you": The One with Understanding Indirect Answers
Damgaard, C., Toborek, P., Eriksen, T. & Plank, B., 2021, The Second Workshop on Computational Approaches to Discourse. Association for Computational LinguisticsResearch output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
The Danish Gigaword Corpus
Derczynski, L., Ciosici, M. R., Baglini, R., Christiansen, M., Dalsgaard, J. A., Fusaroli, R., Henrichsen, P. J., Hvingelby, R., Kirkedal, A. S., Kjeldsen, A. S., Ladefoged, C., Nielsen, F. Å., Madsen, J., Petersen, M. L., Rystrøm, J. H. & Varab, D., 1 Jun 2021, Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa). Northern European Association for Language Technology (NEALT)Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Discriminating Between Similar Nordic Languages
Haas, R. & Derczynski, L., 20 Apr 2021, Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects. Association for Computational Linguistics, p. 67–75Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
How to write a bias statement: Recommendations for submissions to the Workshop on Gender Bias in NLP
Hardmeier, C., Costa-jussà, M. R., Webster, K., Radford, W. & Blodgett, S. L., 2021.Research output: Working paper › Preprint › Research
- Published
Decoding EEG brain activity for multi-modal natural language processing
Hollenstein, N., Renggli, C., Glaus, B., Barrett, M. J., Troendle, M., Langer, N. & Zhang, C., 2021, In: Frontiers in Human Neuroscience. 15, 659410.Research output: Journal Article or Conference Article in Journal › Journal article › Research › peer-review
- Published
Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get?
Iliescu, D-M., Grand, R., van der Goot, R. & Qirko, S., Jun 2021, Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching. Association for Computational Linguistics, p. 65 6 p. (Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching).Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
An IDR Framework of Opportunities and Barriers between HCI and NLP
Inie, N. & Derczynski, L., 20 Apr 2021, Proceedings of the First Workshop on Bridging Human–Computer Interaction and Natural Language Processing: HCINLP. Association for Computational Linguistics, p. 101-108Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Spurious Correlations in Cross-Topic Argument Mining
Jakobsen, T. S. T., Barrett, M. J. & Søgaard, A., 2021, Proceedings of *SEM 2021: The Tenth Joint Conference on Lexical and Computational Semantics. Association for Computational Linguistics, p. 263-277 9 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
De-identification of Privacy-related Entities in Job Postings
Jensen, K. N., Zhang, M. & Plank, B., 21 May 2021, Proceedings of the 23rd Nordic Conference on Computational Linguistics. Association for Computational Linguistics, p. 210-221 (Linköping Electronic Conference Proceedings; No. 21, Vol. 178).Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
PROCAT: Product Catalogue Dataset for Implicit Clustering, Permutation Learning and Structure Prediction
Jurewicz, M. & Derczynski, L., 1 Dec 2021, Thirty-fifth Conference on Neural Information Processing Systems: Datasets and Benchmarks Track. 2021 ed. Vol. 1.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Set-to-Sequence Methods in Machine Learning: A Review
Jurewicz, M. & Derczynski, L., 12 Aug 2021, In: The Journal of Artificial Intelligence Research. 71, p. 885-924Research output: Journal Article or Conference Article in Journal › Journal article › Research › peer-review
- Published
Unsupervised discovery of unaccusative and unergative verbs
Loáiciga, S., Bevacqua, L. & Hardmeier, C., 2021.Research output: Working paper › Preprint › Research
- Published
Genre as Weak Supervision for Cross-lingual Dependency Parsing
Müller-Eberstein, M., van der Goot, R. & Plank, B., Nov 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Online and Punta Cana, Dominican Republic: Association for Computational Linguistics, p. 4786-4802Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
How Universal is Genre in Universal Dependencies?
Müller-Eberstein, M., van der Goot, R. & Plank, B., Dec 2021, Proceedings of the 20th International Workshop on Treebanks and Linguistic Theories (TLT, SyntaxFest 2021). Sofia, Bulgaria: Association for Computational Linguistics, p. 69-85Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
DanFEVER: claim verification dataset for Danish
Nørregaard, J. & Derczynski, L., 1 Jun 2021, Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa). Northern European Association for Language Technology (NEALT)Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Finding the needle in a haystack: Extraction of Informative COVID-19 Danish Tweets
Olsen, B. A. & Plank, B., 2021, Proceedings of the 2021 EMNLP Workshop W-NUT: The Seventh Workshop on Noisy User-generated Text. Association for Computational Linguistics, p. 11–19Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
DaNLP: An open-source toolkit for Danish Natural Language Processing
Pauli, A. B., Barrett, M. J., Lacroix, O. & Hvingelby, R., 2021, Proceedings of the 23rd Nordic Conference on Computational Linguistics. Linköping Electronic Conference Proceedings, p. 460-466Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Cross-Lingual Cross-Domain Nested Named Entity Evaluation on English Web Texts
Plank, B., 2021, Findings of ACL 2021. Association for Computational Linguistics, p. 1808 1815 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Hyperparameter Power Impact in Transformer Language Model Training
Puvis de Chavannes, L. H., Kongsbak, M. G. K., Rantzau, T. & Derczynski, L., 1 Nov 2021, Proceedings of the Second Workshop on Simple and Efficient Natural Language Processing. Association for Computational LinguisticsResearch output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
A mention-based system for revision requirements detection
Ruby, A., Hardmeier, C. & Stymne, S., 2021, Proceedings of the First Workshop on Understanding Implicit and Underspecified Language (UnImplicit). Association for Computational Linguistics, p. 58-63Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Abusive Language Recognition in Russian
Saitov, K. & Derczynski, L., 20 Apr 2021, Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing. Association for Computational Linguistics, p. 20-25Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
CL-MoNoise: Cross-lingual Lexical Normalization
van der Goot, R., Oct 2021, Proceedings of the Seventh Workshop on Noisy User-generated Text (W-NUT 2021). Association for Computational Linguistics, p. 510 4 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Cross-lingual Multi-task Transfer for Zero-shot Task-oriented Dialog
van der Goot, R., Stepanovic, M., Ramponi, A., Sharaf, I., Üstün, A., Imankulova, A., Khairunnisa, S. O., Komachi, M. & Plank, B., 25 Sep 2021.Research output: Contribution to conference - NOT published in proceeding or journal › Conference abstract for conference › Research › peer-review
- Published
From Masked Language Modeling to Translation: Non-English Auxiliary Tasks Improve Zero-shot Spoken Language Understanding
van der Goot, R., Sharaf, I., Imankulova, A., Üstün, A., Stepanovic, M., Ramponi, A., Khairunnisa, S. O., Komachi, M. & Plank, B., 2021, Proceedings of NAACL. Association for Computational LinguisticsResearch output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Lexical Normalization for Code-switched Data and its Effect on POS Tagging
van der Goot, R. & Çetinoğlu, Ö., Apr 2021, Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. Association for Computational Linguistics, p. 2352-2365 13 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Massive Choice, Ample Tasks (MaChAmp): A Toolkit for Multi-task Learning in NLP
van der Goot, R., Üstün, A., Ramponi, A., Sharaf, I. & Plank, B., 2021, Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations. Association for Computational Linguistics, p. 176-197Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
MultiLexNorm: A Shared Task on Multilingual Lexical Normalization
van der Goot, R., Ramponi, A., Zubiaga, A., Plank, B., Muller, B., San Vicente Roncal, I., Ljubešic´, N., Çetinoğlu, Ö., Mahendra, R., Çolakoglu, T., Baldwin, T., Caselli, T. & Sidorenko, W., Nov 2021, Proceedings of the Seventh Workshop on Noisy User-generated Text (W-NUT 2021). Association for Computational Linguistics, p. 493–509 16 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
On the Effectiveness of Dataset Embeddings in Mono-lingual, Multi-lingual and Zero-shot Conditions
van der Goot, R., Üstün, A. & Plank, B., Apr 2021, Proceedings of the Second Workshop on Domain Adaptation for NLP: EACL 2021 workshop. Association for Computational Linguistics, p. 183–194Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
We Need to Talk About train-dev-test Splits
van der Goot, R., Oct 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, p. 4485 9 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Exploring the importance of source text in automatic post-editing for context-aware machine translation
Wang, C., Hardmeier, C. & Sennrich, R., 2021, Proceedings of the 23rd Nordic Conference on Computational Linguistics (NODALIDA). Linköping University Electronic Press, p. 326-335Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Annotating Online Misogyny
Zeinert, P., Inie, N. & Derczynski, L., 3 Aug 2021, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Association for Computational Linguistics, p. 3181–3197Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Cartography Active Learning
Zhang, M. & Plank, B., 8 Nov 2021, Findings of the Association for Computational Linguistics: EMNLP 2021. Association for Computational Linguistics, p. 395–406Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- 2020
- Published
Sequence labelling and sequence classification with gaze: Novel uses of eye‐tracking data for Natural Language Processing
Barrett, M. J. & Hollenstein, N., 5 Nov 2020, In: Language and Linguistics Compass. 14, 11, p. 1-16 16 p.Research output: Journal Article or Conference Article in Journal › Journal article › Research › peer-review
- Published
Matching Theory and Data with Personal-ITY: What a Corpus of Italian YouTube Comments Reveals About Personality
Bassignana, E., Nissim, M. & Patti, V., 2020, Proceedings of the Third Workshop on Computational Modeling of People's Opinions, Personality, and Emotion's in Social Media. Association for Computational Linguistics, p. 11-22Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
One of these words is not like the other: a reproduction of outlier identification using non-contextual word representations
Brink Andersen, J., Bak Bertelsen, M., Hørby Schou, M., Ciosici, M. R. & Assent, I., Nov 2020, Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing and the 10th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) . Association for Computational Linguistics, 11 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Accelerated High-Quality Mutual-Information Based Word Clustering
Ciosici, M. R., Assent, I. & Derczynski, L., 1 May 2020, Proceedings of The 12th Language Resources and Evaluation Conference. Marseille, France: European Language Resources Association, p. 2484-2489 6 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Synthetic Data for English Lexical Normalization: How Close Can We Get to Manually Annotated Data?
Dekker, K. & van der Goot, R., May 2020, Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC 2020). European Language Resources Association, p. 6300-6309Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Detection and Resolution of Rumors and Misinformation with NLP
Derczynski, L. & Zubiaga, A., Dec 2020, Proceedings of the 28th International Conference on Computational Linguistics: Tutorial Abstracts. Barcelona, Spain (Online): Association for Computational Linguistics, p. 22-26Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
The Rumour Mill: Making the Spread of Misinformation Explicit and Tangible
Inie, N., Falk Olesen, J. & Derczynski, L., Apr 2020, The ACM CHI Conference on Human Factors in Computing Systems. Association for Computing MachineryResearch output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Buhscitu at SemEvaL-2020 Task 7: Assessing Humour in Edited News Headlines using Hand-Crafted Features and Online Knowledge Bases
Jensen, K. N., Filrup Rasmussen, N., Wang, T., Placenti, M. & Plank, B., 2020, SemEval. Association for Computational LinguisticsResearch output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review