Natural Language Processing
Organisational unit: Research Group
IT University of Copenhagen
Rued Langgaards Vej 7
DK-2300 Copenhagen S
Denmark
Contact information
- Web: http://nlp.itu.dk
Organisation profile
Natural Language Processing (NLP) uses machine learning and other techniques to parse, analyse, translate and understand texts in human languages such as English or Danish. The work of ITU NLP researchers include transfer learning, representation learning, analysis of clinical patient records, automatic summarization, corpora building, stance detection, fake news analysis, and much more.
- 2022
- Published
SkillSpan: Hard and Soft Skill Extraction from English Job Postings
Zhang, M., Jensen, K. N., Sonniks, S. D. & Plank, B., 9 Jul 2022, 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics. Association for Computational LinguisticsResearch output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Kompetencer: Fine-grained Skill Classification in Danish Job Postings via Distant Supervision and Transfer Learning
Zhang, M., Jensen, K. N. & Plank, B., 16 Jun 2022, 13th International Conference on Language Resources and Evaluation. European Language Resources Association (ELRA), p. 436-447 11 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
What do You Mean by Relation Extraction? A Survey on Datasets and Study on Scientific Relation Classification
Bassignana, E. & Plank, B., 2022, The 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop. Dublin, Ireland: Association for Computational Linguistics, Vol. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop. p. 67–83 17 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- 2021
- Published
Event and entity coreference across five European languages: Effects of context and referring expression
Bevacqua, L., Loáiciga, S., Rohde, H. & Hardmeier, C., 18 Dec 2021, In: Dialogue and Discourse.Research output: Journal Article or Conference Article in Journal › Journal article › Research › peer-review
- Published
PROCAT: Product Catalogue Dataset for Implicit Clustering, Permutation Learning and Structure Prediction
Jurewicz, M. & Derczynski, L., 1 Dec 2021, Thirty-fifth Conference on Neural Information Processing Systems: Datasets and Benchmarks Track. 2021 ed. Vol. 1.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
How Universal is Genre in Universal Dependencies?
Müller-Eberstein, M., van der Goot, R. & Plank, B., Dec 2021, Proceedings of the 20th International Workshop on Treebanks and Linguistic Theories (TLT, SyntaxFest 2021). Sofia, Bulgaria: Association for Computational Linguistics, p. 69-85Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Cartography Active Learning
Zhang, M. & Plank, B., 8 Nov 2021, Findings of the Association for Computational Linguistics: EMNLP 2021. Association for Computational Linguistics, p. 395–406Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Hyperparameter Power Impact in Transformer Language Model Training
Puvis de Chavannes, L. H., Kongsbak, M. G. K., Rantzau, T. & Derczynski, L., 1 Nov 2021, Proceedings of the Second Workshop on Simple and Efficient Natural Language Processing. Association for Computational LinguisticsResearch output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Genre as Weak Supervision for Cross-lingual Dependency Parsing
Müller-Eberstein, M., van der Goot, R. & Plank, B., Nov 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Online and Punta Cana, Dominican Republic: Association for Computational Linguistics, p. 4786-4802Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
MultiLexNorm: A Shared Task on Multilingual Lexical Normalization
van der Goot, R., Ramponi, A., Zubiaga, A., Plank, B., Muller, B., San Vicente Roncal, I., Ljubešic´, N., Çetinoğlu, Ö., Mahendra, R., Çolakoglu, T., Baldwin, T., Caselli, T. & Sidorenko, W., Nov 2021, Proceedings of the Seventh Workshop on Noisy User-generated Text (W-NUT 2021). Association for Computational Linguistics, p. 493–509 16 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
CL-MoNoise: Cross-lingual Lexical Normalization
van der Goot, R., Oct 2021, Proceedings of the Seventh Workshop on Noisy User-generated Text (W-NUT 2021). Association for Computational Linguistics, p. 510 4 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
We Need to Talk About train-dev-test Splits
van der Goot, R., Oct 2021, Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, p. 4485 9 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Creating a Universal Dependencies Treebank of Spoken Frisian-Dutch Code-switched Data
Braggaar, A. & van der Goot, R., 25 Sep 2021.Research output: Contribution to conference - NOT published in proceeding or journal › Conference abstract for conference › Research › peer-review
- Published
Cross-lingual Multi-task Transfer for Zero-shot Task-oriented Dialog
van der Goot, R., Stepanovic, M., Ramponi, A., Sharaf, I., Üstün, A., Imankulova, A., Khairunnisa, S. O., Komachi, M. & Plank, B., 25 Sep 2021.Research output: Contribution to conference - NOT published in proceeding or journal › Conference abstract for conference › Research › peer-review
- Published
Set-to-Sequence Methods in Machine Learning: A Review
Jurewicz, M. & Derczynski, L., 12 Aug 2021, In: The Journal of Artificial Intelligence Research. 71, p. 885-924Research output: Journal Article or Conference Article in Journal › Journal article › Research › peer-review
- Published
Annotating Online Misogyny
Zeinert, P., Inie, N. & Derczynski, L., 3 Aug 2021, Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Association for Computational Linguistics, p. 3181–3197Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
DanFEVER: claim verification dataset for Danish
Nørregaard, J. & Derczynski, L., 1 Jun 2021, Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa). Northern European Association for Language Technology (NEALT)Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
The Danish Gigaword Corpus
Derczynski, L., Ciosici, M. R., Baglini, R., Christiansen, M., Dalsgaard, J. A., Fusaroli, R., Henrichsen, P. J., Hvingelby, R., Kirkedal, A. S., Kjeldsen, A. S., Ladefoged, C., Nielsen, F. Å., Madsen, J., Petersen, M. L., Rystrøm, J. H. & Varab, D., 1 Jun 2021, Proceedings of the 23rd Nordic Conference on Computational Linguistics (NoDaLiDa). Northern European Association for Language Technology (NEALT)Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Much Gracias: Semi-supervised Code-switch Detection for Spanish-English: How far can we get?
Iliescu, D-M., Grand, R., van der Goot, R. & Qirko, S., Jun 2021, Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching. Association for Computational Linguistics, p. 65 6 p. (Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching).Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
De-identification of Privacy-related Entities in Job Postings
Jensen, K. N., Zhang, M. & Plank, B., 21 May 2021, Proceedings of the 23rd Nordic Conference on Computational Linguistics. Association for Computational Linguistics, p. 210-221 (Linköping Electronic Conference Proceedings; No. 21, Vol. 178).Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Abusive Language Recognition in Russian
Saitov, K. & Derczynski, L., 20 Apr 2021, Proceedings of the 8th Workshop on Balto-Slavic Natural Language Processing. Association for Computational Linguistics, p. 20-25Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
An IDR Framework of Opportunities and Barriers between HCI and NLP
Inie, N. & Derczynski, L., 20 Apr 2021, Proceedings of the First Workshop on Bridging Human–Computer Interaction and Natural Language Processing: HCINLP. Association for Computational Linguistics, p. 101-108Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Discriminating Between Similar Nordic Languages
Haas, R. & Derczynski, L., 20 Apr 2021, Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects. Association for Computational Linguistics, p. 67–75Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Challenges in Annotating and Parsing Spoken, Code-switched, Frisian-Dutch Data
Braggaar, A. & van der Goot, R., Apr 2021, Proceedings of the Second Workshop on Domain Adaptation for NLP. Association for Computational Linguistics, p. 50-58Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review
- Published
Lexical Normalization for Code-switched Data and its Effect on POS Tagging
van der Goot, R. & Çetinoğlu, Ö., Apr 2021, Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. Association for Computational Linguistics, p. 2352-2365 13 p.Research output: Conference Article in Proceeding or Book/Report chapter › Article in proceedings › Research › peer-review