Abstract
We explore different approaches to explicit entity modelling in language models (LM). We independently replicate two existing models in a controlled setup, introduce a simplified variant of one of the models and analyze their performance in direct comparison. Our results suggest that today's models are limited as several stochastic variables make learning difficult. We show that the most challenging point in the systems is the decision if the next token is an entity token. The low precision and recall for this variable will lead to severe cascading errors. Our own simplified approach dispenses with the need for latent variables and improves the performance in the entity yes/no decision. A standard well-tuned baseline RNN-LM with a larger number of hidden units outperforms all entity-enabled LMs in terms of perplexity.
Originalsprog | Engelsk |
---|---|
Titel | Proceedings of the Second Workshop on Computational Models of Reference, Anaphora and Coreference |
Antal sider | 5 |
Publikationsdato | 7 jun. 2019 |
ISBN (Trykt) | 978-1-948087-97-1 |
DOI | |
Status | Udgivet - 7 jun. 2019 |
Udgivet eksternt | Ja |