Hyperparameter Power Impact in Transformer Language Model Training

Lucas Høyberg Puvis de Chavannes, Mads Guldborg Kjeldgaard Kongsbak, Timmie Rantzau, Leon Derczynski

Research output: Conference Article in Proceeding or Book/Report chapter; Article in proceedings; Research; peer-review

Original language: English
Title of host publication: Proceedings of the Second Workshop on Simple and Efficient Natural Language Processing
Publisher: Association for Computational Linguistics
Publication date: 1 Nov 2021
Publication status: Published - 1 Nov 2021
