Hyperparameter Power Impact in Transformer Language Model Training

Research output: Conference Article in Proceeding or Book/Report chapterArticle in proceedingsResearchpeer-review

View graph of relations

Original languageEnglish
Title of host publicationProceedings of the Second Workshop on Simple and Efficient Natural Language Processing
PublisherAssociation for Computational Linguistics
Publication date1 Nov 2021
Publication statusPublished - 1 Nov 2021

ID: 86559935