Illuminating Generalization in Deep Reinforcement Learning through Procedural Level Generation

Niels Justesen, Ruben Rodriguez Torrado, Philip Bontrager, Ahmed Khalifa, Julian Togelius, Sebastian Risi

    Publikation: Konferencebidrag - EJ publiceret i proceeding eller tidsskriftPaperForskning

    Abstract

    Deep reinforcement learning (RL) has shown impressive results in a variety of domains, learning directly from high-dimensional sensory streams. However, when neural networks are trained in a fixed environment, such as a single level in a video game, they will usually overfit and fail to generalize to new levels. When RL models overfit, even slight modifications to the environment can result in poor agent performance. This paper explores how procedurally generated levels during training can increase generality. We show that for some games procedural level generation enables generalization to new levels within the same distribution. Additionally, it is possible to achieve better performance with less data by manipulating the difficulty of the levels in response to the performance of the agent. The generality of the learned behaviors is also evaluated on a set of human-designed levels. The results suggest that the ability to generalize to human-designed levels highly depends on the design of the level generators. We apply dimensionality reduction and clustering techniques to visualize the generators’ distributions of levels and analyze to what degree they can produce levels similar to those designed by a human.
    OriginalsprogEngelsk
    Publikationsdato2018
    StatusUdgivet - 2018
    BegivenhedNeurIPS Workshop on Deep Reinforcement Learning Workshop - Palais des Congrès de Montréal, Montréal, Canada
    Varighed: 7 dec. 20187 dec. 2018
    https://sites.google.com/view/deep-rl-workshop-nips-2018/home

    Konference

    KonferenceNeurIPS Workshop on Deep Reinforcement Learning Workshop
    LokationPalais des Congrès de Montréal
    Land/OmrådeCanada
    ByMontréal
    Periode07/12/201807/12/2018
    Internetadresse

    Emneord

    • Deep Reinforcement Learning
    • Procedural Level Generation
    • Generalization
    • Neural Networks
    • Dimensionality Reduction and Clustering Techniques

    Fingeraftryk

    Dyk ned i forskningsemnerne om 'Illuminating Generalization in Deep Reinforcement Learning through Procedural Level Generation'. Sammen danner de et unikt fingeraftryk.

    Citationsformater