Analysing the Multiple Timescale Recurrent Neural Network for Embodied Language Understanding

Stefan Heinrich, Sven Magg, Stefan Wermter

Research output: Conference Article in Proceeding or Book/Report chapterBook chapterResearchpeer-review

Abstract

How the human brain understands natural language and how we can exploit this understanding for building intelligent grounded language systems is open research. Recently, researchers claimed that language is embodied in most – if not all – sensory and sensorimotor modalities and that the brain’s architecture favours the emergence of language. In this chapter we investigate the characteristics of such an architecture and propose a model based on the Multiple Timescale Recurrent Neural Network, extended by embodied visual perception, and tested in a real world scenario. We show that such an architecture can learn the meaning of utterances with respect to visual perception and that it can produce verbal utterances that correctly describe previously unknown scenes. In addition we rigorously study the timescale mechanism (also known as hysteresis) and explore the impact of the architectural connectivity in the language acquisition task
Original languageEnglish
Title of host publicationArtificial Neural Networks -- Methods and Applications in Bio-/Neuroinformatics
EditorsPetia D. Koprinkova-Hristova, Valeri M. Mladenov, Nikola K. Kasabov
Number of pages26
Volume4
PublisherSpringer International Publishing, Switzerland
Publication date1 Jan 2015
Pages149-174
DOIs
Publication statusPublished - 1 Jan 2015
Externally publishedYes

Fingerprint

Dive into the research topics of 'Analysing the Multiple Timescale Recurrent Neural Network for Embodied Language Understanding'. Together they form a unique fingerprint.

Cite this