Robots need to be able to continually learn from natural interactions, which are inherently multimodal or multi- sensory (e.g. hearing, vision). Here, we report the development of a deep multimodal autoencoder, capable of unsupervised learning. We created a joint vector space combining labels and images, using a multimodal autoencoder, which maps from images to labels and vice versa. We achieved a label prediction accuracy of 96.9% in the image-only testing condition, where the state-of-the-art is 99.79% and uses a committee of 5 neural networks. In robotics sensory data, is often cheap whilst computation is expensive, thus multimodal systems which require less computation to achieve comparable recognition rates are highly desirable.

(2018). Towards life long learning: Multimodal learning of mnist handwritten digits. . Retrieved from http://hdl.handle.net/10446/215799

Towards life long learning: Multimodal learning of mnist handwritten digits.

Lehmann, Hagen;
2018-01-01

Abstract

Robots need to be able to continually learn from natural interactions, which are inherently multimodal or multi- sensory (e.g. hearing, vision). Here, we report the development of a deep multimodal autoencoder, capable of unsupervised learning. We created a joint vector space combining labels and images, using a multimodal autoencoder, which maps from images to labels and vice versa. We achieved a label prediction accuracy of 96.9% in the image-only testing condition, where the state-of-the-art is 99.79% and uses a committee of 5 neural networks. In robotics sensory data, is often cheap whilst computation is expensive, thus multimodal systems which require less computation to achieve comparable recognition rates are highly desirable.
2018
Sheppard, Eli; Lehmann, Hagen; Rajendran, G.; Mckenna, Peter E.; Lemon, Oliver; Lohan, Katrin S.
File allegato/i alla scheda:
File Dimensione del file Formato  
2018 - Sheppard - Towards Life Long Learning_Multimodal Learning of MNIST Handwritten Digits.pdf

Solo gestori di archivio

Versione: publisher's version - versione editoriale
Licenza: Licenza default Aisberg
Dimensione del file 987.82 kB
Formato Adobe PDF
987.82 kB Adobe PDF   Visualizza/Apri
Pubblicazioni consigliate

Aisberg ©2008 Servizi bibliotecari, Università degli studi di Bergamo | Terms of use/Condizioni di utilizzo

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10446/215799
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact