Gated Multimodal Units for Information Fusion

Yoel Zeldes
Towards Data Science
5 min readMay 5, 2018

--

The output of the GMU architecture

Deep learning has proven its superiority in many domains, in a variety of tasks such as image classification and text generation. Dealing with tasks that involve inputs from multiple modalities is an interesting research area.

The Gated Multimodal Unit (GMU) is a new building block proposed by a recent paper, which is presented in ICLR 2017 as a workshop. The goal of this building…

--

--