AE-OT: A New Generative Model Based on Extended Semi-Discrete Optimal Transport

Zitieren

Technische Universität Braunschweig

Na, Lei

Formale Metadaten

Titel

AE-OT: A New Generative Model Based on Extended Semi-Discrete Optimal Transport

Serientitel

International Congress on Mathematical Software (ICMS) 2020

Anzahl der Teile

Autor

Na, Lei

0000-0003-3361-0756 (ORCID)

Lizenz

CC-Namensnennung 3.0 Deutschland:
Sie dürfen das Werk bzw. den Inhalt zu jedem legalen Zweck nutzen, verändern und in unveränderter oder veränderter Form vervielfältigen, verbreiten und öffentlich zugänglich machen, sofern Sie den Namen des Autors/Rechteinhabers in der von ihm festgelegten Weise nennen.

Identifikatoren

10.5446/47999 (DOI)

Herausgeber

Technische Universität Braunschweig

Erscheinungsjahr

2020

Sprache

Englisch

Produzent

Lei, Na

Produktionsjahr

2020

Produktionsort

China

Inhaltliche Metadaten

Fachgebiet

Informatik

Genre

Konferenz/Talk

Abstract

Current generative models like generative adversarial networks (GANs) and variational autoencoders (VAEs) have attracted huge attention due to its capability to generate visual realistic images. However, most of the existing models suffer from the mode collapse or mode mixture problems. In this work, we give a theoretic explanation of the both problems by Figalli’s regularity theory of optimal transportation maps. Basically, the generator compute the transportation maps between the white noise distributions and the data distributions, which are in general discontinuous. However, deep neural networks (DNNs) can only represent continuous maps. This intrinsic conflict induces mode collapse and mode mixture. In order to tackle the both problems, we explicitly separate the manifold embedding and the optimal transportation; the first part is carried out using an autoencoder (AE) to map the images onto the latent space; the second part is accomplished using a GPU-based convex optimization to find the discontinuous transportation maps. Composing the extended optimal transport (OT) map and the decoder, we can finally generate new images from the white noise. This AE-OT model avoids representing discontinuous maps by DNNs, therefore effectively prevents mode collapse and mode mixture.

Schlagwörter

GENERATIVE MODEL

OPTIMAL TRANSPORT

Manifold Distribution Hypothesis

Mode Collapse