Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
In this paper, the author begins by describing how GANs are superior alternatives to
maximum likelihood techniques for unsupervised learning tasks like image
classification. Since they are unstable to train, this paper puts forth a new architecture
called Deep Convolutional GAN which makes training stable in most settings. The
author also addresses other issues like image generation and visualizing the internals of
neural networks which have been historically unsuccessful.
Three essential changes are required to the CNN architecture: 1. An all convolutional
net in the generator which replaces deterministic spatial pooling functions with strided
convolutions, allowing the network to learn its own spatial downsampling, 2. Eliminating
fully connected layers on top of convolutional feature and 3. Batch normalisation. ReLU
activation is used in generator for all layers except for the output, which uses Tanh
function and LeakyReLU activation in the discriminator for all layers. Also remove fully
connected hidden layers for deeper architectures.
The authors conclude by describing the future work of tackling instability introduced by a
subset of filters collapsing to single oscillating mode. Application of DCGAN on other
domains like video and audio and extensive research on latent space would be
interesting.