GAN

Basic support for Generative Adversarial Networks

GAN stands for Generative Adversarial Nets and were invented by Ian Goodfellow. The concept is that we train two models at the same time: a generator and a critic. The generator will try to make new images similar to the ones in a dataset, and the critic will try to classify real images from the ones the generator does. The generator returns images, the critic a single number (usually a probability, 0. for fake images and 1. for real ones).

We train them against each other in the sense that at each step (more or less), we:

Freeze the generator and train the critic for one step by:

getting one batch of true images (let’s call that real)
generating one batch of fake images (let’s call that fake)
have the critic evaluate each batch and compute a loss function from that; the important part is that it rewards positively the detection of real images and penalizes the fake ones
update the weights of the critic with the gradients of this loss

Freeze the critic and train the generator for one step by:

generating one batch of fake images
evaluate the critic on it
return a loss that rewards positively the critic thinking those are real images
update the weights of the generator with the gradients of this loss

Note

The fastai library provides support for training GANs through the GANTrainer, but doesn’t include more than basic models.

	Type	Default	Details
generator	Module	None	The generator PyTorch module
critic	Module	None	The discriminator PyTorch module
gen_mode	None \| bool	False	Whether the GAN should be set to generator mode

	Type	Default	Details
in_size	int		Input size for the critic (same as the output size of the generator)
n_channels	int		Number of channels of the input for the critic
n_features	int	64	Number of features used in the critic
n_extra_layers	int	0	Number of extra hidden layers in the critic
norm_type	NormType	NormType.Batch	Type of normalization to use in the critic
ks	int	3
stride	int	1
padding	NoneType	None
bias	NoneType	None
ndim	int	2
bn_1st	bool	True
act_cls	type	ReLU
transpose	bool	False
init	str	auto
xtra	NoneType	None
bias_std	float	0.01
dilation	Union	1
groups	int	1
padding_mode	str	zeros	TODO: refine this type
device	NoneType	None
dtype	NoneType	None
Returns	nn.Sequential

	Type	Default	Details
out_size	int		Output size for the generator (same as the input size for the critic)
n_channels	int		Number of channels of the output of the generator
in_sz	int	100	Size of the input noise vector for the generator
n_features	int	64	Number of features used in the generator
n_extra_layers	int	0	Number of extra hidden layers in the generator
ks	int	3
stride	int	1
padding	NoneType	None
bias	NoneType	None
ndim	int	2
norm_type	NormType	NormType.Batch
bn_1st	bool	True
act_cls	type	ReLU
transpose	bool	False
init	str	auto
xtra	NoneType	None
bias_std	float	0.01
dilation	Union	1
groups	int	1
padding_mode	str	zeros	TODO: refine this type
device	NoneType	None
dtype	NoneType	None
Returns	nn.Sequential

	Type	Default	Details
nf	int		Number of features
norm_type	NormType	NormType.Batch	Normalization type
ks	int	3
stride	int	1
padding	NoneType	None
bias	NoneType	None
ndim	int	2
bn_1st	bool	True
act_cls	type	ReLU
transpose	bool	False
init	str	auto
xtra	NoneType	None
bias_std	float	0.01
dilation	Union	1
groups	int	1
padding_mode	str	zeros	TODO: refine this type
device	NoneType	None
dtype	NoneType	None
Returns	SequentialEx

Wrapping the modules

GANModule

GANModule.switch

basic_critic

AddChannels

basic_generator

DenseResBlock

gan_critic

GANLoss

GANLoss.generator

GANLoss.critic

AdaptiveLoss

accuracy_thresh_expand

Callbacks for GAN training

set_freeze_model

GANTrainer

FixedGANSwitcher

AdaptiveGANSwitcher

GANDiscriminativeLR

GAN data

InvisibleTensor

generate_noise

GAN Learner

gan_loss_from_func

GANLearner

GANLearner.from_learners

GANLearner.wgan