CTGAN Explained

A technical deep-dive into CTGAN: the conditional tabular GAN architecture, how it handles mixed data types and imbalanced columns, and when to use it.

Architecture Overview

CTGAN uses a conditional generator that takes as input both random noise and a one-hot encoding of a randomly sampled categorical value. The discriminator receives real and synthetic rows along with their conditional vectors. This forces the generator to learn to produce realistic samples conditioned on specific categorical values, improving coverage of rare categories.

Related Coverage