Image-to-Image Translation with Conditional Adversarial Networks

Video-to-Video Synthesis

Image-to-Image Translation
- Without modeling temporal dynamics, directly applying existing image synthesis approaches to an input video often results in temporally incoherent videos of low visual quality.

with a spatio-temporal adversarial objective
- Through carefully-designed generator and discriminator architectures, coupled with a spatio-temporal adversarial objective, we achieve high-resolution, photorealistic, temporally coherent video results on a diverse set of input formats including segmentation masks, sketches, and poses.
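
To make the idea of a spatio-temporal adversarial objective more concrete, here is a minimal sketch in plain Python. It assumes the discriminator loss is a sum of a per-frame (spatial) term and a per-clip (temporal) term, each a standard binary cross-entropy GAN loss. The names `sliding_clips`, `spatio_temporal_d_loss`, and `temporal_weight`, and the choice of cross-entropy, are illustrative assumptions and are not taken from the paper, which may weight or formulate the terms differently.

```python
import math


def bce(prob, is_real):
    """Binary cross-entropy for one discriminator output probability."""
    eps = 1e-12  # avoids log(0)
    return -math.log(prob + eps) if is_real else -math.log(1.0 - prob + eps)


def sliding_clips(frames, k):
    """Group a frame sequence into overlapping clips of k consecutive frames.

    A temporal discriminator would score each clip as a whole, so it can
    penalize flicker that a per-frame discriminator cannot see.
    """
    return [frames[i:i + k] for i in range(len(frames) - k + 1)]


def mean(values):
    return sum(values) / len(values)


def spatio_temporal_d_loss(frame_real, frame_fake, clip_real, clip_fake,
                           temporal_weight=1.0):
    """Hypothetical discriminator loss: spatial term + weighted temporal term.

    frame_* are probabilities from a per-frame discriminator,
    clip_* are probabilities from a per-clip discriminator.
    """
    spatial = (mean([bce(p, True) for p in frame_real])
               + mean([bce(p, False) for p in frame_fake]))
    temporal = (mean([bce(p, True) for p in clip_real])
                + mean([bce(p, False) for p in clip_fake]))
    return spatial + temporal_weight * temporal


# Example: four frames grouped into clips of three consecutive frames.
clips = sliding_clips(["f0", "f1", "f2", "f3"], 3)

# An undecided discriminator (0.5 everywhere) gives the chance-level loss.
loss = spatio_temporal_d_loss([0.5, 0.5], [0.5, 0.5], [0.5], [0.5])
```

In this sketch, setting `temporal_weight=0.0` reduces the objective to a purely per-frame image loss, which is the setting the notes above describe as producing temporally incoherent video.
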
future video prediction
- Finally, we apply our approach to future video prediction, outperforming several state-of-the-art competing systems.


