Image-to-Image Translation with Conditional Adversarial Networks
- BAIR, UC Berkeley
### Background
Video-to-Video Synthesis
Background
- Image-to-Image Translation
- Without modeling temporal dynamics, directly applying existing image synthesis approaches to an input video often results in temporally incoherent videos of low visual quality.
Main Strategy
- with a spatio-temporal adversarial objective
- Through carefully-designed generator and discriminator architectures, coupled with a spatio-temporal adversarial objective, we achieve high-resolution, photorealistic, temporally coher- ent video results on a diverse set of input formats including segmentation masks, sketches, and poses.
- future video prediction
- Finally, we apply our approach to future video prediction, outperforming several state-of-the-art competing systems.