Tracking Emerges by Colorizing Videos

  • Google Research, 2018 ### Main Strategy
  • Use large amounts of unlabeled video to learn models for visual tracking
    • Without manual human supervision.
  • Colorization
    • We leverage the natural temporal coherency of color to create a model that learns to colorize gray-scale videos by copying colors from a reference frame.
    • Quantitative and qualitative experiments suggest that this task causes the model to automatically learn to track visual regions.
  • Outperform Optical flow
    • Although the model is trained without any ground-truth labels, our method learns to track well enough to outperform optical flow based methods.
  • Advancing Colorization
    • Our results suggest that failures to track are correlated with failures to colorize, indicating that advancing video colorization may further improve self-supervised visual tracking.

In [ ]: