ViSAGE wins NTIRE 2026 video saliency prediction challenge with multi-expert ensemble

JO
James Okafor
AI Research CorrespondentArXiv CS.CVVerified across 1 source

The Brief

Researchers unveiled ViSAGE, a multi-expert framework that won the NTIRE 2026 Challenge on Video Saliency Prediction at CVPR 2026. The model uses adaptive gating across specialized decoders to capture complex spatio-temporal patterns in videos, ranking first on half the evaluation metrics and outperforming competitors on others.
Verified across 1 independent source
The DeepBrief Daily
5 verified AI stories, every morning. No noise, no fluff. Free forever.