V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning
V-JEPA 2 is a self-supervised video model, trained on over 1 million hours of internet video, that achieves state-of-the-art performance on motion understanding and video question-answering tasks. The model can be adapted …