This media is not supported in your browser
VIEW IN TELEGRAM
π₯¬ "Perception Test" by #DeepMind π₯¬
πHuge dataset with obj & point tracks, temporal sounds, multiple & grounded vQA
πReview https://bit.ly/3Vqh96Q
πDataset github.com/deepmind/perception_test
πProject www.deepmind.com/blog/measuring-perception-in-ai-models
πHuge dataset with obj & point tracks, temporal sounds, multiple & grounded vQA
πReview https://bit.ly/3Vqh96Q
πDataset github.com/deepmind/perception_test
πProject www.deepmind.com/blog/measuring-perception-in-ai-models
π15π₯4π±3
This media is not supported in your browser
VIEW IN TELEGRAM
π Bootstrapping TAP π
π#Deepmind shows how large-scale, unlabeled, uncurated real-world data can improve TAP with minimal architectural changes, via a self-supervised student-teacher setup. Source Code released π
πReview https://t.ly/-S_ZL
πPaper arxiv.org/pdf/2402.00847.pdf
πCode https://github.com/google-deepmind/tapnet
π#Deepmind shows how large-scale, unlabeled, uncurated real-world data can improve TAP with minimal architectural changes, via a self-supervised student-teacher setup. Source Code released π
πReview https://t.ly/-S_ZL
πPaper arxiv.org/pdf/2402.00847.pdf
πCode https://github.com/google-deepmind/tapnet
π₯5π3π₯°1π€©1
This media is not supported in your browser
VIEW IN TELEGRAM
πΎTAPVid-3D: benchmark for TAP-3DπΎ
π#Deepmind (+College London & Oxford) introduces TAPVid-3D, a new benchmark for evaluating long-range Tracking Any Point in 3D: 4,000+ real-world videos, composed of three different data sources spanning a variety of object types, motion patterns, and indoor/outdoor environments. Data & Code available, Apache 2.0π
πReview https://t.ly/SsptD
πPaper arxiv.org/pdf/2407.05921
πProject tapvid3d.github.io/
πCode github.com/google-deepmind/tapnet/tree/main/tapnet/tapvid3d
π#Deepmind (+College London & Oxford) introduces TAPVid-3D, a new benchmark for evaluating long-range Tracking Any Point in 3D: 4,000+ real-world videos, composed of three different data sources spanning a variety of object types, motion patterns, and indoor/outdoor environments. Data & Code available, Apache 2.0π
πReview https://t.ly/SsptD
πPaper arxiv.org/pdf/2407.05921
πProject tapvid3d.github.io/
πCode github.com/google-deepmind/tapnet/tree/main/tapnet/tapvid3d
π₯3π1π€―1