AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
236 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯¬ "Perception Test" by #DeepMind πŸ₯¬

πŸ‘‰Huge dataset with obj & point tracks, temporal sounds, multiple & grounded vQA

😎Review https://bit.ly/3Vqh96Q
😎Dataset github.com/deepmind/perception_test
😎Project www.deepmind.com/blog/measuring-perception-in-ai-models
πŸ‘15πŸ”₯4😱3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‡ Bootstrapping TAP πŸ‡

πŸ‘‰#Deepmind shows how large-scale, unlabeled, uncurated real-world data can improve TAP with minimal architectural changes, via a self-supervised student-teacher setup. Source Code released πŸ’™

πŸ‘‰Review https://t.ly/-S_ZL
πŸ‘‰Paper arxiv.org/pdf/2402.00847.pdf
πŸ‘‰Code https://github.com/google-deepmind/tapnet
πŸ”₯5πŸ‘3πŸ₯°1🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
🍾TAPVid-3D: benchmark for TAP-3D🍾

πŸ‘‰#Deepmind (+College London & Oxford) introduces TAPVid-3D, a new benchmark for evaluating long-range Tracking Any Point in 3D: 4,000+ real-world videos, composed of three different data sources spanning a variety of object types, motion patterns, and indoor/outdoor environments. Data & Code available, Apache 2.0πŸ’™

πŸ‘‰Review https://t.ly/SsptD
πŸ‘‰Paper arxiv.org/pdf/2407.05921
πŸ‘‰Project tapvid3d.github.io/
πŸ‘‰Code github.com/google-deepmind/tapnet/tree/main/tapnet/tapvid3d
πŸ”₯3πŸ‘1🀯1