AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🌼SOTA Textured 3D-Guided VTON🌼

πŸ‘‰#ALIBABA unveils 3DV-TON, a novel diffusion model for HQ and temporally consistent video. Generating animatable textured 3D meshes as explicit frame-level guidance, alleviating the issue of models over-focusing on appearance fidelity at the expanse of motion coherence. Code & benchmark to be releasedπŸ’™

πŸ‘‰Review https://t.ly/0tjdC
πŸ‘‰Paper https://lnkd.in/dFseYSXz
πŸ‘‰Project https://lnkd.in/djtqzrzs
πŸ‘‰Repo TBA
🀯9πŸ‘7❀4πŸ”₯2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🍏#Nvidia Dynamic Pose 🍏

πŸ‘‰Nvidia unveils DynPose-100K, the largest dataset of dynamic Internet videos annotated with camera poses. Dataset released under Nvidia licenseπŸ’™

πŸ‘‰Review https://t.ly/wrcb0
πŸ‘‰Paper https://lnkd.in/dycGjAyy
πŸ‘‰Project https://lnkd.in/dDZ2Ej_Q
πŸ€—Data https://lnkd.in/d8yUSB7m
πŸ”₯4πŸ‘2🀯1😍1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ S3MOT: SOTA 3D MOT πŸ”₯

πŸ‘‰S3MOT: Selective-State-Space model-based MOT that efficiently infers 3D motion and object associations from 2D images through three core components. New SOTA on KITTI with 76.86 HOTA at 31 FPS! Code & Weights to be released under MIT licenseπŸ’™

πŸ‘‰Review https://t.ly/H_JPv
πŸ‘‰Paper https://arxiv.org/pdf/2504.18068
πŸ‘‰Repo https://github.com/bytepioneerX/s3mot
πŸ”₯7😍2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ Diffusion Model <-> Depth πŸ”₯

πŸ‘‰ETH & CMU on how to turn a single-image latent diffusion model (LDM) into the SOTA video depth estimator: video depth without video models. Repo released under Apache 2.0 and HF demo availableπŸ’™

πŸ‘‰Review https://t.ly/sP9ma
πŸ‘‰Paper arxiv.org/pdf/2411.19189
πŸ‘‰Project rollingdepth.github.io/
πŸ‘‰Repo github.com/prs-eth/rollingdepth
πŸ€—Demo huggingface.co/spaces/prs-eth/rollingdepthhttps://t.ly/sP9ma
❀11πŸ”₯6πŸ‘3πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🩷Dance vs. #ComputerVision🩷

πŸ‘‰The Saint-Etienne university proposed a new 3D human body pose estimation pipeline to deal with dance analysis. Project page w/ results and interactive demo releasedπŸ’™

πŸ‘‰Review https://t.ly/JEdM3
πŸ‘‰Paper arxiv.org/pdf/2505.07249
πŸ‘‰Project https://lnkd.in/dD5dsMv5
❀8πŸ‘1πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ§žβ€β™€οΈGENMO: Generalist Human Motion πŸ§žβ€β™€οΈ

πŸ‘‰#Nvidia presents GENMO, a unified Generalist Model for Human Motion that bridges motion estimation and generation in a single framework. Conditioning on videos, 2D keypoints, text, music, and 3D keyframes. No code at the momentπŸ₯²

πŸ‘‰Review https://t.ly/Q5T_Y
πŸ‘‰Paper https://lnkd.in/ds36BY49
πŸ‘‰Project https://lnkd.in/dAYHhuFU
πŸ”₯12❀3πŸ‘2😒1😍1
Dear friends,
I’m truly sorry for being away from the group for so long. I know: no updates so far while AI is running faster than speed of light.

I’m going through a very difficult time in my life and I need some space to heal. This spare-time project (but important for a lot of people here) needs energy and commitment I don’t have right now. I’m sorry, be patient. I’ll be back.

Love u all,
Alessandro.
❀373πŸ‘27😒24