This media is not supported in your browser
VIEW IN TELEGRAM
π₯ Diffusion Model <-> Depth π₯
πETH & CMU on how to turn a single-image latent diffusion model (LDM) into the SOTA video depth estimator: video depth without video models. Repo released under Apache 2.0 and HF demo availableπ
πReview https://t.ly/sP9ma
πPaper arxiv.org/pdf/2411.19189
πProject rollingdepth.github.io/
πRepo github.com/prs-eth/rollingdepth
π€Demo huggingface.co/spaces/prs-eth/rollingdepthhttps://t.ly/sP9ma
πETH & CMU on how to turn a single-image latent diffusion model (LDM) into the SOTA video depth estimator: video depth without video models. Repo released under Apache 2.0 and HF demo availableπ
πReview https://t.ly/sP9ma
πPaper arxiv.org/pdf/2411.19189
πProject rollingdepth.github.io/
πRepo github.com/prs-eth/rollingdepth
π€Demo huggingface.co/spaces/prs-eth/rollingdepthhttps://t.ly/sP9ma
β€11π₯6π3π1
This media is not supported in your browser
VIEW IN TELEGRAM
π©·Dance vs. #ComputerVisionπ©·
πThe Saint-Etienne university proposed a new 3D human body pose estimation pipeline to deal with dance analysis. Project page w/ results and interactive demo releasedπ
πReview https://t.ly/JEdM3
πPaper arxiv.org/pdf/2505.07249
πProject https://lnkd.in/dD5dsMv5
πThe Saint-Etienne university proposed a new 3D human body pose estimation pipeline to deal with dance analysis. Project page w/ results and interactive demo releasedπ
πReview https://t.ly/JEdM3
πPaper arxiv.org/pdf/2505.07249
πProject https://lnkd.in/dD5dsMv5
β€8π1π₯1
This media is not supported in your browser
VIEW IN TELEGRAM
π§ββοΈGENMO: Generalist Human Motion π§ββοΈ
π#Nvidia presents GENMO, a unified Generalist Model for Human Motion that bridges motion estimation and generation in a single framework. Conditioning on videos, 2D keypoints, text, music, and 3D keyframes. No code at the momentπ₯²
πReview https://t.ly/Q5T_Y
πPaper https://lnkd.in/ds36BY49
πProject https://lnkd.in/dAYHhuFU
π#Nvidia presents GENMO, a unified Generalist Model for Human Motion that bridges motion estimation and generation in a single framework. Conditioning on videos, 2D keypoints, text, music, and 3D keyframes. No code at the momentπ₯²
πReview https://t.ly/Q5T_Y
πPaper https://lnkd.in/ds36BY49
πProject https://lnkd.in/dAYHhuFU
π₯12β€3π2π’1π1
Dear friends,
Iβm truly sorry for being away from the group for so long. I know: no updates so far while AI is running faster than speed of light.
Iβm going through a very difficult time in my life and I need some space to heal. This spare-time project (but important for a lot of people here) needs energy and commitment I donβt have right now. Iβm sorry, be patient. Iβll be back.
Love u all,
Alessandro.
Iβm truly sorry for being away from the group for so long. I know: no updates so far while AI is running faster than speed of light.
Iβm going through a very difficult time in my life and I need some space to heal. This spare-time project (but important for a lot of people here) needs energy and commitment I donβt have right now. Iβm sorry, be patient. Iβll be back.
Love u all,
Alessandro.
β€360π27π’23