Hello everybody,
a lot of you asked me to re-open the sharing of the contents to involve more people. I want to follow your suggestion, hope you will enjoy this new mood!
đ FREE TO FORWARD TO OTHER TELEGRAM CHANNELS
đĨ NO COPY OF THE POSTS
đĨ NO COMMERCIAL USAGE
đĨ NO UNRESPECTFUL USAGE
â ī¸ UNDO THE FORWARDING OPTION AT THE FIRST VIOLATION â ī¸
a lot of you asked me to re-open the sharing of the contents to involve more people. I want to follow your suggestion, hope you will enjoy this new mood!
đ FREE TO FORWARD TO OTHER TELEGRAM CHANNELS
đĨ NO COPY OF THE POSTS
đĨ NO COMMERCIAL USAGE
đĨ NO UNRESPECTFUL USAGE
â ī¸ UNDO THE FORWARDING OPTION AT THE FIRST VIOLATION â ī¸
â¤19đ10đ3đĨ°1đž1
This media is not supported in your browser
VIEW IN TELEGRAM
𩰠Magic Animating Human đа
đMagicAnimate: the new SOTA in human animation. Code available: let's dance!
đReview https://t.ly/Oq7Za
đPaper https://lnkd.in/dSUbGgCs
đProject https://lnkd.in/dkVFf-SV
đCode https://lnkd.in/dj2dbzdg
đDemo https://lnkd.in/dHEKPE9q
đMagicAnimate: the new SOTA in human animation. Code available: let's dance!
đReview https://t.ly/Oq7Za
đPaper https://lnkd.in/dSUbGgCs
đProject https://lnkd.in/dkVFf-SV
đCode https://lnkd.in/dj2dbzdg
đDemo https://lnkd.in/dHEKPE9q
đ¤¯6â¤2đ1đĨ1đĨ°1
This media is not supported in your browser
VIEW IN TELEGRAM
đĨ EfficientSAM: 20x faster Segment Anything đĨ
đMeta AI Research unveils a novel family of SAM-like models, light-weight SAM models with SOTA quality-efficiency trade-offs. Up to 20x faster!
đReview https://t.ly/966QS
đPaper https://lnkd.in/duijp_Rh
đProject https://lnkd.in/dW-p2CuH
đCode https://lnkd.in/dAbZaB2t
đDemo https://lnkd.in/d-tjKiUd
đMeta AI Research unveils a novel family of SAM-like models, light-weight SAM models with SOTA quality-efficiency trade-offs. Up to 20x faster!
đReview https://t.ly/966QS
đPaper https://lnkd.in/duijp_Rh
đProject https://lnkd.in/dW-p2CuH
đCode https://lnkd.in/dAbZaB2t
đDemo https://lnkd.in/d-tjKiUd
đĨ15â¤4đ4đ¤¯2
This media is not supported in your browser
VIEW IN TELEGRAM
đĢļ3D Hands with TransformersđĢļ
đ HaMeR is a robust and accurate Hand Mesh Recovery from images and video frames, based on Transformer architecture. It's the new SOTA.
đReview https://t.ly/YtAW8
đPaper https://arxiv.org/pdf/2312.05251.pdf
đProject https://geopavlakos.github.io/hamer
đDemo huggingface.co/spaces/geopavlakos/HaMeR
đColab colab.research.google.com/drive/1rQbQzegFWGVOm1n1d-S6koOWDo7F2ucu
đ HaMeR is a robust and accurate Hand Mesh Recovery from images and video frames, based on Transformer architecture. It's the new SOTA.
đReview https://t.ly/YtAW8
đPaper https://arxiv.org/pdf/2312.05251.pdf
đProject https://geopavlakos.github.io/hamer
đDemo huggingface.co/spaces/geopavlakos/HaMeR
đColab colab.research.google.com/drive/1rQbQzegFWGVOm1n1d-S6koOWDo7F2ucu
đ10â¤1đ1đ¤¯1đą1
This media is not supported in your browser
VIEW IN TELEGRAM
đĒŠ DreaMoving: Human Dancer đĒŠ
đAlibaba strikes again with DreaMoving: a diffusion-based controllable video generation framework to produce HQ customized human videos.
đReview https://t.ly/BD_Yf
đPaper https://lnkd.in/gepP6Rjw
đProject https://lnkd.in/gwm72cfS
đRepo (empty) https://lnkd.in/gsc2Qt-F
đAlibaba strikes again with DreaMoving: a diffusion-based controllable video generation framework to produce HQ customized human videos.
đReview https://t.ly/BD_Yf
đPaper https://lnkd.in/gepP6Rjw
đProject https://lnkd.in/gwm72cfS
đRepo (empty) https://lnkd.in/gsc2Qt-F
đ7đŠ6â¤2đĨ°1
This media is not supported in your browser
VIEW IN TELEGRAM
đ˛ EdgeSAM: Mobile 40x SAM đ˛
đA novel hyper-optimized version of SAM for mobile devices such as #Iphone. Pure CNNs backbone (better suitable for ANE), up to 40x faster. Code available đ
đReview https://t.ly/m_vLH
đPaper https://lnkd.in/gHZVZN2x
đProject https://lnkd.in/gK8qEK8p
đRepo https://lnkd.in/gj6YAGNv
đHugging Face https://lnkd.in/gUUHJvxz
đA novel hyper-optimized version of SAM for mobile devices such as #Iphone. Pure CNNs backbone (better suitable for ANE), up to 40x faster. Code available đ
đReview https://t.ly/m_vLH
đPaper https://lnkd.in/gHZVZN2x
đProject https://lnkd.in/gK8qEK8p
đRepo https://lnkd.in/gj6YAGNv
đHugging Face https://lnkd.in/gUUHJvxz
đĨ20âĄ2â¤2đ¤Š1
This media is not supported in your browser
VIEW IN TELEGRAM
đĒŧPatchFusion: SOTA Mono-DepthđĒŧ
đPatchFusion: novel end-to-end tile-based framework for hi-res monocular metric depth estimation. It's the new SOTA in metric depth estimation from mono. Code & Demo on Hugging Face able đĨ
đReview https://t.ly/hv3yT
đPaper https://lnkd.in/d9dXP7iP
đProject https://lnkd.in/dQcvVJSx
đRepo https://lnkd.in/dW2GdVR5
đDemo https://lnkd.in/dFW-gAiY
đPatchFusion: novel end-to-end tile-based framework for hi-res monocular metric depth estimation. It's the new SOTA in metric depth estimation from mono. Code & Demo on Hugging Face able đĨ
đReview https://t.ly/hv3yT
đPaper https://lnkd.in/d9dXP7iP
đProject https://lnkd.in/dQcvVJSx
đRepo https://lnkd.in/dW2GdVR5
đDemo https://lnkd.in/dFW-gAiY
đĨ10â¤5đ1đ¤¯1đą1
This media is not supported in your browser
VIEW IN TELEGRAM
đOutfit Anyone: Ultra-HQ VTOđ
đAlibaba unveils Outfit Anyone: a two-stream conditional diffusion able to adeptly handle garment deformation for more lifelike results in VOT. Extra: Outfit Anyone + Animate Anyone for outfit + motion generation of any character. NO CODE / NO PAPER / DEMO AVAILABLE :)
đReview https://t.ly/o6UR9
đDemo https://lnkd.in/dpQYdXhc
đRepo (empty) https://lnkd.in/dBsNST6r
đAlibaba unveils Outfit Anyone: a two-stream conditional diffusion able to adeptly handle garment deformation for more lifelike results in VOT. Extra: Outfit Anyone + Animate Anyone for outfit + motion generation of any character. NO CODE / NO PAPER / DEMO AVAILABLE :)
đReview https://t.ly/o6UR9
đDemo https://lnkd.in/dpQYdXhc
đRepo (empty) https://lnkd.in/dBsNST6r
đ¤¯10đ4â¤3đĨ2
đĨ #AIwithPapers: we are 8k+ đĨ
đ After flirting with #ChatGpt for months, you back in love with this channel. I felt bad, but I forgive you đ§Ą
đ Hey Telegram Premium Subscribers, what about boosting us? Click: https://t.me/AI_DeepLearning?boost
đ Invite -> https://t.me/AI_DeepLearning
đ After flirting with #ChatGpt for months, you back in love with this channel. I felt bad, but I forgive you đ§Ą
đ Hey Telegram Premium Subscribers, what about boosting us? Click: https://t.me/AI_DeepLearning?boost
đ Invite -> https://t.me/AI_DeepLearning
â¤16đ¤Ŗ7đĨ1đĨ°1
This media is not supported in your browser
VIEW IN TELEGRAM
đ§ Depth Conditioning đ§
đLooseControl to control the generative image modeling process. Layout by boundaries and #3D box control via object locations (approximate bounding boxes)
đReview https://t.ly/9y72m
đPaper https://arxiv.org/pdf/2312.03079.pdf
đProject https://shariqfarooq123.github.io/loose-control/
đRepo https://github.com/shariqfarooq123/LooseControl
đLooseControl to control the generative image modeling process. Layout by boundaries and #3D box control via object locations (approximate bounding boxes)
đReview https://t.ly/9y72m
đPaper https://arxiv.org/pdf/2312.03079.pdf
đProject https://shariqfarooq123.github.io/loose-control/
đRepo https://github.com/shariqfarooq123/LooseControl
đĨ14â¤6đ¤¯4đ1đĨ°1
This media is not supported in your browser
VIEW IN TELEGRAM
đ˛ī¸ Amodal Tracking Any Object đ˛ī¸
đAmodal tracking": inferring complete object boundaries, even when certain portions are occluded. New benchmark & approach, 2x better than SOTA in people tracking đĨ
đReview https://t.ly/Rc6Ku
đPaper https://lnkd.in/d39rFYT4
đProject https://lnkd.in/d7bkEcni
đ(empty) Repo https://lnkd.in/dTsNKdfz
đAmodal tracking": inferring complete object boundaries, even when certain portions are occluded. New benchmark & approach, 2x better than SOTA in people tracking đĨ
đReview https://t.ly/Rc6Ku
đPaper https://lnkd.in/d39rFYT4
đProject https://lnkd.in/d7bkEcni
đ(empty) Repo https://lnkd.in/dTsNKdfz
â¤16đ¤¯8đĨ3đ2đ1đą1
This media is not supported in your browser
VIEW IN TELEGRAM
đŋ Event-Cam (1000 fps) Hands đŋ
đEv2Hands, the first method for the 3D reconstruction of two interacting hands from a single event camera. Code available.
đReview https://t.ly/YpQpX
đPaper arxiv.org/pdf/2312.14157.pdf
đProject 4dqv.mpi-inf.mpg.de/Ev2Hands
đRepo github.com/Chris10M/Ev2Hands
đEv2Hands, the first method for the 3D reconstruction of two interacting hands from a single event camera. Code available.
đReview https://t.ly/YpQpX
đPaper arxiv.org/pdf/2312.14157.pdf
đProject 4dqv.mpi-inf.mpg.de/Ev2Hands
đRepo github.com/Chris10M/Ev2Hands
đĨ3â¤2đ2đ1
This media is not supported in your browser
VIEW IN TELEGRAM
đUniSDF: Unifying Neural Representationsđ
đUniSDF: novel general purpose 3D reconstruction for large complex scenes with reflections. SOTA on DTU, Shiny Blender, Mip-NeRF 360 and Ref-NeRF dataset.
đReview https://t.ly/2QEul
đPaper https://arxiv.org/pdf/2312.13285.pdf
đProject https://fangjinhuawang.github.io/UniSDF/
đRepo: No code :(
đUniSDF: novel general purpose 3D reconstruction for large complex scenes with reflections. SOTA on DTU, Shiny Blender, Mip-NeRF 360 and Ref-NeRF dataset.
đReview https://t.ly/2QEul
đPaper https://arxiv.org/pdf/2312.13285.pdf
đProject https://fangjinhuawang.github.io/UniSDF/
đRepo: No code :(
đĨ7đ2â¤1đĨ°1đ¤¯1
This media is not supported in your browser
VIEW IN TELEGRAM
đĒŽHAAR: Text-Driven Generative HairstylesđĒŽ
đ HAAR: new strand-based generative model for #3D human hairstyles driven by textual input.
đReview https://t.ly/L38iD
đProject https://haar.is.tue.mpg.de/
đPaper https://arxiv.org/pdf/2312.11666.pdf
đRepo coming
đ HAAR: new strand-based generative model for #3D human hairstyles driven by textual input.
đReview https://t.ly/L38iD
đProject https://haar.is.tue.mpg.de/
đPaper https://arxiv.org/pdf/2312.11666.pdf
đRepo coming
đ¤¯4đž3đ2đĨ1
This media is not supported in your browser
VIEW IN TELEGRAM
đǞUniRef++: Segment Every ReferenceđǞ
đ UniRef++ is a unified model for RIS, FSS, RVOS & VOS. Code available!
đReview https://t.ly/OxtOx
đPaper https://lnkd.in/eTrmDTK3
đRepo https://lnkd.in/etfTm4Wq
đ UniRef++ is a unified model for RIS, FSS, RVOS & VOS. Code available!
đReview https://t.ly/OxtOx
đPaper https://lnkd.in/eTrmDTK3
đRepo https://lnkd.in/etfTm4Wq
đ11â¤3đ¤¯3âĄ1
This media is not supported in your browser
VIEW IN TELEGRAM
đ Seeing Through Occlusions đ
đNovel NSF to see through occlusions, reflection suppression & shadow removal.
đReview https://t.ly/5jcIG
đProject https://light.princeton.edu/publication/nsf
đPaper https://arxiv.org/pdf/2312.14235.pdf
đRepo https://github.com/princeton-computational-imaging/NSF
đNovel NSF to see through occlusions, reflection suppression & shadow removal.
đReview https://t.ly/5jcIG
đProject https://light.princeton.edu/publication/nsf
đPaper https://arxiv.org/pdf/2312.14235.pdf
đRepo https://github.com/princeton-computational-imaging/NSF
â¤10đ¤¯7đĨ3đž1
This media is not supported in your browser
VIEW IN TELEGRAM
đģ Avatar Behind Occlusions đģ
đNeural rendering for occluded in-the-wild mono-videos. Decoupling scenes in occlusion, human, and background.
đReview https://t.ly/8q__B
đPaper https://arxiv.org/pdf/2401.00431.pdf
đProject https://cs.stanford.edu/~xtiange/projects/wild2avatar
đNeural rendering for occluded in-the-wild mono-videos. Decoupling scenes in occlusion, human, and background.
đReview https://t.ly/8q__B
đPaper https://arxiv.org/pdf/2401.00431.pdf
đProject https://cs.stanford.edu/~xtiange/projects/wild2avatar
đĨ11â¤3đ1đ¤Š1
This media is not supported in your browser
VIEW IN TELEGRAM
đ En3D: Generative 3D Humans đ
đ#Alibaba unveils En3D: generative scheme for sculpting HQ 3D human avatars. Zero-shot 3D generative scheme capable of producing visually realistic, geometrically accurate and content-wise diverse 3D humans without relying on pre-existing 3D or 2D asset.
đReview https://t.ly/nGmDK
đProject menyifang.github.io/projects/En3D/index.html
đPaper https://arxiv.org/pdf/2401.01173.pdf
đRepo (soon?) https://github.com/menyifang/En3D
đ#Alibaba unveils En3D: generative scheme for sculpting HQ 3D human avatars. Zero-shot 3D generative scheme capable of producing visually realistic, geometrically accurate and content-wise diverse 3D humans without relying on pre-existing 3D or 2D asset.
đReview https://t.ly/nGmDK
đProject menyifang.github.io/projects/En3D/index.html
đPaper https://arxiv.org/pdf/2401.01173.pdf
đRepo (soon?) https://github.com/menyifang/En3D
đ¤¯5â¤3đĨ1
This media is not supported in your browser
VIEW IN TELEGRAM
đ¤ MagicVideo-V2 announced! đ¤
đ#Bytedance announces a novel multi-stage pipeline capable of generating high-aesthetic videos from textual description
đReview https://t.ly/zIq4v
đProject https://lnkd.in/dKUrJPJd
đPaper https://lnkd.in/dixnN-kU
đ#Bytedance announces a novel multi-stage pipeline capable of generating high-aesthetic videos from textual description
đReview https://t.ly/zIq4v
đProject https://lnkd.in/dKUrJPJd
đPaper https://lnkd.in/dixnN-kU
đĨ7â¤1đ1đĨ°1đŠ1
This media is not supported in your browser
VIEW IN TELEGRAM
đĨ #6D Foundation Pose đĨ
đ#Nvidia unveils FoundationPose, a novel (and unified) foundation model for 6D object pose estimation and tracking.
đReview https://t.ly/HGd4h
đProject https://lnkd.in/dPcnBKWm
đPaper https://lnkd.in/dixn_iHZ
đCode coming đЎ
đ#Nvidia unveils FoundationPose, a novel (and unified) foundation model for 6D object pose estimation and tracking.
đReview https://t.ly/HGd4h
đProject https://lnkd.in/dPcnBKWm
đPaper https://lnkd.in/dixn_iHZ
đCode coming đЎ
đĨ12â¤5đ1đ¤¯1