PatchFusion: SOTA Mono-Depth
PatchFusion: a novel end-to-end tile-based framework for high-resolution monocular metric depth estimation, presented as the new SOTA in metric depth estimation from a single image. Code & demo available on Hugging Face.
Review https://t.ly/hv3yT
Paper https://lnkd.in/d9dXP7iP
Project https://lnkd.in/dQcvVJSx
Repo https://lnkd.in/dW2GdVR5
Demo https://lnkd.in/dFW-gAiY

Outfit Anyone: Ultra-HQ VTO
Alibaba unveils Outfit Anyone: a two-stream conditional diffusion model that adeptly handles garment deformation for more lifelike results in virtual try-on (VTO). Extra: Outfit Anyone + Animate Anyone for outfit + motion generation of any character. NO CODE / NO PAPER / DEMO AVAILABLE :)
Review https://t.ly/o6UR9
Demo https://lnkd.in/dpQYdXhc
Repo (empty) https://lnkd.in/dBsNST6r

#AIwithPapers: we are 8k+
After flirting with #ChatGPT for months, you're back in love with this channel. I felt bad, but I forgive you.
Hey Telegram Premium subscribers, what about boosting us? Click: https://t.me/AI_DeepLearning?boost
Invite -> https://t.me/AI_DeepLearning

Depth Conditioning
LooseControl: loose depth conditioning to control the generative image modeling process. Scene layout is specified by boundaries, and objects by #3D boxes giving their locations (approximate bounding boxes).
Review https://t.ly/9y72m
Paper https://arxiv.org/pdf/2312.03079.pdf
Project https://shariqfarooq123.github.io/loose-control/
Repo https://github.com/shariqfarooq123/LooseControl

Amodal Tracking Any Object
"Amodal tracking": inferring complete object boundaries, even when certain portions are occluded. New benchmark & approach, reported as 2x better than SOTA in people tracking.
Review https://t.ly/Rc6Ku
Paper https://lnkd.in/d39rFYT4
Project https://lnkd.in/d7bkEcni
Repo (empty) https://lnkd.in/dTsNKdfz

Event-Cam (1000 fps) Hands
Ev2Hands, the first method for the 3D reconstruction of two interacting hands from a single event camera. Code available.
Review https://t.ly/YpQpX
Paper arxiv.org/pdf/2312.14157.pdf
Project 4dqv.mpi-inf.mpg.de/Ev2Hands
Repo github.com/Chris10M/Ev2Hands

UniSDF: Unifying Neural Representations
UniSDF: a novel general-purpose 3D reconstruction method for large, complex scenes with reflections. SOTA on the DTU, Shiny Blender, Mip-NeRF 360 and Ref-NeRF datasets.
Review https://t.ly/2QEul
Paper https://arxiv.org/pdf/2312.13285.pdf
Project https://fangjinhuawang.github.io/UniSDF/
Repo: No code :(

HAAR: Text-Driven Generative Hairstyles
HAAR: a new strand-based generative model for #3D human hairstyles driven by textual input.
Review https://t.ly/L38iD
Project https://haar.is.tue.mpg.de/
Paper https://arxiv.org/pdf/2312.11666.pdf
Repo coming
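
UniRef++: Segment Every Reference
UniRef++ is a unified model for referring image segmentation (RIS), few-shot segmentation (FSS), referring video object segmentation (RVOS) & video object segmentation (VOS). Code available!
Review https://t.ly/OxtOx
Paper https://lnkd.in/eTrmDTK3
Repo https://lnkd.in/etfTm4Wq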

Seeing Through Occlusions
Novel neural spline fields (NSF) for seeing through occlusions, suppressing reflections & removing shadows.
Review https://t.ly/5jcIG
Project https://light.princeton.edu/publication/nsf
Paper https://arxiv.org/pdf/2312.14235.pdf
Repo https://github.com/princeton-computational-imaging/NSF

Avatar Behind Occlusions
Neural rendering of humans from occluded in-the-wild monocular videos, decoupling each scene into occlusion, human, and background.
Review https://t.ly/8q__B
Paper https://arxiv.org/pdf/2401.00431.pdf
Project https://cs.stanford.edu/~xtiange/projects/wild2avatar

En3D: Generative 3D Humans
#Alibaba unveils En3D: a zero-shot generative scheme for sculpting HQ 3D human avatars, capable of producing visually realistic, geometrically accurate and content-wise diverse 3D humans without relying on pre-existing 3D or 2D assets.
Review https://t.ly/nGmDK
Project menyifang.github.io/projects/En3D/index.html
Paper https://arxiv.org/pdf/2401.01173.pdf
Repo (soon?) https://github.com/menyifang/En3D

MagicVideo-V2 announced!
#Bytedance announces a novel multi-stage pipeline capable of generating high-aesthetic videos from textual descriptions.
Review https://t.ly/zIq4v
Project https://lnkd.in/dKUrJPJd
Paper https://lnkd.in/dixnN-kU

#6D Foundation Pose
#Nvidia unveils FoundationPose, a novel (and unified) foundation model for 6D object pose estimation and tracking.
Review https://t.ly/HGd4h
Project https://lnkd.in/dPcnBKWm
Paper https://lnkd.in/dixn_iHZ
Code coming

ReplaceAnything: demo is out!
ReplaceAnything: ultra-high quality content replacement. The ultimate #AI solution for human, clothing & background replacement to change the e-commerce experience for vendors.
Review https://t.ly/FMyvf
Project https://lnkd.in/dcyZvP2b
ModelScope https://lnkd.in/dU4x4nE6
Hugging Face https://lnkd.in/dn3uXWgd
Empty report https://lnkd.in/dcuGXd6c
Paper coming?

Transparent Object Tracking
Trans2k: a transparent object tracking dataset of 2,000+ sequences with 100,000+ images, annotated with bounding boxes & segmentation masks.
Review https://t.ly/mEI6O
Paper https://lnkd.in/dsudY3DB
Project https://lnkd.in/d48SSJJ3
TOB https://lnkd.in/dykBUNfC

AGNOSTIC Object Counting
PseCo combines SAM, to segment all possible objects as mask proposals, with CLIP, to classify the proposals and obtain accurate object counts. The new SOTA in both few-shot/zero-shot object counting/detection.
Review https://t.ly/e4iza
Paper https://lnkd.in/dbzMXKWG
Repo https://lnkd.in/db9Q9Pse

Announcing #Py4AI Conference
Super proud to unveil #Py4AI, the newest conference dedicated to exploring the depths of Python & AI. Py4AI is a 1-day free event for Python and Artificial Intelligence developers.
The first batch of speakers:
Merve Noyan | #HuggingFace
Gabriele Lombardi | ARGO Vision
Amanda Cercas Curry | Uni. Bocconi
Piero Savastano | Cheshire Cat AI
Francesco Zuppichini | Zurich Insurance
Andrea Palladino, PhD | Sr. Data Scientist
More: https://www.linkedin.com/posts/visionarynet_py4ai-py4ai-python-activity-7152928716988243968-pOUn?utm_source=share&utm_medium=member_desktop
LinkedIn | Alessandro Ferrari
Event details: 16th March 2024…

Timeline Text-Driven Humans
Novel challenge: timeline control for text-driven motion synthesis of 3D humans.
Review https://t.ly/HLm-N
Paper https://lnkd.in/esaR_M_9
Project https://lnkd.in/epCZDvFW
Repo coming
AI with Papers - Artificial Intelligence & Deep Learning
Amodal Tracking Any Object: "Amodal tracking": inferring complete object boundaries, even when certain portions are occluded. New benchmark & approach, 2x better than SOTA in people tracking. Review https://t.ly/Rc6Ku Paper https://lnkd.in/d39rFYT4…
Code is out!
Check the comments for the links ;)