This media is not supported in your browser
VIEW IN TELEGRAM
๐ฟ Event-Cam (1000 fps) Hands ๐ฟ
๐Ev2Hands, the first method for the 3D reconstruction of two interacting hands from a single event camera. Code available.
๐Review https://t.ly/YpQpX
๐Paper arxiv.org/pdf/2312.14157.pdf
๐Project 4dqv.mpi-inf.mpg.de/Ev2Hands
๐Repo github.com/Chris10M/Ev2Hands
๐Ev2Hands, the first method for the 3D reconstruction of two interacting hands from a single event camera. Code available.
๐Review https://t.ly/YpQpX
๐Paper arxiv.org/pdf/2312.14157.pdf
๐Project 4dqv.mpi-inf.mpg.de/Ev2Hands
๐Repo github.com/Chris10M/Ev2Hands
๐ฅ3โค2๐2๐1
This media is not supported in your browser
VIEW IN TELEGRAM
๐UniSDF: Unifying Neural Representations๐
๐UniSDF: novel general purpose 3D reconstruction for large complex scenes with reflections. SOTA on DTU, Shiny Blender, Mip-NeRF 360 and Ref-NeRF dataset.
๐Review https://t.ly/2QEul
๐Paper https://arxiv.org/pdf/2312.13285.pdf
๐Project https://fangjinhuawang.github.io/UniSDF/
๐Repo: No code :(
๐UniSDF: novel general purpose 3D reconstruction for large complex scenes with reflections. SOTA on DTU, Shiny Blender, Mip-NeRF 360 and Ref-NeRF dataset.
๐Review https://t.ly/2QEul
๐Paper https://arxiv.org/pdf/2312.13285.pdf
๐Project https://fangjinhuawang.github.io/UniSDF/
๐Repo: No code :(
๐ฅ7๐2โค1๐ฅฐ1๐คฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ชฎHAAR: Text-Driven Generative Hairstyles๐ชฎ
๐ HAAR: new strand-based generative model for #3D human hairstyles driven by textual input.
๐Review https://t.ly/L38iD
๐Project https://haar.is.tue.mpg.de/
๐Paper https://arxiv.org/pdf/2312.11666.pdf
๐Repo coming
๐ HAAR: new strand-based generative model for #3D human hairstyles driven by textual input.
๐Review https://t.ly/L38iD
๐Project https://haar.is.tue.mpg.de/
๐Paper https://arxiv.org/pdf/2312.11666.pdf
๐Repo coming
๐คฏ4๐พ3๐2๐ฅ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ชฒUniRef++: Segment Every Reference๐ชฒ
๐ UniRef++ is a unified model for RIS, FSS, RVOS & VOS. Code available!
๐Review https://t.ly/OxtOx
๐Paper https://lnkd.in/eTrmDTK3
๐Repo https://lnkd.in/etfTm4Wq
๐ UniRef++ is a unified model for RIS, FSS, RVOS & VOS. Code available!
๐Review https://t.ly/OxtOx
๐Paper https://lnkd.in/eTrmDTK3
๐Repo https://lnkd.in/etfTm4Wq
๐11โค3๐คฏ3โก1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ Seeing Through Occlusions ๐
๐Novel NSF to see through occlusions, reflection suppression & shadow removal.
๐Review https://t.ly/5jcIG
๐Project https://light.princeton.edu/publication/nsf
๐Paper https://arxiv.org/pdf/2312.14235.pdf
๐Repo https://github.com/princeton-computational-imaging/NSF
๐Novel NSF to see through occlusions, reflection suppression & shadow removal.
๐Review https://t.ly/5jcIG
๐Project https://light.princeton.edu/publication/nsf
๐Paper https://arxiv.org/pdf/2312.14235.pdf
๐Repo https://github.com/princeton-computational-imaging/NSF
โค10๐คฏ7๐ฅ3๐พ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ป Avatar Behind Occlusions ๐ป
๐Neural rendering for occluded in-the-wild mono-videos. Decoupling scenes in occlusion, human, and background.
๐Review https://t.ly/8q__B
๐Paper https://arxiv.org/pdf/2401.00431.pdf
๐Project https://cs.stanford.edu/~xtiange/projects/wild2avatar
๐Neural rendering for occluded in-the-wild mono-videos. Decoupling scenes in occlusion, human, and background.
๐Review https://t.ly/8q__B
๐Paper https://arxiv.org/pdf/2401.00431.pdf
๐Project https://cs.stanford.edu/~xtiange/projects/wild2avatar
๐ฅ11โค3๐1๐คฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ En3D: Generative 3D Humans ๐
๐#Alibaba unveils En3D: generative scheme for sculpting HQ 3D human avatars. Zero-shot 3D generative scheme capable of producing visually realistic, geometrically accurate and content-wise diverse 3D humans without relying on pre-existing 3D or 2D asset.
๐Review https://t.ly/nGmDK
๐Project menyifang.github.io/projects/En3D/index.html
๐Paper https://arxiv.org/pdf/2401.01173.pdf
๐Repo (soon?) https://github.com/menyifang/En3D
๐#Alibaba unveils En3D: generative scheme for sculpting HQ 3D human avatars. Zero-shot 3D generative scheme capable of producing visually realistic, geometrically accurate and content-wise diverse 3D humans without relying on pre-existing 3D or 2D asset.
๐Review https://t.ly/nGmDK
๐Project menyifang.github.io/projects/En3D/index.html
๐Paper https://arxiv.org/pdf/2401.01173.pdf
๐Repo (soon?) https://github.com/menyifang/En3D
๐คฏ5โค3๐ฅ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ค MagicVideo-V2 announced! ๐ค
๐#Bytedance announces a novel multi-stage pipeline capable of generating high-aesthetic videos from textual description
๐Review https://t.ly/zIq4v
๐Project https://lnkd.in/dKUrJPJd
๐Paper https://lnkd.in/dixnN-kU
๐#Bytedance announces a novel multi-stage pipeline capable of generating high-aesthetic videos from textual description
๐Review https://t.ly/zIq4v
๐Project https://lnkd.in/dKUrJPJd
๐Paper https://lnkd.in/dixnN-kU
๐ฅ7โค1๐1๐ฅฐ1๐ฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ฅ #6D Foundation Pose ๐ฅ
๐#Nvidia unveils FoundationPose, a novel (and unified) foundation model for 6D object pose estimation and tracking.
๐Review https://t.ly/HGd4h
๐Project https://lnkd.in/dPcnBKWm
๐Paper https://lnkd.in/dixn_iHZ
๐Code coming ๐ฉท
๐#Nvidia unveils FoundationPose, a novel (and unified) foundation model for 6D object pose estimation and tracking.
๐Review https://t.ly/HGd4h
๐Project https://lnkd.in/dPcnBKWm
๐Paper https://lnkd.in/dixn_iHZ
๐Code coming ๐ฉท
๐ฅ12โค5๐1๐คฏ1
๐ReplaceAnything: demo is out!๐
๐ReplaceAnything: ultra-high quality content replacement. The ultimate #AI solution for human, clothing & background replacement to change the e-commerce experience for vendors.
๐Review https://t.ly/FMyvf
๐Project https://lnkd.in/dcyZvP2b
๐ModelScope https://lnkd.in/dU4x4nE6
๐Hugging Face https://lnkd.in/dn3uXWgd
๐Empty report https://lnkd.in/dcuGXd6c
๐Paper coming?
๐ReplaceAnything: ultra-high quality content replacement. The ultimate #AI solution for human, clothing & background replacement to change the e-commerce experience for vendors.
๐Review https://t.ly/FMyvf
๐Project https://lnkd.in/dcyZvP2b
๐ModelScope https://lnkd.in/dU4x4nE6
๐Hugging Face https://lnkd.in/dn3uXWgd
๐Empty report https://lnkd.in/dcuGXd6c
๐Paper coming?
โค11๐3๐2๐1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ฅ Transparent Object Tracking ๐ฅ
๐Trans2k: transparent object tracking dataset of 2,000+ sequences with 100,000+ images, annotated by bounding boxes & segmentation mask.
๐Review https://t.ly/mEI6O
๐Paper https://lnkd.in/dsudY3DB
๐Project https://lnkd.in/d48SSJJ3
๐TOB https://lnkd.in/dykBUNfC
๐Trans2k: transparent object tracking dataset of 2,000+ sequences with 100,000+ images, annotated by bounding boxes & segmentation mask.
๐Review https://t.ly/mEI6O
๐Paper https://lnkd.in/dsudY3DB
๐Project https://lnkd.in/d48SSJJ3
๐TOB https://lnkd.in/dykBUNfC
๐ฅ18๐คฏ7โค3๐2๐ฑ2๐1
๐๐ AGNOSTIC Object Counting ๐๐
๐PseCo: combining SAM to segment all possible objects as mask proposals & CLIP to classify proposals to obtain accurate object counts. The new SOTA in both few-shot/zero-shot object counting/detection.
๐Review https://t.ly/e4iza
๐Paper https://lnkd.in/dbzMXKWG
๐Repo https://lnkd.in/db9Q9Pse
๐PseCo: combining SAM to segment all possible objects as mask proposals & CLIP to classify proposals to obtain accurate object counts. The new SOTA in both few-shot/zero-shot object counting/detection.
๐Review https://t.ly/e4iza
๐Paper https://lnkd.in/dbzMXKWG
๐Repo https://lnkd.in/db9Q9Pse
๐ฅ17๐5๐ฅฐ1๐1
๐ฅ Announcing #Py4Ai Conference๐ฅ
๐ Super proud to unveil #Py4AI, the newest conference dedicated to exploring the depths of Python & AI. Py4AI is a 1-day free event for Python and Artificial Intelligence developers.
๐๐ก๐ ๐๐ข๐ซ๐ฌ๐ญ ๐๐๐ญ๐๐ก ๐จ๐ ๐ฌ๐ฉ๐๐๐ค๐๐ซ๐ฌ:
๐Merve Noyan | #HuggingFace ๐ค
๐Gabriele Lombardi | ARGO Vision
๐Amanda Cercas Curry | Uni. Bocconi
๐Piero Savastano | Cheshire Cat AI
๐Francesco Zuppichini | Zurich Insurance
๐Andrea Palladino, PhD | Sr. Data Scientist
๐ More: https://www.linkedin.com/posts/visionarynet_py4ai-py4ai-python-activity-7152928716988243968-pOUn?utm_source=share&utm_medium=member_desktop
๐ Super proud to unveil #Py4AI, the newest conference dedicated to exploring the depths of Python & AI. Py4AI is a 1-day free event for Python and Artificial Intelligence developers.
๐๐ก๐ ๐๐ข๐ซ๐ฌ๐ญ ๐๐๐ญ๐๐ก ๐จ๐ ๐ฌ๐ฉ๐๐๐ค๐๐ซ๐ฌ:
๐Merve Noyan | #HuggingFace ๐ค
๐Gabriele Lombardi | ARGO Vision
๐Amanda Cercas Curry | Uni. Bocconi
๐Piero Savastano | Cheshire Cat AI
๐Francesco Zuppichini | Zurich Insurance
๐Andrea Palladino, PhD | Sr. Data Scientist
๐ More: https://www.linkedin.com/posts/visionarynet_py4ai-py4ai-python-activity-7152928716988243968-pOUn?utm_source=share&utm_medium=member_desktop
Linkedin
๐ฅBOOOM! | Alessandro Ferrari
๐ฅBOOOM! Announcing #Py4AI Conference๐ฅ
๐ Super proud to unveil #Py4AI, the newest conference dedicated to exploring the depths of Python & AI. Py4AI is a 1-day free event for Python and Artificial Intelligence developers.
๐๐ฏ๐๐ง๐ญ ๐๐๐ญ๐๐ข๐ฅ๐ฌ:
โ 16th March 2024โฆ
๐ Super proud to unveil #Py4AI, the newest conference dedicated to exploring the depths of Python & AI. Py4AI is a 1-day free event for Python and Artificial Intelligence developers.
๐๐ฏ๐๐ง๐ญ ๐๐๐ญ๐๐ข๐ฅ๐ฌ:
โ 16th March 2024โฆ
๐10๐2โค1๐ฅฐ1๐คฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐Timeline Text-Driven Humans๐
๐Novel challenge: timeline control for text-driven motion synthesis of 3D Humans.
๐Review https://t.ly/HLm-N
๐Paper https://lnkd.in/esaR_M_9
๐Project https://lnkd.in/epCZDvFW
๐Repo coming
๐Novel challenge: timeline control for text-driven motion synthesis of 3D Humans.
๐Review https://t.ly/HLm-N
๐Paper https://lnkd.in/esaR_M_9
๐Project https://lnkd.in/epCZDvFW
๐Repo coming
๐ฅ13โค6๐4๐3๐คฉ1
AI with Papers - Artificial Intelligence & Deep Learning
๐ฒ๏ธ Amodal Tracking Any Object ๐ฒ๏ธ ๐Amodal tracking": inferring complete object boundaries, even when certain portions are occluded. New benchmark & approach, 2x better than SOTA in people tracking ๐ฅ ๐Review https://t.ly/Rc6Ku ๐Paper https://lnkd.in/d39rFYT4โฆ
๐ฅ๐ฅ Code is out ๐ฅ๐ฅ
Check the comments for the links ;)
Check the comments for the links ;)
๐ซ AlphaGeometry: Olympiad-level AI ๐ซ
๐ Theorem prover for Euclidean plane geometry that sidesteps the need for human demonstrations by
synthesizing millions of theorems and proofs across different levels of complexity ๐คฏ
๐Review https://t.ly/2-Z7C
๐Paper https://lnkd.in/g3QkqwCE
๐Blog https://lnkd.in/ge-mpM7q
๐Repo https://lnkd.in/gHjwks_9
๐ Theorem prover for Euclidean plane geometry that sidesteps the need for human demonstrations by
synthesizing millions of theorems and proofs across different levels of complexity ๐คฏ
๐Review https://t.ly/2-Z7C
๐Paper https://lnkd.in/g3QkqwCE
๐Blog https://lnkd.in/ge-mpM7q
๐Repo https://lnkd.in/gHjwks_9
๐คฏ20๐3๐ฅฐ2๐คฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ฆ XINC: Pixels to Neurons ๐ฆ
๐eXplaining the Implicit Neural Canvas (XINC) from the University of Maryland, is a unified framework for explaining properties of INRs by examining the strength of each neuronโs contribution to each output pixel
๐Review https://t.ly/wwAmz
๐Paper arxiv.org/pdf/2401.10217.pdf
๐Project namithap10.github.io/xinc
๐Repo github.com/namithap10/xinc
๐eXplaining the Implicit Neural Canvas (XINC) from the University of Maryland, is a unified framework for explaining properties of INRs by examining the strength of each neuronโs contribution to each output pixel
๐Review https://t.ly/wwAmz
๐Paper arxiv.org/pdf/2401.10217.pdf
๐Project namithap10.github.io/xinc
๐Repo github.com/namithap10/xinc
๐คฏ9๐3๐2๐ฅ1
๐ฝ One Model <-> All Segmentations ๐ฝ
๐ 10+ different segmentation tasks in one framework, including image-level, video-level, interactive segmentation, & open-vocabulary segmentation. All in one!
๐Review https://t.ly/fywVz
๐Paper https://lnkd.in/dw3S4B74
๐Project https://lnkd.in/dzHT9v45
๐Repo https://lnkd.in/d6fDCnSp
๐ 10+ different segmentation tasks in one framework, including image-level, video-level, interactive segmentation, & open-vocabulary segmentation. All in one!
๐Review https://t.ly/fywVz
๐Paper https://lnkd.in/dw3S4B74
๐Project https://lnkd.in/dzHT9v45
๐Repo https://lnkd.in/d6fDCnSp
๐ฅ17๐5โค2๐ฅฐ1๐พ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ป GARField: Group Anything ๐ป
๐ GARField is a novel approach for decomposing #3D scenes into a hierarchy of semantically meaningful groups from posed image inputs.
๐Review https://t.ly/6Hkeq
๐Paper https://lnkd.in/d28mfRcZ
๐Project https://lnkd.in/dzYdRNKy
๐Repo (coming) https://lnkd.in/d2VeRJCS
๐ GARField is a novel approach for decomposing #3D scenes into a hierarchy of semantically meaningful groups from posed image inputs.
๐Review https://t.ly/6Hkeq
๐Paper https://lnkd.in/d28mfRcZ
๐Project https://lnkd.in/dzYdRNKy
๐Repo (coming) https://lnkd.in/d2VeRJCS
๐8โค3๐ฅฐ1๐คฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ฅ Depth Anything: new SOTA ๐ฅ
๐Depth Anything: the new SOTA in monocular depth estimation (MDE), trained with 1.5M labeled images and 62M+ unlabeled images jointly. It's the new SOTA!
๐Review https://t.ly/tCBwO
๐Paper https://lnkd.in/djx-9k2J
๐Project https://lnkd.in/dYetqZFa
๐Repo https://lnkd.in/d87CrUGv
๐Demo๐ค https://lnkd.in/dJhvKBep
๐Depth Anything: the new SOTA in monocular depth estimation (MDE), trained with 1.5M labeled images and 62M+ unlabeled images jointly. It's the new SOTA!
๐Review https://t.ly/tCBwO
๐Paper https://lnkd.in/djx-9k2J
๐Project https://lnkd.in/dYetqZFa
๐Repo https://lnkd.in/d87CrUGv
๐Demo๐ค https://lnkd.in/dJhvKBep
๐ฅ17โค3๐ฅฐ2๐คฉ2