AI with Papers - Artificial Intelligence & Deep Learning

🪼PatchFusion: SOTA Mono-Depth🪼

👉PatchFusion: novel end-to-end tile-based framework for hi-res monocular metric depth estimation. It's the new SOTA in metric depth estimation from mono. Code & demo available on Hugging Face 🔥

👉Review https://t.ly/hv3yT
👉Paper https://lnkd.in/d9dXP7iP
👉Project https://lnkd.in/dQcvVJSx
👉Repo https://lnkd.in/dW2GdVR5
👉Demo https://lnkd.in/dFW-gAiY
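
To make the "tile-based" idea concrete, here is a minimal sketch of how a hi-res image could be split into overlapping tiles before running a per-tile depth model. This is not PatchFusion's actual code: the `tile_boxes` helper, the tile size and the stride are all assumptions for illustration only, and the fusion step is left out.

```python
def tile_boxes(width, height, tile, stride):
    """Return (left, top, right, bottom) boxes that cover a width x height image.

    Hypothetical helper: tiles overlap whenever stride < tile, and the last
    tile on each axis is shifted back so it ends exactly at the image border.
    """
    if tile <= 0 or stride <= 0:
        raise ValueError("tile and stride must be positive")

    def starts(size):
        # A single tile is enough when the image is not larger than the tile.
        if size <= tile:
            return [0]
        last = size - tile
        positions = list(range(0, last, stride))
        positions.append(last)  # make sure the border is covered
        return positions

    return [
        (x, y, min(x + tile, width), min(y + tile, height))
        for y in starts(height)
        for x in starts(width)
    ]


boxes = tile_boxes(width=1024, height=512, tile=512, stride=256)
for box in boxes:
    print(box)
```

Each box would be cropped, passed to a depth predictor, and the overlapping predictions blended back into one map; how that blending is done is exactly where a method like PatchFusion makes its contribution.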

🔥10❤5👏1🤯1😱1

7.14K views · edited 13:10

💃Outfit Anyone: Ultra-HQ VTO💃

👉Alibaba unveils Outfit Anyone: a two-stream conditional diffusion model that handles garment deformation for more lifelike results in VTO (virtual try-on). Extra: Outfit Anyone + Animate Anyone for outfit + motion generation of any character. NO CODE / NO PAPER / DEMO AVAILABLE :)

👉Review https://t.ly/o6UR9
👉Demo https://lnkd.in/dpQYdXhc
👉Repo (empty) https://lnkd.in/dBsNST6r

🤯10👍4❤3🔥2

7.3K views · 14:07

🔥 #AIwithPapers: we are 8k+ 🔥

👉 After flirting with #ChatGpt for months, you're back in love with this channel. I felt bad, but I forgive you 🧡

😈 Hey Telegram Premium Subscribers, what about boosting us? Click: https://t.me/AI_DeepLearning?boost

😈 Invite -> https://t.me/AI_DeepLearning

❤16🤣7🔥1🥰1

7.16K views · edited 09:37

🧊 Depth Conditioning 🧊

👉LooseControl: depth conditioning to steer generative image models. Scene layout is specified by boundaries, and #3D box control is specified by object locations (approximate bounding boxes).

👉Review https://t.ly/9y72m
👉Paper https://arxiv.org/pdf/2312.03079.pdf
👉Project https://shariqfarooq123.github.io/loose-control/
👉Repo https://github.com/shariqfarooq123/LooseControl

🔥14❤6🤯4👍1🥰1

6.74K views · edited 15:31

🖲️ Amodal Tracking Any Object 🖲️

👉"Amodal tracking": inferring complete object boundaries, even when certain portions are occluded. New benchmark & approach, 2x better than SOTA in people tracking 🔥

👉Review https://t.ly/Rc6Ku
👉Paper https://lnkd.in/d39rFYT4
👉Project https://lnkd.in/d7bkEcni
👉Repo (empty) https://lnkd.in/dTsNKdfz

❤16🤯8🔥3👍2👏1😱1

7.53K views · 09:28

🚿 Event-Cam (1000 fps) Hands 🚿

👉Ev2Hands, the first method for the 3D reconstruction of two interacting hands from a single event camera. Code available.

👉Review https://t.ly/YpQpX
👉Paper arxiv.org/pdf/2312.14157.pdf
👉Project 4dqv.mpi-inf.mpg.de/Ev2Hands
👉Repo github.com/Chris10M/Ev2Hands

🔥3❤2👍2👏1

8.21K views · edited 09:36

🎄UniSDF: Unifying Neural Representations🎄

👉UniSDF: novel general-purpose 3D reconstruction for large complex scenes with reflections. SOTA on the DTU, Shiny Blender, Mip-NeRF 360 and Ref-NeRF datasets.

👉Review https://t.ly/2QEul
👉Paper https://arxiv.org/pdf/2312.13285.pdf
👉Project https://fangjinhuawang.github.io/UniSDF/
👉Repo: No code :(

🔥7👍2❤1🥰1🤯1

7.54K views · edited 08:45

🪮HAAR: Text-Driven Generative Hairstyles🪮

👉 HAAR: new strand-based generative model for #3D human hairstyles driven by textual input.

👉Review https://t.ly/L38iD
👉Project https://haar.is.tue.mpg.de/
👉Paper https://arxiv.org/pdf/2312.11666.pdf
👉Repo coming

🤯4🍾3👍2🔥1

7.42K views · 09:01

🪲UniRef++: Segment Every Reference🪲

👉 UniRef++ is a unified model for RIS, FSS, RVOS & VOS. Code available!

👉Review https://t.ly/OxtOx
👉Paper https://lnkd.in/eTrmDTK3
👉Repo https://lnkd.in/etfTm4Wq

👍11❤3🤯3⚡1

7.14K views · 11:02

🈚 Seeing Through Occlusions 🈚

👉Novel NSF (Neural Spline Fields) for seeing through occlusions, suppressing reflections & removing shadows.

👉Review https://t.ly/5jcIG
👉Project https://light.princeton.edu/publication/nsf
👉Paper https://arxiv.org/pdf/2312.14235.pdf
👉Repo https://github.com/princeton-computational-imaging/NSF

❤10🤯7🔥3🍾1

7.66K views · edited 13:13

👻 Avatar Behind Occlusions 👻

👉Neural rendering for occluded in-the-wild mono-videos. Decouples scenes into occlusion, human, and background.

👉Review https://t.ly/8q__B
👉Paper https://arxiv.org/pdf/2401.00431.pdf
👉Project https://cs.stanford.edu/~xtiange/projects/wild2avatar

🔥11❤3👏1🤩1

7.67K views · edited 12:59

🕍 En3D: Generative 3D Humans 🕍

👉#Alibaba unveils En3D: a generative scheme for sculpting HQ 3D human avatars. Zero-shot 3D generative scheme capable of producing visually realistic, geometrically accurate and content-wise diverse 3D humans without relying on pre-existing 3D or 2D assets.

👉Review https://t.ly/nGmDK
👉Project menyifang.github.io/projects/En3D/index.html
👉Paper https://arxiv.org/pdf/2401.01173.pdf
👉Repo (soon?) https://github.com/menyifang/En3D

🤯5❤3🔥1

8.06K views · edited 08:25

🐤 MagicVideo-V2 announced! 🐤

👉#Bytedance announces a novel multi-stage pipeline capable of generating high-aesthetic videos from text descriptions.

👉Review https://t.ly/zIq4v
👉Project https://lnkd.in/dKUrJPJd
👉Paper https://lnkd.in/dixnN-kU

🔥7❤1👍1🥰1💩1

6.42K views · edited 07:48

🔥 #6D Foundation Pose 🔥

👉#Nvidia unveils FoundationPose, a novel (and unified) foundation model for 6D object pose estimation and tracking.

👉Review https://t.ly/HGd4h
👉Project https://lnkd.in/dPcnBKWm
👉Paper https://lnkd.in/dixn_iHZ
👉Code coming 🩷

🔥12❤5👏1🤯1

6.47K views · 12:46

🃏ReplaceAnything: demo is out!🃏

👉ReplaceAnything: ultra-high quality content replacement. The ultimate #AI solution for human, clothing & background replacement to change the e-commerce experience for vendors.

👉Review https://t.ly/FMyvf
👉Project https://lnkd.in/dcyZvP2b
👉ModelScope https://lnkd.in/dU4x4nE6
👉Hugging Face https://lnkd.in/dn3uXWgd
👉Empty report https://lnkd.in/dcuGXd6c
👉Paper coming?

❤11👍3👏2😍1

6.62K views · 09:43

🥛 Transparent Object Tracking 🥛

👉Trans2k: transparent object tracking dataset of 2,000+ sequences with 100,000+ images, annotated with bounding boxes & segmentation masks.

👉Review https://t.ly/mEI6O
👉Paper https://lnkd.in/dsudY3DB
👉Project https://lnkd.in/d48SSJJ3
👉TOB https://lnkd.in/dykBUNfC

🔥18🤯7❤3👍2😱2👏1

6.83K views · 10:55

💊💊 AGNOSTIC Object Counting 💊💊

👉PseCo: combining SAM to segment all possible objects as mask proposals & CLIP to classify proposals to obtain accurate object counts. The new SOTA in both few-shot/zero-shot object counting/detection.

👉Review https://t.ly/e4iza
👉Paper https://lnkd.in/dbzMXKWG
👉Repo https://lnkd.in/db9Q9Pse
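
The "propose, then classify, then count" idea can be sketched in a few lines. This is only an illustration, not PseCo's code: the proposals and their `similarity` scores are made-up stand-ins for what a mask-proposal model (such as SAM) and an image-text classifier (such as CLIP) would produce, and `count_objects` and its threshold are hypothetical.

```python
def count_objects(proposals, score_fn, threshold=0.5):
    """Count the proposals whose classifier score reaches the threshold."""
    kept = [p for p in proposals if score_fn(p) >= threshold]
    return len(kept), kept


# Hypothetical proposals: a box plus a similarity score to the target class.
proposals = [
    {"box": (10, 10, 40, 40), "similarity": 0.91},
    {"box": (50, 12, 80, 44), "similarity": 0.87},
    {"box": (5, 60, 30, 90), "similarity": 0.22},
]

count, kept = count_objects(proposals, lambda p: p["similarity"])
print(count)  # 2
```

In a real pipeline the threshold and any duplicate suppression matter a lot for the final count; here they are reduced to a single cutoff to keep the sketch short.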

🔥17👍5🥰1👏1

7.16K views · edited 08:37

💥 Announcing #Py4AI Conference 💥

👉 Super proud to unveil #Py4AI, the newest conference dedicated to exploring the depths of Python & AI. Py4AI is a 1-day free event for Python and Artificial Intelligence developers.

𝐓𝐡𝐞 𝐟𝐢𝐫𝐬𝐭 𝐛𝐚𝐭𝐜𝐡 𝐨𝐟 𝐬𝐩𝐞𝐚𝐤𝐞𝐫𝐬:
🚀Merve Noyan | #HuggingFace 🤗
🚀Gabriele Lombardi | ARGO Vision
🚀Amanda Cercas Curry | Uni. Bocconi
🚀Piero Savastano | Cheshire Cat AI
🚀Francesco Zuppichini | Zurich Insurance
🚀Andrea Palladino, PhD | Sr. Data Scientist

👉 More: https://www.linkedin.com/posts/visionarynet_py4ai-py4ai-python-activity-7152928716988243968-pOUn?utm_source=share&utm_medium=member_desktop

#py4ai #py4ai #python #ai #telegram #py4ai | Alessandro Ferrari | 26 comments

𝐄𝐯𝐞𝐧𝐭 𝐃𝐞𝐭𝐚𝐢𝐥𝐬:
✅16th March 2024…

👍10👏2❤1🥰1🤯1

6.48K views · edited 08:40

💃Timeline Text-Driven Humans💃

👉Novel challenge: timeline control for text-driven motion synthesis of 3D Humans.

👉Review https://t.ly/HLm-N
👉Paper https://lnkd.in/esaR_M_9
👉Project https://lnkd.in/epCZDvFW
👉Repo coming

🔥13❤6👍4👏3🤩1

6.63K views · 13:01

🖲️ Amodal Tracking Any Object 🖲️ 👉"Amodal tracking": inferring complete object boundaries, even when certain portions are occluded. New benchmark & approach, 2x better than SOTA in people tracking 🔥 👉Review https://t.ly/Rc6Ku 👉Paper https://lnkd.in/d39rFYT4…

🔥🔥 Code is out 🔥🔥

Check the comments for the links ;)

6.42K views · 16:23
