AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
🪼PatchFusion: SOTA Mono-Depth🪼

👉PatchFusion: a novel end-to-end tile-based framework for hi-res monocular metric depth estimation. It's the new SOTA in metric depth estimation from mono. Code & Demo available on Hugging Face 🔥

👉Review https://t.ly/hv3yT
👉Paper https://lnkd.in/d9dXP7iP
👉Project https://lnkd.in/dQcvVJSx
👉Repo https://lnkd.in/dW2GdVR5
👉Demo https://lnkd.in/dFW-gAiY
💃Outfit Anyone: Ultra-HQ VTO💃

👉Alibaba unveils Outfit Anyone: a two-stream conditional diffusion model that adeptly handles garment deformation for more lifelike results in VTO. Extra: Outfit Anyone + Animate Anyone for outfit + motion generation of any character. NO CODE / NO PAPER / DEMO AVAILABLE :)

👉Review https://t.ly/o6UR9
👉Demo https://lnkd.in/dpQYdXhc
👉Repo (empty) https://lnkd.in/dBsNST6r
🔥 #AIwithPapers: we are 8k+ 🔥

👉 After flirting with #ChatGpt for months, you're back in love with this channel. I felt bad, but I forgive you 🧡

😈 Hey Telegram Premium Subscribers, what about boosting us? Click: https://t.me/AI_DeepLearning?boost

😈 Invite -> https://t.me/AI_DeepLearning
🧊 Depth Conditioning 🧊

👉LooseControl controls the generative image modeling process: layout via boundaries and #3D box control via object locations (approximate bounding boxes).

👉Review https://t.ly/9y72m
👉Paper https://arxiv.org/pdf/2312.03079.pdf
👉Project https://shariqfarooq123.github.io/loose-control/
👉Repo https://github.com/shariqfarooq123/LooseControl
🖲️ Amodal Tracking Any Object 🖲️

👉"Amodal tracking": inferring complete object boundaries, even when certain portions are occluded. New benchmark & approach, 2x better than SOTA in people tracking 🔥

👉Review https://t.ly/Rc6Ku
👉Paper https://lnkd.in/d39rFYT4
👉Project https://lnkd.in/d7bkEcni
👉Repo (empty) https://lnkd.in/dTsNKdfz
🚿 Event-Cam (1000 fps) Hands 🚿

👉Ev2Hands, the first method for the 3D reconstruction of two interacting hands from a single event camera. Code available.

👉Review https://t.ly/YpQpX
👉Paper arxiv.org/pdf/2312.14157.pdf
👉Project 4dqv.mpi-inf.mpg.de/Ev2Hands
👉Repo github.com/Chris10M/Ev2Hands
🎄UniSDF: Unifying Neural Representations🎄

👉UniSDF: a novel general-purpose 3D reconstruction method for large complex scenes with reflections. SOTA on the DTU, Shiny Blender, Mip-NeRF 360 and Ref-NeRF datasets.

👉Review https://t.ly/2QEul
👉Paper https://arxiv.org/pdf/2312.13285.pdf
👉Project https://fangjinhuawang.github.io/UniSDF/
👉Repo: No code :(
🪮HAAR: Text-Driven Generative Hairstyles🪮

👉 HAAR: a new strand-based generative model for #3D human hairstyles driven by textual input.

👉Review https://t.ly/L38iD
👉Project https://haar.is.tue.mpg.de/
👉Paper https://arxiv.org/pdf/2312.11666.pdf
👉Repo coming
🪲UniRef++: Segment Every Reference🪲

👉 UniRef++ is a unified model for RIS, FSS, RVOS & VOS. Code available!

👉Review https://t.ly/OxtOx
👉Paper https://lnkd.in/eTrmDTK3
👉Repo https://lnkd.in/etfTm4Wq
🈚 Seeing Through Occlusions 🈚

👉Novel NSF (Neural Spline Fields) for seeing through occlusions, reflection suppression & shadow removal.

👉Review https://t.ly/5jcIG
👉Project https://light.princeton.edu/publication/nsf
👉Paper https://arxiv.org/pdf/2312.14235.pdf
👉Repo https://github.com/princeton-computational-imaging/NSF
👻 Avatar Behind Occlusions 👻

👉Neural rendering for occluded in-the-wild mono-videos. Decouples scenes into occlusion, human, and background.

👉Review https://t.ly/8q__B
👉Paper https://arxiv.org/pdf/2401.00431.pdf
👉Project https://cs.stanford.edu/~xtiange/projects/wild2avatar
En3D: Generative 3D Humans

👉#Alibaba unveils En3D: a zero-shot generative scheme for sculpting HQ 3D human avatars, capable of producing visually realistic, geometrically accurate and content-wise diverse 3D humans without relying on pre-existing 3D or 2D assets.

👉Review https://t.ly/nGmDK
👉Project menyifang.github.io/projects/En3D/index.html
👉Paper https://arxiv.org/pdf/2401.01173.pdf
👉Repo (soon?) https://github.com/menyifang/En3D
MagicVideo-V2 announced!

👉#Bytedance announces a novel multi-stage pipeline capable of generating high-aesthetic videos from textual descriptions.

👉Review https://t.ly/zIq4v
👉Project https://lnkd.in/dKUrJPJd
👉Paper https://lnkd.in/dixnN-kU
🔥 #6D Foundation Pose 🔥

👉#Nvidia unveils FoundationPose, a novel (and unified) foundation model for 6D object pose estimation and tracking.

👉Review https://t.ly/HGd4h
👉Project https://lnkd.in/dPcnBKWm
👉Paper https://lnkd.in/dixn_iHZ
👉Code coming 🩷
ReplaceAnything: demo is out!

👉ReplaceAnything: ultra-high quality content replacement. The ultimate #AI solution for human, clothing & background replacement to change the e-commerce experience for vendors.

👉Review https://t.ly/FMyvf
👉Project https://lnkd.in/dcyZvP2b
👉ModelScope https://lnkd.in/dU4x4nE6
👉Hugging Face https://lnkd.in/dn3uXWgd
👉Empty report https://lnkd.in/dcuGXd6c
👉Paper coming?
🥛 Transparent Object Tracking 🥛

👉Trans2k: a transparent object tracking dataset of 2,000+ sequences with 100,000+ images, annotated with bounding boxes & segmentation masks.

👉Review https://t.ly/mEI6O
👉Paper https://lnkd.in/dsudY3DB
👉Project https://lnkd.in/d48SSJJ3
👉TOB https://lnkd.in/dykBUNfC
💊💊 AGNOSTIC Object Counting 💊💊

👉PseCo combines SAM, to segment all possible objects as mask proposals, with CLIP, to classify the proposals and obtain accurate object counts. The new SOTA in both few-shot/zero-shot object counting/detection.

👉Review https://t.ly/e4iza
👉Paper https://lnkd.in/dbzMXKWG
👉Repo https://lnkd.in/db9Q9Pse
💥 Announcing #Py4AI Conference 💥

👉 Super proud to unveil #Py4AI, the newest conference dedicated to exploring the depths of Python & AI. Py4AI is a 1-day free event for Python and Artificial Intelligence developers.

The first batch of speakers:
🚀Merve Noyan | #HuggingFace 🤗
🚀Gabriele Lombardi | ARGO Vision
🚀Amanda Cercas Curry | Uni. Bocconi
🚀Piero Savastano | Cheshire Cat AI
🚀Francesco Zuppichini | Zurich Insurance
🚀Andrea Palladino, PhD | Sr. Data Scientist

👉 More: https://www.linkedin.com/posts/visionarynet_py4ai-py4ai-python-activity-7152928716988243968-pOUn?utm_source=share&utm_medium=member_desktop
💃Timeline Text-Driven Humans💃

👉Novel challenge: timeline control for text-driven motion synthesis of 3D Humans.

👉Review https://t.ly/HLm-N
👉Paper https://lnkd.in/esaR_M_9
👉Project https://lnkd.in/epCZDvFW
👉Repo coming