AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
🚿 Event-Cam (1000 fps) Hands 🚿

👉Ev2Hands, the first method for the 3D reconstruction of two interacting hands from a single event camera. Code available.

👉Review https://t.ly/YpQpX
👉Paper arxiv.org/pdf/2312.14157.pdf
👉Project 4dqv.mpi-inf.mpg.de/Ev2Hands
👉Repo github.com/Chris10M/Ev2Hands
🎄UniSDF: Unifying Neural Representations🎄

👉UniSDF: a novel general-purpose 3D reconstruction method for large, complex scenes with reflections. SOTA on the DTU, Shiny Blender, Mip-NeRF 360 and Ref-NeRF datasets.

👉Review https://t.ly/2QEul
👉Paper https://arxiv.org/pdf/2312.13285.pdf
👉Project https://fangjinhuawang.github.io/UniSDF/
👉Repo: No code :(
🪮HAAR: Text-Driven Generative Hairstyles🪮

👉 HAAR: a new strand-based generative model for #3D human hairstyles driven by textual input.

👉Review https://t.ly/L38iD
👉Project https://haar.is.tue.mpg.de/
👉Paper https://arxiv.org/pdf/2312.11666.pdf
👉Repo coming
🪲UniRef++: Segment Every Reference🪲

👉 UniRef++ is a unified model for RIS, FSS, RVOS & VOS. Code available!

👉Review https://t.ly/OxtOx
👉Paper https://lnkd.in/eTrmDTK3
👉Repo https://lnkd.in/etfTm4Wq
🈚 Seeing Through Occlusions 🈚

👉Novel NSF for seeing through occlusions, suppressing reflections & removing shadows.

👉Review https://t.ly/5jcIG
👉Project https://light.princeton.edu/publication/nsf
👉Paper https://arxiv.org/pdf/2312.14235.pdf
👉Repo https://github.com/princeton-computational-imaging/NSF
👻 Avatar Behind Occlusions 👻

👉Neural rendering for occluded in-the-wild mono-videos. Decouples each scene into occlusion, human, and background.

👉Review https://t.ly/8q__B
👉Paper https://arxiv.org/pdf/2401.00431.pdf
👉Project https://cs.stanford.edu/~xtiange/projects/wild2avatar
En3D: Generative 3D Humans

👉#Alibaba unveils En3D: a generative scheme for sculpting HQ 3D human avatars. A zero-shot 3D generative scheme capable of producing visually realistic, geometrically accurate and content-wise diverse 3D humans without relying on pre-existing 3D or 2D assets.

👉Review https://t.ly/nGmDK
👉Project menyifang.github.io/projects/En3D/index.html
👉Paper https://arxiv.org/pdf/2401.01173.pdf
👉Repo (soon?) https://github.com/menyifang/En3D
๐Ÿค MagicVideo-V2 announced! ๐Ÿค

๐Ÿ‘‰#Bytedance announces a novel multi-stage pipeline capable of generating high-aesthetic videos from textual description

๐Ÿ‘‰Review https://t.ly/zIq4v
๐Ÿ‘‰Project https://lnkd.in/dKUrJPJd
๐Ÿ‘‰Paper https://lnkd.in/dixnN-kU
🔥 #6D Foundation Pose 🔥

👉#Nvidia unveils FoundationPose, a novel (and unified) foundation model for 6D object pose estimation and tracking.

👉Review https://t.ly/HGd4h
👉Project https://lnkd.in/dPcnBKWm
👉Paper https://lnkd.in/dixn_iHZ
👉Code coming 🩷
ReplaceAnything: demo is out!

👉ReplaceAnything: ultra-high quality content replacement. The ultimate #AI solution for human, clothing & background replacement to change the e-commerce experience for vendors.

👉Review https://t.ly/FMyvf
👉Project https://lnkd.in/dcyZvP2b
👉ModelScope https://lnkd.in/dU4x4nE6
👉Hugging Face https://lnkd.in/dn3uXWgd
👉Empty report https://lnkd.in/dcuGXd6c
👉Paper coming?
🥛 Transparent Object Tracking 🥛

👉Trans2k: a transparent object tracking dataset of 2,000+ sequences with 100,000+ images, annotated with bounding boxes & segmentation masks.

👉Review https://t.ly/mEI6O
👉Paper https://lnkd.in/dsudY3DB
👉Project https://lnkd.in/d48SSJJ3
👉TOB https://lnkd.in/dykBUNfC
💊💊 AGNOSTIC Object Counting 💊💊

👉PseCo combines SAM, which segments all possible objects as mask proposals, with CLIP, which classifies the proposals, to obtain accurate object counts. The new SOTA in few-shot/zero-shot object counting and detection.

👉Review https://t.ly/e4iza
👉Paper https://lnkd.in/dbzMXKWG
👉Repo https://lnkd.in/db9Q9Pse
💥 Announcing #Py4AI Conference 💥

👉 Super proud to unveil #Py4AI, the newest conference dedicated to exploring the depths of Python & AI. Py4AI is a free 1-day event for Python and Artificial Intelligence developers.

The first batch of speakers:
🚀Merve Noyan | #HuggingFace 🤗
🚀Gabriele Lombardi | ARGO Vision
🚀Amanda Cercas Curry | Uni. Bocconi
🚀Piero Savastano | Cheshire Cat AI
🚀Francesco Zuppichini | Zurich Insurance
🚀Andrea Palladino, PhD | Sr. Data Scientist

👉 More: https://www.linkedin.com/posts/visionarynet_py4ai-py4ai-python-activity-7152928716988243968-pOUn?utm_source=share&utm_medium=member_desktop
💃Timeline Text-Driven Humans💃

👉Novel challenge: timeline control for text-driven motion synthesis of 3D Humans.

👉Review https://t.ly/HLm-N
👉Paper https://lnkd.in/esaR_M_9
👉Project https://lnkd.in/epCZDvFW
👉Repo coming
🫒 AlphaGeometry: Olympiad-level AI 🫒

👉 Theorem prover for Euclidean plane geometry that sidesteps the need for human demonstrations by synthesizing millions of theorems and proofs across different levels of complexity 🤯

👉Review https://t.ly/2-Z7C
👉Paper https://lnkd.in/g3QkqwCE
👉Blog https://lnkd.in/ge-mpM7q
👉Repo https://lnkd.in/gHjwks_9
🦠 XINC: Pixels to Neurons 🦠

👉eXplaining the Implicit Neural Canvas (XINC), from the University of Maryland, is a unified framework for explaining properties of INRs by examining the strength of each neuron's contribution to each output pixel.

👉Review https://t.ly/wwAmz
👉Paper arxiv.org/pdf/2401.10217.pdf
👉Project namithap10.github.io/xinc
👉Repo github.com/namithap10/xinc
👽 One Model <-> All Segmentations 👽

👉 10+ different segmentation tasks in one framework, including image-level, video-level, interactive segmentation, & open-vocabulary segmentation. All in one!

👉Review https://t.ly/fywVz
👉Paper https://lnkd.in/dw3S4B74
👉Project https://lnkd.in/dzHT9v45
👉Repo https://lnkd.in/d6fDCnSp
😻 GARField: Group Anything 😻

👉 GARField is a novel approach for decomposing #3D scenes into a hierarchy of semantically meaningful groups from posed image inputs.

👉Review https://t.ly/6Hkeq
👉Paper https://lnkd.in/d28mfRcZ
👉Project https://lnkd.in/dzYdRNKy
👉Repo (coming) https://lnkd.in/d2VeRJCS
🔥 Depth Anything: new SOTA 🔥

👉Depth Anything: the new SOTA in monocular depth estimation (MDE), trained jointly on 1.5M labeled images and 62M+ unlabeled images.

👉Review https://t.ly/tCBwO
👉Paper https://lnkd.in/djx-9k2J
👉Project https://lnkd.in/dYetqZFa
👉Repo https://lnkd.in/d87CrUGv
👉Demo🤗 https://lnkd.in/dJhvKBep