AI with Papers - Artificial Intelligence & Deep Learning – Telegram

AI with Papers - Artificial Intelligence & Deep Learning

@AI_DeepLearning

17.2K subscribers

158 photos

276 videos

14 files

1.45K links

All the AI with papers. Every day fresh updates about #DeepLearning #MachineLearning #LLM & #ComputerVision

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

#AI #chatGPT

Download Telegram

About

Blog

Apps

Platform

AI with Papers - Artificial Intelligence & Deep Learning

17.2K subscribers

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

💥 GaussianGPT 3D GSC💥

👉From TUM, GaussianGPT: transformer-based 3D Gaussians generation via next-token prediction -> full 3D complex indoor scene. Repo announced💙

👉Review https://t.ly/bj-lL
👉Paper arxiv.org/pdf/2603.26661
👉Project nicolasvonluetzow.github.io/GaussianGPT/
👉Repo TBA

🔥8❤2👍1👏1

4.14K viewsedited 07:03

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

👌HandX: Scaling Hands Motion👌

👉 HandX is a unified foundation spanning data, annotation, and evaluation: novel large-scale dataset of bimanual & dexterous motions with fine-grained textual. Around 6M frames. Repo available💙

👉Review https://t.ly/1nGxw
👉Paper https://arxiv.org/pdf/2603.28766
👉Project https://handx-project.github.io/
👉Repo github.com/handx-project/HandX

🔥9❤2👏1

4.06K views11:30

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🌵SOTA Training-Free In-Context Segmentation🌵

👉INSID3 is the new SOTA, training-free approach that segments concepts at varying granularities only from frozen DINOv3 features, given an in-context example. Repo under Apache 2.0💙

👉Review https://t.ly/NVWHN
👉Paper arxiv.org/pdf/2603.28480
👉Project visinf.github.io/INSID3/
👉Repo github.com/visinf/INSID3

❤16🔥2🤩2👍1🍾1

4.19K viewsedited 07:24

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🪬Camera Raw Image Generation🪬

👉RawGen by #Samsung is a generative approach that learns the complex distribution of raw sensor data directly, enabling high-fidelity generation from either text descriptions or standard sRGB images across arbitrary camera sensors. Linear raw image once, then apply any ISP operation. Repo announced💙

👉Review https://t.ly/_QVKP
👉Paper https://arxiv.org/pdf/2604.00093
👉Project https://dy112.github.io/rawgen-page/
👉Repo TBA

❤4🔥2👍1

4.71K views07:54

AI with Papers - Artificial Intelligence & Deep Learning

If you have to invest TODAY 1B$ on a frontier tech for the next decade, would you invest in space, agentic, quantum or frugal GPUs? Vote here: https://t.ly/hSx6i

🤣3❤1🔥1

4.52K views14:04

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🍎Video Object Deletion🍎

👉Void by Netflix is a novel video object removal framework designed to perform physically-plausible inpainting in very complex scenarios. Repo under Apache 2.0💙

👉Review https://t.ly/cMVny
👉Paper https://arxiv.org/pdf/2604.02296
👉Project https://void-model.github.io/
👉Repo https://github.com/Netflix/void-model

❤4🤯3👍2👏1

5.16K views06:33

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🔥Vanast: VTON w/ Human Animation🔥

👉SNU unveils a novel unified framework that generates garment-transferred human animation videos directly from a single human/garment images, and pose guidance clip. Repo announced💙

👉Review https://t.ly/c0t79
👉Paper arxiv.org/pdf/2604.04934
👉Project hyunsoocha.github.io/vanast/
👉Repo github.com/snuvclab/vanast

❤7👍2🤯2🔥1🍾1

4K views06:31

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🔥BoxerNet: SOTA 2D->3D BBs🔥

👉Boxer by META: transformer-based network to lift 2D BB proposals into 3D, followed by multi-view fusion and geometric filtering to produce globally consistent de-duplicated 3DBBs in metric world space. Repo under A-NC 4.0 International💙

👉Review https://t.ly/mlmV1
👉Paper https://arxiv.org/pdf/2604.05212
👉Project facebookresearch.github.io/boxer/
👉Repo github.com/facebookresearch/boxer

🤯9👍1🔥1

4.03K viewsedited 06:53

AI with Papers - Artificial Intelligence & Deep Learning

Hinton our guest in Pavia (remotely) 💚😈

Would you see a clip about the interview?

👍12❤6🔥2😍1

4.14K viewsedited 20:15

AI with Papers - Artificial Intelligence & Deep Learning

Media is too big

VIEW IN TELEGRAM

Here the preview, tomorrow the full clip from official source :)

❤5🔥1🍾1

4.49K views21:04

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🪞1.1M Metric VTON Dataset🪞

👉Google's Fit-Inclusive Try-on: large-scale VTO dataset comprising over 1.13M try-on image triplets accompanied by precise body and garment measurements. Repo & dataset announced💙

👉Review https://t.ly/cs-pt
👉Paper arxiv.org/pdf/2604.08526
👉Project johannakarras.github.io/FIT/
👉Repo TBA

🔥8❤2👍1

4.36K views06:34

AI with Papers - Artificial Intelligence & Deep Learning

🐞6D Object Pose w/ Deformation🐞

👉DeSOPE by Xidian & #MagicLeap is a novel large-scale dataset for 6DoF deformed objects: 665K pose annotations produced via a semiautomatic pipeline. Repo & Dataset announced💙

👉Review https://t.ly/M5VgX
👉Paper https://arxiv.org/pdf/2604.06720
👉Project https://desope-6d.github.io/
👉Repo TBA

🔥8❤3👏1

3.99K views06:47

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🔥SOTA 3D Detection in the wild🔥

👉WildDet3D is a novel unified geometry-aware architecture for 3D detection that natively accepts text, point, and box prompts and can incorporate auxiliary depth signals at inference time. New SOTA! Repo, models and iphone 💙

👉Review https://t.ly/8NxBN
👉Paper arxiv.org/pdf/2604.08626
👉Project allenai.github.io/WildDet3D/
👉Repo github.com/allenai/WildDet3D

🔥7❤4👏1🤯1

4.35K viewsedited 06:03

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🧴OmniShow Content Creation🧴

👉OmniShow is the novel SOTA in content creation with industry-grade performance. Impressive results, best with audio. Repo announced💙

👉Review https://t.ly/Pm-7U
👉Paper arxiv.org/pdf/2604.11804
👉Project correr-zhou.github.io/OmniShow/
👉Repo github.com/Correr-Zhou/OmniShow

❤7🤯6😢1

3.85K viewsedited 06:51

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🐓Interactive Objects from EgoVideo🐓

👉EgoFun3D by Simon Fraser University is a coordinated task, dataset and benchmark for modeling interactive 3D objects from egocentric videos. Repo (TBA), demo & dataset💙

👉Review https://t.ly/YhGN7
👉Paper arxiv.org/pdf/2604.11038
👉Project 3dlg-hcvc.github.io/EgoFun3D/
👉Repo github.com/3dlg-hcvc/EgoFun3D
👉Demo bc79fea884062374b3.gradio.live/

❤2🤯2🔥1

4.52K views12:34

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

📱3D Human-Object Contact📱

👉Pi-HOC by CMU + NREC is a novel single-pass, instance-aware framework for dense 3D semantic contact prediction of all human-object pairs. Repo announced💙

👉Review https://t.ly/TAgG1
👉Paper https://arxiv.org/pdf/2604.12923
👉Project https://pi-hoc.github.io/
👉Repo https://github.com/SravanChittupalli/Pi-HOC

🔥3❤2👏2👍1🤩1

4.82K views12:19

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🐞GCT 3D Reconstruction🐞

👉ANT unveils LingBot-Map, a feed-forward 3D foundation model for reconstructing scenes from streaming data, built upon a geometric context transformer (GCT) architecture. Repo under A-NC 4.0 International💙

👉Review https://t.ly/ExodA
👉Paper https://arxiv.org/pdf/2604.14141
👉Project https://arxiv.org/pdf/2604.14141
👉Repo github.com/robbyant/lingbot-map

🔥9❤4👍2👏1

5.09K views06:57

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

👩‍🦰Deformable 3D Hair👩‍🦰

👉Xi’an Jiaotong University unveils a novel method that reconstructs decoupled 3D Gaussian head avatars from a single input image: effortless hairstyle transfer with natural dynamic hair motion. Code announced💙

👉Review https://t.ly/kWZdd
👉Paper https://arxiv.org/pdf/2604.14782
👉Project yuansun-xjtu.github.io/CompHairHead.io/
👉Repo yuansun-xjtu.github.io/CompHairHead.io/

❤6🔥3👏1🤩1

4.58K viewsedited 06:35

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🌗Mobile Ultra-detailed Avatars🌗

👉Given skeletal poses and a virtual camera as inputs, MUA by Max Planck Institute produces photorealistic renderings and hyper-detailed geometry of animatable clothed humans. Repo announced💙

👉Review https://t.ly/QPCy6
👉Paper https://arxiv.org/pdf/2604.18583
👉Project https://vcai.mpi-inf.mpg.de/projects/MUA/
👉Repo TBA

❤11🔥1

4.41K views06:55

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🎈Face Anything 4D (SOTA)🎈

👉A novel unified 4D facial reconstruction and dense tracking from image sequences: new SOTA in facial single-image and mono-video depth estimation, dense 4D reconstruction, and 3D point tracking. Repo & Dataset announced💙

👉Review https://t.ly/zItie
👉Paper https://arxiv.org/pdf/2604.19702
👉Project kocasariumut.github.io/FaceAnything
👉Repo TBA

❤5🔥2👍1🤯1

5.15K views06:33

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

💙 PY4AI 2026: here we are! 💙

👉The third edition of our conference is official! Speaker list and (free) tickets: https://t.ly/L4_52

❤10👍1🤯1😢1🤩1

5.38K viewsedited 06:41