AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🕷️ Gen-NeRF2NeRF Translation 🕷️

👉GenN2N: unified NeRF-to-NeRF translation for editing tasks such as text-driven NeRF editing, colorization, super-resolution, inpainting, etc.

👉Review https://t.ly/VMWAH
👉Paper arxiv.org/pdf/2404.02788.pdf
👉Project xiangyueliu.github.io/GenN2N/
👉Code github.com/Lxiangyue/GenN2N
🤯43🥰1
This media is not supported in your browser
VIEW IN TELEGRAM
👆iSeg: Interactive 3D Segmentation👆

👉 iSeg: interactive segmentation technique for 3D shapes operating entirely in 3D. It accepts both positive/negative clicks directly on the shape's surface, indicating inclusion & exclusion of regions.

👉Review https://t.ly/tyFnD
👉Paper https://lnkd.in/dydAz8zp
👉Project https://lnkd.in/de-h6SRi
👉Code (coming)
7👏2🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
👗 Neural Bodies with Clothes 👗

👉Neural-ABC is a novel parametric model based on neural implicit functions that can represent clothed human bodies with disentangled latent spaces for ID, clothing, shape, and pose.

👉Review https://t.ly/Un1wc
👉Project https://lnkd.in/dhDG6FF5
👉Paper https://lnkd.in/dhcfK7jZ
👉Code https://lnkd.in/dQvXWysP
🔥7👍2👏1
This media is not supported in your browser
VIEW IN TELEGRAM
🔌 BodyMAP: human body & pressure 🔌

👉#Nvidia (+CMU) unveils BodyMAP, the new SOTA in predicting body mesh (3D pose & shape) and 3D applied pressure on the human body. Source Code released, Dataset coming 💙

👉Review https://t.ly/8926S
👉Project bodymap3d.github.io/
👉Paper https://lnkd.in/gCxH4ev3
👉Code https://lnkd.in/gaifdy3q
8🤯41👍1🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
🧞 XComposer2: 4K Vision-Language 🧞

👉InternLMXComposer2-4KHD brings LVLM resolution capabilities up to 4K HD (3840×1600) and beyond. Authors: Shanghai AI Lab, CUHK, SenseTime & Tsinghua. Source Code & Models released 💙

👉Review https://t.ly/GCHsz
👉Paper arxiv.org/pdf/2404.06512.pdf
👉Code github.com/InternLM/InternLM-XComposer
🥰72👍1
This media is not supported in your browser
VIEW IN TELEGRAM
⚛️ Flying w/ Photons: Neural Render ⚛️

👉Novel neural rendering technique that seeks to synthesize videos of light propagating through a scene from novel, moving camera viewpoints. Pico-Seconds time resolution!

👉Review https://t.ly/ZqL3a
👉Paper arxiv.org/pdf/2404.06493.pdf
👉Project anaghmalik.com/FlyingWithPhotons/
👉Code github.com/anaghmalik/FlyingWithPhotons
🤯632👍1🤣1
This media is not supported in your browser
VIEW IN TELEGRAM
☄️ Tracking Any 2D Pixels in 3D ☄️

👉 SpatialTracker lifts 2D pixels to 3D using monocular depth, represents the 3D content of each frame efficiently using a triplane representation, and performs iterative updates using a transformer to estimate 3D trajectories.

👉Review https://t.ly/B28Cj
👉Paper https://lnkd.in/d8ers_nm
👉Project https://lnkd.in/deHjtZuE
👉Code https://lnkd.in/dMe3TvFT
10🔥51👏1
This media is not supported in your browser
VIEW IN TELEGRAM
🪐YOLO-CIANNA: Neural Astro🪐

👉 CIANNA is a general-purpose deep learning framework for (but not only for) astronomical data analysis. Source Code released 💙

👉Review https://t.ly/441XS
👉Paper arxiv.org/pdf/2402.05925.pdf
👉Code github.com/Deyht/CIANNA
👉Wiki github.com/Deyht/CIANNA/wiki
👍754🔥2🥰2
This media is not supported in your browser
VIEW IN TELEGRAM
🧤Neuro MusculoSkeletal-MANO🧤

👉SJTU unveils MusculoSkeletal-MANO, novel musculoskeletal system with a learnable parametric hand model. Source Code announced 💙

👉Review https://t.ly/HOQrn
👉Paper arxiv.org/pdf/2404.10227.pdf
👉Project https://ms-mano.robotflow.ai/
👉Code announced (no repo yet)
🔥311👍1👏1
This media is not supported in your browser
VIEW IN TELEGRAM
SoccerNET: Athlete Tracking

👉SoccerNet Challenge is a novel high level computer vision task that is specific to sports analytics. It aims at recognizing the state of a sport game, i.e., identifying and localizing all sports individuals (players, referees, ..) on the field.

👉Review https://t.ly/Mdu9s
👉Paper arxiv.org/pdf/2404.11335.pdf
👉Code github.com/SoccerNet/sn-gamestate
9👍8🔥32🤯1
This media is not supported in your browser
VIEW IN TELEGRAM
🎲 Articulated Objs from MonoClips 🎲

👉REACTO is the new SOTA to address the challenge of reconstructing general articulated 3D objects from single monocular video

👉Review https://t.ly/REuM8
👉Paper https://lnkd.in/d6PWagij
👉Project https://lnkd.in/dpg3x4tm
👉Repo https://lnkd.in/dRZWj6_N
🤯6👍1🔥1👏1
This media is not supported in your browser
VIEW IN TELEGRAM
🪼 All You Need is SAM (+Flow) 🪼

👉Oxford unveils the new SOTA for moving object segmentation via SAM + Optical Flow. Two novel models & Source Code announced 💙

👉Review https://t.ly/ZRYtp
👉Paper https://lnkd.in/d4XqkEGF
👉Project https://lnkd.in/dHpmx3FF
👉Repo coming: https://github.com/Jyxarthur/
12👍7🔥2🤯2
This media is not supported in your browser
VIEW IN TELEGRAM
🛞 6Img-to-3D driving scenarios 🛞

👉EPFL (+ Continental) unveils 6Img-to-3D, novel transformer-based encoder-renderer method to create 3D onbounded outdoor driving scenarios with only six pics

👉Review https://shorturl.at/dZ018
👉Paper arxiv.org/pdf/2404.12378.pdf
👉Project 6img-to-3d.github.io/
👉Code github.com/continental/6Img-to-3D
🔥51👍1
This media is not supported in your browser
VIEW IN TELEGRAM
🌹 Physics-Based 3D Video-Gen 🌹

👉PhysDreamer, a physics-based approach that leverages the object dynamics priors learned by video generation models. It enables realistic 3D interaction with objects

👉Review https://t.ly/zxXf9
👉Paper arxiv.org/pdf/2404.13026.pdf
👉Project physdreamer.github.io/
👉Code github.com/a1600012888/PhysDreamer
👍149🤯4👏1
This media is not supported in your browser
VIEW IN TELEGRAM
🎡 NER-Net: Seeing at Night-Time 🎡

👉Huazhong (+Beijing) unveils a novel event-based nighttime imaging solution under non-uniform illumination, plus a paired multi-illumination level real-world dataset. Repo online, code coming 💙

👉Review https://t.ly/Z9JMJ
👉Paper arxiv.org/pdf/2404.11884.pdf
👉Repo github.com/Liu-haoyue/NER-Net
👉Clip https://www.youtube.com/watch?v=zpfTLCF1Kw4
🤯3🔥21👍1
This media is not supported in your browser
VIEW IN TELEGRAM
🌊 FlowMap: dense depth video 🌊

👉MIT (+CSAIL) unveils FlowMap, a novel E2E differentiable method that solves for precise camera poses, camera intrinsics, and perframe dense depth of a video sequence. Source Code released 💙

👉Review https://t.ly/CBH48
👉Paper arxiv.org/pdf/2404.15259.pdf
👉Project cameronosmith.github.io/flowmap
👉Code github.com/dcharatan/flowmap
🔥183👍2
This media is not supported in your browser
VIEW IN TELEGRAM
👗TELA: Text to 3D Clothed Human👗

👉 TELA is a novel approach for the new task of clothing disentangled 3D human model generation from texts. This novel approach unleashes the potential of many downstream applications (e.g., virtual try-on).

👉Review https://t.ly/6N7JV
👉Paper https://arxiv.org/pdf/2404.16748
👉Project https://jtdong.com/tela_layer/
👉Code https://github.com/DongJT1996/TELA
👍5🔥4🤯3👏1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🪷 Tunnel Try-on: SOTA VTON 🪷

👉"Tunnel Try-on", the first diffusion-based video virtual try-on model that demonstrates SOTA performance in complex scenarios. No code announced :(

👉Review https://t.ly/joMtJ
👉Paper arxiv.org/pdf/2404.17571
👉Project mengtingchen.github.io/tunnel-try-on-page/
9🔥4👍1🥰1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🏝️1000x Scalable Neural 3D Fields🏝️

👉Highly-scalable neural 3D Fields: 1000x reductions in memory maintaining speed/quality: 10 MB vs. 10 GB! Code released 💙

👉Review https://t.ly/sLTK5
👉Paper https://lnkd.in/dEYM8-t2
👉Project https://lnkd.in/djptdujx
👉Code https://lnkd.in/dcCnFZ2n
🤯13👍5🔥43🥰1
This media is not supported in your browser
VIEW IN TELEGRAM
🌐3D Scenes w/ Depth Inpainting🌐

👉Oxford announced two novel contributions to the field of 3D scene generation: a new benchmark and a novel depth completion model. 🤗-Demo and Source Code released💙

👉Review https://t.ly/BKiny
👉Paper arxiv.org/pdf/2404.19758
👉Project research.paulengstler.com/invisible-stitch/
👉Code github.com/paulengstler/invisible-stitch
👉Demo huggingface.co/spaces/paulengstler/invisible-stitch
3👏2👍1🔥1🥰1🤯1🍾1