AI with Papers - Artificial Intelligence & Deep Learning

🔘 RELI11D: Multimodal Humans 🔘

👉RELI11D is the ultimate and high-quality multimodal human motion dataset involving LiDAR, IMU system, RGB camera, and Event camera. Dataset & Source Code to be released soon💙

👉Review https://t.ly/5EG6X
👉Paper https://lnkd.in/ep6Utcik
👉Project https://lnkd.in/eDhNHYBb

❤3🔥2

8.12K views09:23

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🔥 ECoDepth: SOTA Diffusive Mono-Depth 🔥

👉New SIDE model using a diffusion backbone conditioned on ViT embeddings. It's the new SOTA in SIDE. Source Code released 💙

👉Review https://t.ly/s2pbB
👉Paper https://lnkd.in/eYt5yr_q
👉Code https://lnkd.in/eEcyPQcd

🔥11👍4❤3⚡1

8.58K views09:24

AI with Papers - Artificial Intelligence & Deep Learning

🦕 DINO-based Video Tracking 🦕 👉The Weizmann Institute announced the new SOTA in point-tracking via pre-trained DINO features. Source code announced (not yet released)💙 👉Review https://t.ly/_GIMT 👉Paper https://lnkd.in/dsGVDcar 👉Project dino-tracker.github.io/…

🔥🔥Code is out 🔥🔥

https://github.com/AssafSinger94/dino-tracker

GitHub

GitHub - AssafSinger94/dino-tracker: Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking…

Official Pytorch Implementation for “DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video” (ECCV 2024) - AssafSinger94/dino-tracker

👍10❤2

7.31K views11:04

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🕷️ Gen-NeRF2NeRF Translation 🕷️

👉GenN2N: unified NeRF-to-NeRF translation for editing tasks such as text-driven NeRF editing, colorization, super-resolution, inpainting, etc.

👉Review https://t.ly/VMWAH
👉Paper arxiv.org/pdf/2404.02788.pdf
👉Project xiangyueliu.github.io/GenN2N/
👉Code github.com/Lxiangyue/GenN2N

🤯4❤3🥰1

7.9K viewsedited 13:28

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

👆iSeg: Interactive 3D Segmentation👆

👉 iSeg: interactive segmentation technique for 3D shapes operating entirely in 3D. It accepts both positive/negative clicks directly on the shape's surface, indicating inclusion & exclusion of regions.

👉Review https://t.ly/tyFnD
👉Paper https://lnkd.in/dydAz8zp
👉Project https://lnkd.in/de-h6SRi
👉Code (coming)

❤7👏2🔥1

7.94K viewsedited 07:37

AI with Papers - Artificial Intelligence & Deep Learning

0:06

This media is not supported in your browser

VIEW IN TELEGRAM

👗 Neural Bodies with Clothes 👗

👉Neural-ABC is a novel parametric model based on neural implicit functions that can represent clothed human bodies with disentangled latent spaces for ID, clothing, shape, and pose.

👉Review https://t.ly/Un1wc
👉Project https://lnkd.in/dhDG6FF5
👉Paper https://lnkd.in/dhcfK7jZ
👉Code https://lnkd.in/dQvXWysP

🔥7👍2👏1

7.58K viewsedited 07:40

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🔌 BodyMAP: human body & pressure 🔌

👉#Nvidia (+CMU) unveils BodyMAP, the new SOTA in predicting body mesh (3D pose & shape) and 3D applied pressure on the human body. Source Code released, Dataset coming 💙

👉Review https://t.ly/8926S
👉Project bodymap3d.github.io/
👉Paper https://lnkd.in/gCxH4ev3
👉Code https://lnkd.in/gaifdy3q

❤8🤯4⚡1👍1🔥1

7.7K views15:07

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🧞 XComposer2: 4K Vision-Language 🧞

👉InternLMXComposer2-4KHD brings LVLM resolution capabilities up to 4K HD (3840×1600) and beyond. Authors: Shanghai AI Lab, CUHK, SenseTime & Tsinghua. Source Code & Models released 💙

👉Review https://t.ly/GCHsz
👉Paper arxiv.org/pdf/2404.06512.pdf
👉Code github.com/InternLM/InternLM-XComposer

🥰7⚡2👍1

7.28K views09:45

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

⚛️ Flying w/ Photons: Neural Render ⚛️

👉Novel neural rendering technique that seeks to synthesize videos of light propagating through a scene from novel, moving camera viewpoints. Pico-Seconds time resolution!

👉Review https://t.ly/ZqL3a
👉Paper arxiv.org/pdf/2404.06493.pdf
👉Project anaghmalik.com/FlyingWithPhotons/
👉Code github.com/anaghmalik/FlyingWithPhotons

🤯6⚡3❤2👍1🤣1

7.48K views13:39

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

☄️ Tracking Any 2D Pixels in 3D ☄️

👉 SpatialTracker lifts 2D pixels to 3D using monocular depth, represents the 3D content of each frame efficiently using a triplane representation, and performs iterative updates using a transformer to estimate 3D trajectories.

👉Review https://t.ly/B28Cj
👉Paper https://lnkd.in/d8ers_nm
👉Project https://lnkd.in/deHjtZuE
👉Code https://lnkd.in/dMe3TvFT

❤10🔥5⚡1👏1

8.3K views06:44

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🪐YOLO-CIANNA: Neural Astro🪐

👉 CIANNA is a general-purpose deep learning framework for (but not only for) astronomical data analysis. Source Code released 💙

👉Review https://t.ly/441XS
👉Paper arxiv.org/pdf/2402.05925.pdf
👉Code github.com/Deyht/CIANNA
👉Wiki github.com/Deyht/CIANNA/wiki

👍7⚡5❤4🔥2🥰2

8.4K viewsedited 09:00

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🧤Neuro MusculoSkeletal-MANO🧤

👉SJTU unveils MusculoSkeletal-MANO, novel musculoskeletal system with a learnable parametric hand model. Source Code announced 💙

👉Review https://t.ly/HOQrn
👉Paper arxiv.org/pdf/2404.10227.pdf
👉Project https://ms-mano.robotflow.ai/
👉Code announced (no repo yet)

🔥3⚡1❤1👍1👏1

7.25K viewsedited 14:58

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

⚽SoccerNET: Athlete Tracking⚽

👉SoccerNet Challenge is a novel high level computer vision task that is specific to sports analytics. It aims at recognizing the state of a sport game, i.e., identifying and localizing all sports individuals (players, referees, ..) on the field.

👉Review https://t.ly/Mdu9s
👉Paper arxiv.org/pdf/2404.11335.pdf
👉Code github.com/SoccerNet/sn-gamestate

❤9👍8🔥3⚡2🤯1

7.51K viewsedited 13:40

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🎲 Articulated Objs from MonoClips 🎲

👉REACTO is the new SOTA to address the challenge of reconstructing general articulated 3D objects from single monocular video

👉Review https://t.ly/REuM8
👉Paper https://lnkd.in/d6PWagij
👉Project https://lnkd.in/dpg3x4tm
👉Repo https://lnkd.in/dRZWj6_N

🤯6👍1🔥1👏1

7.17K views12:20

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🪼 All You Need is SAM (+Flow) 🪼

👉Oxford unveils the new SOTA for moving object segmentation via SAM + Optical Flow. Two novel models & Source Code announced 💙

👉Review https://t.ly/ZRYtp
👉Paper https://lnkd.in/d4XqkEGF
👉Project https://lnkd.in/dHpmx3FF
👉Repo coming: https://github.com/Jyxarthur/

❤12👍7🔥2🤯2

7.65K viewsedited 12:23

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🛞 6Img-to-3D driving scenarios 🛞

👉EPFL (+ Continental) unveils 6Img-to-3D, novel transformer-based encoder-renderer method to create 3D onbounded outdoor driving scenarios with only six pics

👉Review https://shorturl.at/dZ018
👉Paper arxiv.org/pdf/2404.12378.pdf
👉Project 6img-to-3d.github.io/
👉Code github.com/continental/6Img-to-3D

🔥5❤1👍1

7.53K views07:35

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🌹 Physics-Based 3D Video-Gen 🌹

👉PhysDreamer, a physics-based approach that leverages the object dynamics priors learned by video generation models. It enables realistic 3D interaction with objects

👉Review https://t.ly/zxXf9
👉Paper arxiv.org/pdf/2404.13026.pdf
👉Project physdreamer.github.io/
👉Code github.com/a1600012888/PhysDreamer

👍14❤9🤯4👏1

8.03K viewsedited 06:46

AI with Papers - Artificial Intelligence & Deep Learning

0:04

This media is not supported in your browser

VIEW IN TELEGRAM

🎡 NER-Net: Seeing at Night-Time 🎡

👉Huazhong (+Beijing) unveils a novel event-based nighttime imaging solution under non-uniform illumination, plus a paired multi-illumination level real-world dataset. Repo online, code coming 💙

👉Review https://t.ly/Z9JMJ
👉Paper arxiv.org/pdf/2404.11884.pdf
👉Repo github.com/Liu-haoyue/NER-Net
👉Clip https://www.youtube.com/watch?v=zpfTLCF1Kw4

🤯3🔥2❤1👍1

8.45K viewsedited 12:20

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🌊 FlowMap: dense depth video 🌊

👉MIT (+CSAIL) unveils FlowMap, a novel E2E differentiable method that solves for precise camera poses, camera intrinsics, and perframe dense depth of a video sequence. Source Code released 💙

👉Review https://t.ly/CBH48
👉Paper arxiv.org/pdf/2404.15259.pdf
👉Project cameronosmith.github.io/flowmap
👉Code github.com/dcharatan/flowmap

🔥18❤3👍2

8.43K viewsedited 06:50

AI with Papers - Artificial Intelligence & Deep Learning

0:11

This media is not supported in your browser

VIEW IN TELEGRAM

👗TELA: Text to 3D Clothed Human👗

👉 TELA is a novel approach for the new task of clothing disentangled 3D human model generation from texts. This novel approach unleashes the potential of many downstream applications (e.g., virtual try-on).

👉Review https://t.ly/6N7JV
👉Paper https://arxiv.org/pdf/2404.16748
👉Project https://jtdong.com/tela_layer/
👉Code https://github.com/DongJT1996/TELA

👍5🔥4🤯3👏1🍾1

7.56K views07:27

About

Blog

Apps

Platform