AI with Papers - Artificial Intelligence & Deep Learning – Telegram

AI with Papers - Artificial Intelligence & Deep Learning

@AI_DeepLearning

15K subscribers

96 photos

238 videos

11 files

1.27K links

All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

Download Telegram

About

Blog

Apps

Platform

AI with Papers - Artificial Intelligence & Deep Learning

15K subscribers

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

⚽SoccerNET: Athlete Tracking⚽

👉SoccerNet Challenge is a novel high level computer vision task that is specific to sports analytics. It aims at recognizing the state of a sport game, i.e., identifying and localizing all sports individuals (players, referees, ..) on the field.

👉Review https://t.ly/Mdu9s
👉Paper arxiv.org/pdf/2404.11335.pdf
👉Code github.com/SoccerNet/sn-gamestate

❤9👍8🔥3⚡2🤯1

7.47K viewsedited 13:40

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🎲 Articulated Objs from MonoClips 🎲

👉REACTO is the new SOTA to address the challenge of reconstructing general articulated 3D objects from single monocular video

👉Review https://t.ly/REuM8
👉Paper https://lnkd.in/d6PWagij
👉Project https://lnkd.in/dpg3x4tm
👉Repo https://lnkd.in/dRZWj6_N

🤯6👍1🔥1👏1

7.12K views12:20

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🪼 All You Need is SAM (+Flow) 🪼

👉Oxford unveils the new SOTA for moving object segmentation via SAM + Optical Flow. Two novel models & Source Code announced 💙

👉Review https://t.ly/ZRYtp
👉Paper https://lnkd.in/d4XqkEGF
👉Project https://lnkd.in/dHpmx3FF
👉Repo coming: https://github.com/Jyxarthur/

❤12👍7🔥2🤯2

7.61K viewsedited 12:23

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🛞 6Img-to-3D driving scenarios 🛞

👉EPFL (+ Continental) unveils 6Img-to-3D, novel transformer-based encoder-renderer method to create 3D onbounded outdoor driving scenarios with only six pics

👉Review https://shorturl.at/dZ018
👉Paper arxiv.org/pdf/2404.12378.pdf
👉Project 6img-to-3d.github.io/
👉Code github.com/continental/6Img-to-3D

🔥5❤1👍1

7.49K views07:35

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🌹 Physics-Based 3D Video-Gen 🌹

👉PhysDreamer, a physics-based approach that leverages the object dynamics priors learned by video generation models. It enables realistic 3D interaction with objects

👉Review https://t.ly/zxXf9
👉Paper arxiv.org/pdf/2404.13026.pdf
👉Project physdreamer.github.io/
👉Code github.com/a1600012888/PhysDreamer

👍14❤9🤯4👏1

7.99K viewsedited 06:46

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🎡 NER-Net: Seeing at Night-Time 🎡

👉Huazhong (+Beijing) unveils a novel event-based nighttime imaging solution under non-uniform illumination, plus a paired multi-illumination level real-world dataset. Repo online, code coming 💙

👉Review https://t.ly/Z9JMJ
👉Paper arxiv.org/pdf/2404.11884.pdf
👉Repo github.com/Liu-haoyue/NER-Net
👉Clip https://www.youtube.com/watch?v=zpfTLCF1Kw4

🤯3🔥2❤1👍1

8.42K viewsedited 12:20

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🌊 FlowMap: dense depth video 🌊

👉MIT (+CSAIL) unveils FlowMap, a novel E2E differentiable method that solves for precise camera poses, camera intrinsics, and perframe dense depth of a video sequence. Source Code released 💙

👉Review https://t.ly/CBH48
👉Paper arxiv.org/pdf/2404.15259.pdf
👉Project cameronosmith.github.io/flowmap
👉Code github.com/dcharatan/flowmap

🔥18❤3👍2

8.4K viewsedited 06:50

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

👗TELA: Text to 3D Clothed Human👗

👉 TELA is a novel approach for the new task of clothing disentangled 3D human model generation from texts. This novel approach unleashes the potential of many downstream applications (e.g., virtual try-on).

👉Review https://t.ly/6N7JV
👉Paper https://arxiv.org/pdf/2404.16748
👉Project https://jtdong.com/tela_layer/
👉Code https://github.com/DongJT1996/TELA

👍5🔥4🤯3👏1🍾1

7.53K views07:27

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🪷 Tunnel Try-on: SOTA VTON 🪷

👉"Tunnel Try-on", the first diffusion-based video virtual try-on model that demonstrates SOTA performance in complex scenarios. No code announced :(

👉Review https://t.ly/joMtJ
👉Paper arxiv.org/pdf/2404.17571
👉Project mengtingchen.github.io/tunnel-try-on-page/

❤9🔥4👍1🥰1🍾1

8.08K views07:03

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🏝️1000x Scalable Neural 3D Fields🏝️

👉Highly-scalable neural 3D Fields: 1000x reductions in memory maintaining speed/quality: 10 MB vs. 10 GB! Code released 💙

👉Review https://t.ly/sLTK5
👉Paper https://lnkd.in/dEYM8-t2
👉Project https://lnkd.in/djptdujx
👉Code https://lnkd.in/dcCnFZ2n

🤯13👍5🔥4❤3🥰1

7.75K views07:08

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🌐3D Scenes w/ Depth Inpainting🌐

👉Oxford announced two novel contributions to the field of 3D scene generation: a new benchmark and a novel depth completion model. 🤗-Demo and Source Code released💙

👉Review https://t.ly/BKiny
👉Paper arxiv.org/pdf/2404.19758
👉Project research.paulengstler.com/invisible-stitch/
👉Code github.com/paulengstler/invisible-stitch
👉Demo huggingface.co/spaces/paulengstler/invisible-stitch

❤3👏2👍1🔥1🥰1🤯1🍾1

8.26K viewsedited 11:36

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🌊 Diffusive 3D Human Recovery 🌊

👉The Rutgers University unveils ScoreHMR at #CVPR24; novel approach for 3D human pose and shape reconstruction. Impressive results.

👉Review https://t.ly/G0k2D
👉Paper https://arxiv.org/pdf/2403.09623
👉Code https://github.com/statho/ScoreHMR
👉Project https://statho.github.io/ScoreHMR/

🤯11👍6❤1👏1🤣1

7.63K views11:44

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🏷️DiffMOT (#CVPR24): diffusion-MOT🏷️

👉DiffMOT is a novel real-time diffusion-based MOT approach to tackle the complex nonlinear motion. Impressive results & Source Code released💙

👉Review https://t.ly/ztlHi
👉Paper https://lnkd.in/d4K3c-nt
👉Project https://diffmot.github.io/
👉Code github.com/Kroery/DiffMOT

❤12👍4🔥3🤯3

7.42K views07:21

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🍏 XFeat: Neural Features Matching 🍏

👉XFeat (Accelerated Features) is lightweight/accurate architecture for efficient visual correspondence. It revisits fundamental design choices in CNN for detecting, extracting & matching local features

👉Review https://t.ly/ppb38
👉Paper arxiv.org/pdf/2404.19174
👉Code https://lnkd.in/dFzTpzN8
👉Project https://lnkd.in/d8JnV-iu

❤17🤯6⚡3👏1🍾1

7.85K views06:40

AI with Papers - Artificial Intelligence & Deep Learning

🦑 Hyper-Detailed Image Descriptions 🦑

👉#Google unveils ImageInWords (IIW), a carefully designed HIL annotation framework for curating hyper-detailed image descriptions and a new dataset resulting from this process

👉Review https://t.ly/engkl
👉Paper arxiv.org/pdf/2405.02793
👉Repo github.com/google/imageinwords
👉Project google.github.io/imageinwords
👉Data huggingface.co/datasets/google/imageinwords

❤11🔥3👍2🤯2🍾1

7.94K viewsedited 16:01

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🔫 Free-Moving Reconstruction 🔫

👉EPFL (+#MagicLeap) unveils a novel approach for reconstructing free-moving object from monocular RGB clip. Free interaction with objects in front of a moving cam without relying on any prior, and optimizes the sequence globally without any segments. Great but no code announced🥺

👉Review https://t.ly/2xhtj
👉Paper arxiv.org/pdf/2405.05858
👉Project haixinshi.github.io/fmov/

👍6🤯4⚡1❤1🥰1

8.49K views08:55

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

💥FeatUp: Any Model at Any Resolution💥

👉FeatUp is a task-model agnostic framework to restore lost spatial information in deep features. It outperforms other methods in class activation map generation, transfer learning for segmentation & depth, and end-to-end training for semantic segm. Source Code released💙

👉Review https://t.ly/Evq_g
👉Paper https://lnkd.in/gweaN4s6
👉Project https://lnkd.in/gWcGXdxt
👉Code https://lnkd.in/gweq5NY4

🔥19❤4👍3👏1🍾1

8.01K viewsedited 06:52

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🐏AniTalker: Universal Talking Humans🐏

👉SJTU (+AISpeech) unveils AniTalker, a framework that transforms a single static portrait and input audio into animated talking videos with naturally flowing movements.

👉Review https://t.ly/MD4yX
👉Paper https://arxiv.org/pdf/2405.03121
👉Project https://x-lance.github.io/AniTalker/
👉Repo https://github.com/X-LANCE/AniTalker

🔥6❤4👍2⚡1🤯1

7.18K viewsedited 12:38

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

👻 3D Humans Motion from Text 👻

👉Zhejiang (+ANT) unveils a novel method to generate human motions containing accurate human-object interactions in 3D scenes based on textural descriptions. Code announced, coming 💙

👉Review https://t.ly/eOZnU
👉Paper https://arxiv.org/pdf/2405.07784
👉Project https://zju3dv.github.io/text_scene_motion/

👍3🔥2❤1

7.45K viewsedited 06:57

AI with Papers - Artificial Intelligence & Deep Learning

This media is not supported in your browser

VIEW IN TELEGRAM

🪬UHM: Authentic Hand by Phone🪬

👉 META unveils UHM, novel 3D high-fidelity avatarization of your (yes, the your one) hand. Adaptation pipeline fits the pre-trained UHM via phone scan. Source Code released 💙

👉Review https://t.ly/fU5rA
👉Paper https://lnkd.in/dyGaiAnq
👉Code https://lnkd.in/d9B_XFAA

👍4❤1🔥1🤯1

7.5K views15:51