AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
237 videos
11 files
1.27K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🍋 Long Video via Transformers 🍋

👉TECO is a vector-quantized latent dynamics prediction for long video

😎Review https://bit.ly/3Ch0tWD
😎Project wilson1yan.github.io/teco/
😎Paper arxiv.org/pdf/2210.02396.pdf
😎Code github.com/wilson1yan/teco
👏7
This media is not supported in your browser
VIEW IN TELEGRAM
🔥SIMPLI: ligh novel-view synthesis🔥

👉Lightweight novel-view synthesis by #Samsung for arbitrary forward-facing scenes

😎Review https://bit.ly/3CivSYZ
😎Project samsunglabs.github.io/MLI
😎Code github.com/SamsungLabs/MLI
😎Paper samsunglabs.github.io/MLI/paper/paper.pdf
👍8
This media is not supported in your browser
VIEW IN TELEGRAM
🥏 EVA3D: new SOTA in #3D humans 🥏

👉EVA3D: new SOTA for unconditional NeRF-human generation from 2D only

😎Review https://bit.ly/3Th9qX7
😎Code github.com/hongfz16/EVA3D
😎Paper arxiv.org/pdf/2210.04888.pdf
😎Project hongfz16.github.io/projects/EVA3D.html
🔥14👍2
This media is not supported in your browser
VIEW IN TELEGRAM
🍏 f-DM: Diffusion Models by Apple 🍏

👉Spectacular work by #Apple on DMs: HQ generation with better efficiency and semantic

😎Review https://bit.ly/3Tils2u
😎Project https://jiataogu.me/fdm/
😎Paper arxiv.org/pdf/2210.04955.pdf
10😱2👍1
This media is not supported in your browser
VIEW IN TELEGRAM
🏅GENIE by #Nvidia -> Faster Generation🏅

👉Higher-Order Denoising Diffusion Solvers for faster and better synthesis

😎Review https://bit.ly/3CRjtwr
😎Project nv-tlabs.github.io/GENIE/
😎Paper arxiv.org/pdf/2210.05475.pdf
😎Code github.com/nv-tlabs/GENIE
🔥10👍4
This media is not supported in your browser
VIEW IN TELEGRAM
🥬 "Perception Test" by #DeepMind 🥬

👉Huge dataset with obj & point tracks, temporal sounds, multiple & grounded vQA

😎Review https://bit.ly/3Vqh96Q
😎Dataset github.com/deepmind/perception_test
😎Project www.deepmind.com/blog/measuring-perception-in-ai-models
👍15🔥4😱3
This media is not supported in your browser
VIEW IN TELEGRAM
🦑 Instant Map-free Relocalization 🦑

👉#Niantic unveils a novel instant, metric scaled re-localization with one single photo

😎Review https://bit.ly/3S1Gdyh
😎Paper arxiv.org/pdf/2210.05494.pdf
😎Project research.nianticlabs.com/mapfree-reloc-benchmark
😎Data research.nianticlabs.com/mapfree-reloc-benchmark/dataset
🔥13👍2
This media is not supported in your browser
VIEW IN TELEGRAM
🧮 Novel DM for 3D Shapes by #Nvidia 🧮

👉Hierarchical Latent Point Diffusion Model (LION) for 3D shape generation

😎Review https://bit.ly/3yDhZ6I
😎Paper arxiv.org/pdf/2210.06978.pdf
😎Project https://nv-tlabs.github.io/LION/
😎Code(soon) github.com/nv-tlabs/LION
11😱2🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
🪲#6D estimation fully in the wild🪲

👉First ever self-supervised 6D pose estimation training in the wild

😎Review https://bit.ly/3yHdHuS
😎Paper arxiv.org/pdf/2210.07199.pdf
😎Project kywind.github.io/self-pose
😎Code (soon)
👍15🤯8😱4
This media is not supported in your browser
VIEW IN TELEGRAM
Stable Diffusion in #Blender

👉Render with SuperPowers: novel scene render via text prompt

😎Review https://bit.ly/3s1mEeN
😎Code github.com/benrugg/AI-Render
🤯8👍52
This media is not supported in your browser
VIEW IN TELEGRAM
Markerless Body-Object Interaction

👉Novel whole-bodies/objects interaction method from multi-view RGB-D data

😎Review https://bit.ly/3yO56GY
😎Data intercap.is.tue.mpg.de/login.php
😎Project https://intercap.is.tue.mpg.de
😎Code github.com/YinghaoHuang91
😎Paper intercap.is.tue.mpg.de/media/upload/main.pdf
🔥6👍2🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥 Dressing Avatars by #META 🔥

👉Novel deep photorealistic appearance method for physically-simulated clothing in #metaverse

😎Review https://bit.ly/3yRBW9Y
😎Paper arxiv.org/pdf/2206.15470.pdf
🤯7👍5🍾21
This media is not supported in your browser
VIEW IN TELEGRAM
🪂 Parallel NeRF for 6-DoF pose 🪂

👉#Nvidia unveils a parallel NeRF for 6-DoF target pose estimation

😎Review https://bit.ly/3guWWwA
😎Paper arxiv.org/pdf/2210.10108.pdf
😎Project https://pnerfp.github.io/
👍8🔥3
This media is not supported in your browser
VIEW IN TELEGRAM
🦙LaMAR: Localization/Mapping for #AR🦙

👉A new benchmark for #AR in large and unconstrained scenes

😎Review https://bit.ly/3DjlnWU
😎Paper lamar.ethz.ch/files/LaMAR.pdf
😎Project https://lamar.ethz.ch/
😎Code github.com/microsoft/lamar-benchmark
👍7🔥4💯4
This media is not supported in your browser
VIEW IN TELEGRAM
🔥New SOTA in Panoptic Segmentation🔥

👉#Google (with Hinton🤯) unveils Pix2Seq-D: novel generalist framework for panoptic segmentation

😎Review https://bit.ly/3DmpbGM
😎Paper arxiv.org/pdf/2210.06366.pdf
🔥9👍5🤯3
This media is not supported in your browser
VIEW IN TELEGRAM
🎨 UniColor: Unified Colorization 🎨

👉The first unified framework for colorization via stroke, exemplar, text, and a mix of them

😎Review https://bit.ly/3gESR9y
😎Paper arxiv.org/pdf/2209.11223.pdf
😎Project luckyhzt.github.io/unicolor
😎Code (SOON)
🤯18🔥6👍1
This media is not supported in your browser
VIEW IN TELEGRAM
🤯 Full-Body from head/hand signals 🤯

👉#Meta unveils AvatarPoser: first full-body pose method via user’s head/hands

😎Review https://bit.ly/3gESR9y
😎Paper arxiv.org/pdf/2207.13784.pdf
😎Code github.com/eth-siplab/AvatarPoser
👍9👏31
This media is not supported in your browser
VIEW IN TELEGRAM
🤖JRBD: Egocentric Perception of Humans🤖

👉Stanford -> JRDB-Pose: Dataset with 600,000+ body pose annotations!

😎Review https://bit.ly/3gEZBE4
😎Paper arxiv.org/pdf/1910.11792.pdf
😎Project jrdb.erc.monash.edu/
👍8💯4