This media is not supported in your browser
VIEW IN TELEGRAM
🍋 Long Video via Transformers 🍋
👉TECO is a vector-quantized latent dynamics prediction for long video
😎Review https://bit.ly/3Ch0tWD
😎Project wilson1yan.github.io/teco/
😎Paper arxiv.org/pdf/2210.02396.pdf
😎Code github.com/wilson1yan/teco
👉TECO is a vector-quantized latent dynamics prediction for long video
😎Review https://bit.ly/3Ch0tWD
😎Project wilson1yan.github.io/teco/
😎Paper arxiv.org/pdf/2210.02396.pdf
😎Code github.com/wilson1yan/teco
👏7
This media is not supported in your browser
VIEW IN TELEGRAM
🔥SIMPLI: ligh novel-view synthesis🔥
👉Lightweight novel-view synthesis by #Samsung for arbitrary forward-facing scenes
😎Review https://bit.ly/3CivSYZ
😎Project samsunglabs.github.io/MLI
😎Code github.com/SamsungLabs/MLI
😎Paper samsunglabs.github.io/MLI/paper/paper.pdf
👉Lightweight novel-view synthesis by #Samsung for arbitrary forward-facing scenes
😎Review https://bit.ly/3CivSYZ
😎Project samsunglabs.github.io/MLI
😎Code github.com/SamsungLabs/MLI
😎Paper samsunglabs.github.io/MLI/paper/paper.pdf
👍8
This media is not supported in your browser
VIEW IN TELEGRAM
🥏 EVA3D: new SOTA in #3D humans 🥏
👉EVA3D: new SOTA for unconditional NeRF-human generation from 2D only
😎Review https://bit.ly/3Th9qX7
😎Code github.com/hongfz16/EVA3D
😎Paper arxiv.org/pdf/2210.04888.pdf
😎Project hongfz16.github.io/projects/EVA3D.html
👉EVA3D: new SOTA for unconditional NeRF-human generation from 2D only
😎Review https://bit.ly/3Th9qX7
😎Code github.com/hongfz16/EVA3D
😎Paper arxiv.org/pdf/2210.04888.pdf
😎Project hongfz16.github.io/projects/EVA3D.html
🔥14👍2
This media is not supported in your browser
VIEW IN TELEGRAM
🍏 f-DM: Diffusion Models by Apple 🍏
👉Spectacular work by #Apple on DMs: HQ generation with better efficiency and semantic
😎Review https://bit.ly/3Tils2u
😎Project https://jiataogu.me/fdm/
😎Paper arxiv.org/pdf/2210.04955.pdf
👉Spectacular work by #Apple on DMs: HQ generation with better efficiency and semantic
😎Review https://bit.ly/3Tils2u
😎Project https://jiataogu.me/fdm/
😎Paper arxiv.org/pdf/2210.04955.pdf
❤10😱2👍1
This media is not supported in your browser
VIEW IN TELEGRAM
🏅GENIE by #Nvidia -> Faster Generation🏅
👉Higher-Order Denoising Diffusion Solvers for faster and better synthesis
😎Review https://bit.ly/3CRjtwr
😎Project nv-tlabs.github.io/GENIE/
😎Paper arxiv.org/pdf/2210.05475.pdf
😎Code github.com/nv-tlabs/GENIE
👉Higher-Order Denoising Diffusion Solvers for faster and better synthesis
😎Review https://bit.ly/3CRjtwr
😎Project nv-tlabs.github.io/GENIE/
😎Paper arxiv.org/pdf/2210.05475.pdf
😎Code github.com/nv-tlabs/GENIE
🔥10👍4
This media is not supported in your browser
VIEW IN TELEGRAM
🥬 "Perception Test" by #DeepMind 🥬
👉Huge dataset with obj & point tracks, temporal sounds, multiple & grounded vQA
😎Review https://bit.ly/3Vqh96Q
😎Dataset github.com/deepmind/perception_test
😎Project www.deepmind.com/blog/measuring-perception-in-ai-models
👉Huge dataset with obj & point tracks, temporal sounds, multiple & grounded vQA
😎Review https://bit.ly/3Vqh96Q
😎Dataset github.com/deepmind/perception_test
😎Project www.deepmind.com/blog/measuring-perception-in-ai-models
👍15🔥4😱3
This media is not supported in your browser
VIEW IN TELEGRAM
🔥 Matterport 3D Semantics Dataset 🔥
👉#Meta opens HM3DSEM, the largest #3D real-world dataset with dense semantic
😎Review https://bit.ly/3yF4W4G
😎Paper arxiv.org/pdf/2210.05633.pdf
😎Project aihabitat.org/datasets/hm3d-semantics
😎Data github.com/matterport/habitat-matterport-3dresearch
👉#Meta opens HM3DSEM, the largest #3D real-world dataset with dense semantic
😎Review https://bit.ly/3yF4W4G
😎Paper arxiv.org/pdf/2210.05633.pdf
😎Project aihabitat.org/datasets/hm3d-semantics
😎Data github.com/matterport/habitat-matterport-3dresearch
👍13
This media is not supported in your browser
VIEW IN TELEGRAM
🦑 Instant Map-free Relocalization 🦑
👉#Niantic unveils a novel instant, metric scaled re-localization with one single photo
😎Review https://bit.ly/3S1Gdyh
😎Paper arxiv.org/pdf/2210.05494.pdf
😎Project research.nianticlabs.com/mapfree-reloc-benchmark
😎Data research.nianticlabs.com/mapfree-reloc-benchmark/dataset
👉#Niantic unveils a novel instant, metric scaled re-localization with one single photo
😎Review https://bit.ly/3S1Gdyh
😎Paper arxiv.org/pdf/2210.05494.pdf
😎Project research.nianticlabs.com/mapfree-reloc-benchmark
😎Data research.nianticlabs.com/mapfree-reloc-benchmark/dataset
🔥13👍2
This media is not supported in your browser
VIEW IN TELEGRAM
🧮 Novel DM for 3D Shapes by #Nvidia 🧮
👉Hierarchical Latent Point Diffusion Model (LION) for 3D shape generation
😎Review https://bit.ly/3yDhZ6I
😎Paper arxiv.org/pdf/2210.06978.pdf
😎Project https://nv-tlabs.github.io/LION/
😎Code(soon) github.com/nv-tlabs/LION
👉Hierarchical Latent Point Diffusion Model (LION) for 3D shape generation
😎Review https://bit.ly/3yDhZ6I
😎Paper arxiv.org/pdf/2210.06978.pdf
😎Project https://nv-tlabs.github.io/LION/
😎Code(soon) github.com/nv-tlabs/LION
❤11😱2🔥1
This media is not supported in your browser
VIEW IN TELEGRAM
🪲#6D estimation fully in the wild🪲
👉First ever self-supervised 6D pose estimation training in the wild
😎Review https://bit.ly/3yHdHuS
😎Paper arxiv.org/pdf/2210.07199.pdf
😎Project kywind.github.io/self-pose
😎Code (soon)
👉First ever self-supervised 6D pose estimation training in the wild
😎Review https://bit.ly/3yHdHuS
😎Paper arxiv.org/pdf/2210.07199.pdf
😎Project kywind.github.io/self-pose
😎Code (soon)
👍15🤯8😱4
This media is not supported in your browser
VIEW IN TELEGRAM
⛽ Stable Diffusion in #Blender ⛽
👉Render with SuperPowers: novel scene render via text prompt
😎Review https://bit.ly/3s1mEeN
😎Code github.com/benrugg/AI-Render
👉Render with SuperPowers: novel scene render via text prompt
😎Review https://bit.ly/3s1mEeN
😎Code github.com/benrugg/AI-Render
🤯8👍5❤2
This media is not supported in your browser
VIEW IN TELEGRAM
⚽Markerless Body-Object Interaction⚽
👉Novel whole-bodies/objects interaction method from multi-view RGB-D data
😎Review https://bit.ly/3yO56GY
😎Data intercap.is.tue.mpg.de/login.php
😎Project https://intercap.is.tue.mpg.de
😎Code github.com/YinghaoHuang91
😎Paper intercap.is.tue.mpg.de/media/upload/main.pdf
👉Novel whole-bodies/objects interaction method from multi-view RGB-D data
😎Review https://bit.ly/3yO56GY
😎Data intercap.is.tue.mpg.de/login.php
😎Project https://intercap.is.tue.mpg.de
😎Code github.com/YinghaoHuang91
😎Paper intercap.is.tue.mpg.de/media/upload/main.pdf
🔥6👍2🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🔥 Dressing Avatars by #META 🔥
👉Novel deep photorealistic appearance method for physically-simulated clothing in #metaverse
😎Review https://bit.ly/3yRBW9Y
😎Paper arxiv.org/pdf/2206.15470.pdf
👉Novel deep photorealistic appearance method for physically-simulated clothing in #metaverse
😎Review https://bit.ly/3yRBW9Y
😎Paper arxiv.org/pdf/2206.15470.pdf
🤯7👍5🍾2❤1
AI with Papers - Artificial Intelligence & Deep Learning
🎷🎷OMNI3D: #3D Objects in the Wild🎷🎷 👉#3D detection: 234k images, 3M+ instances & 97 categories 𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬: ✅OMNI3D from publicly released dataset ✅234k pics, 3M+ annotation with 3D box ✅97 categories such as sofa, table, cars ✅Fast (450x) and exact algorithm…
This media is not supported in your browser
VIEW IN TELEGRAM
🔥Meta Omni3D: code is out!🔥
👉Source Code, models, and data just released by #META !
😎Review https://bit.ly/3MIWxD9
😎Paper arxiv.org/pdf/2207.10660.pdf
😎Project garrickbrazil.com/omni3d/
😎Code github.com/facebookresearch/omni3d
👉Source Code, models, and data just released by #META !
😎Review https://bit.ly/3MIWxD9
😎Paper arxiv.org/pdf/2207.10660.pdf
😎Project garrickbrazil.com/omni3d/
😎Code github.com/facebookresearch/omni3d
🔥14👍2😍1
This media is not supported in your browser
VIEW IN TELEGRAM
🪂 Parallel NeRF for 6-DoF pose 🪂
👉#Nvidia unveils a parallel NeRF for 6-DoF target pose estimation
😎Review https://bit.ly/3guWWwA
😎Paper arxiv.org/pdf/2210.10108.pdf
😎Project https://pnerfp.github.io/
👉#Nvidia unveils a parallel NeRF for 6-DoF target pose estimation
😎Review https://bit.ly/3guWWwA
😎Paper arxiv.org/pdf/2210.10108.pdf
😎Project https://pnerfp.github.io/
👍8🔥3
This media is not supported in your browser
VIEW IN TELEGRAM
🦙LaMAR: Localization/Mapping for #AR🦙
👉A new benchmark for #AR in large and unconstrained scenes
😎Review https://bit.ly/3DjlnWU
😎Paper lamar.ethz.ch/files/LaMAR.pdf
😎Project https://lamar.ethz.ch/
😎Code github.com/microsoft/lamar-benchmark
👉A new benchmark for #AR in large and unconstrained scenes
😎Review https://bit.ly/3DjlnWU
😎Paper lamar.ethz.ch/files/LaMAR.pdf
😎Project https://lamar.ethz.ch/
😎Code github.com/microsoft/lamar-benchmark
👍7🔥4💯4
This media is not supported in your browser
VIEW IN TELEGRAM
🔥New SOTA in Panoptic Segmentation🔥
👉#Google (with Hinton🤯) unveils Pix2Seq-D: novel generalist framework for panoptic segmentation
😎Review https://bit.ly/3DmpbGM
😎Paper arxiv.org/pdf/2210.06366.pdf
👉#Google (with Hinton🤯) unveils Pix2Seq-D: novel generalist framework for panoptic segmentation
😎Review https://bit.ly/3DmpbGM
😎Paper arxiv.org/pdf/2210.06366.pdf
🔥9👍5🤯3
This media is not supported in your browser
VIEW IN TELEGRAM
🎨 UniColor: Unified Colorization 🎨
👉The first unified framework for colorization via stroke, exemplar, text, and a mix of them
😎Review https://bit.ly/3gESR9y
😎Paper arxiv.org/pdf/2209.11223.pdf
😎Project luckyhzt.github.io/unicolor
😎Code (SOON)
👉The first unified framework for colorization via stroke, exemplar, text, and a mix of them
😎Review https://bit.ly/3gESR9y
😎Paper arxiv.org/pdf/2209.11223.pdf
😎Project luckyhzt.github.io/unicolor
😎Code (SOON)
🤯18🔥6👍1
This media is not supported in your browser
VIEW IN TELEGRAM
🤯 Full-Body from head/hand signals 🤯
👉#Meta unveils AvatarPoser: first full-body pose method via user’s head/hands
😎Review https://bit.ly/3gESR9y
😎Paper arxiv.org/pdf/2207.13784.pdf
😎Code github.com/eth-siplab/AvatarPoser
👉#Meta unveils AvatarPoser: first full-body pose method via user’s head/hands
😎Review https://bit.ly/3gESR9y
😎Paper arxiv.org/pdf/2207.13784.pdf
😎Code github.com/eth-siplab/AvatarPoser
👍9👏3❤1
This media is not supported in your browser
VIEW IN TELEGRAM
🤖JRBD: Egocentric Perception of Humans🤖
👉Stanford -> JRDB-Pose: Dataset with 600,000+ body pose annotations!
😎Review https://bit.ly/3gEZBE4
😎Paper arxiv.org/pdf/1910.11792.pdf
😎Project jrdb.erc.monash.edu/
👉Stanford -> JRDB-Pose: Dataset with 600,000+ body pose annotations!
😎Review https://bit.ly/3gEZBE4
😎Paper arxiv.org/pdf/1910.11792.pdf
😎Project jrdb.erc.monash.edu/
👍8💯4