AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
πŸŽƒ In-N-Out: 3D-aware OOD video editing πŸŽƒ

πŸ‘‰Novel 3D-aware video editing able to manipulate OOD objects (e.g. heavy makeup, accessories)

😎Review https://bit.ly/3jN0CMu
😎Paper arxiv.org/pdf/2302.04871.pdf
😎Project https://in-n-out-3d.github.io
πŸ”₯4❀2🀯2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯Έ MEGANE: Generative Morphable Eyeglass πŸ₯Έ

πŸ‘‰#META unveils the most advanced #3D compositional morphable AI for eyeglasses (HD geometry/photometric interaction)

😎Review https://bit.ly/3jOWifu
😎Paper arxiv.org/pdf/2302.04868.pdf
😎Project junxuan-li.github.io/megane
πŸ”₯9🀯3πŸ‘2🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ’˜ 3D-aware Blending with NeRF πŸ’˜

πŸ‘‰Novel 3D-aware blending method via generative NeRFs

😎Review https://bit.ly/3lBEJA2
😎Paper arxiv.org/pdf/2302.06608.pdf
😎Project blandocs.github.io/blendnerf
😎Code github.com/naver-ai/BlendNeRF
❀8
This media is not supported in your browser
VIEW IN TELEGRAM
πŸŒ… Semantics-guided natural synthesis πŸŒ…

πŸ‘‰Alibaba #AI unveils a novel semantics-guided synthesis of natural scenes

😎Review https://bit.ly/4115MVJ
😎Paper arxiv.org/pdf/2302.07224.pdf
😎Project zju3dv.github.io/paintingnature
πŸ‘5πŸ”₯1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🦞 SOTA ALERT: YOWOv2 is out! 🦞

πŸ‘‰ The 2nd-gen of YOWO, real-time detection of spatio-temporal actions

😎Review https://bit.ly/3IscY60
😎Paper arxiv.org/pdf/2302.06848v1.pdf
😎Code github.com/yjh0410/YOWOv2
πŸ”₯17πŸ‘2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ“¬ DIVOTrack: crossview MOT dataset πŸ“¬

πŸ‘‰ DIVOTrack + CrossMOT: the ultimate solution for MOT in realistic scenario

😎Review https://bit.ly/3YSFZgL
😎Paper arxiv.org/pdf/2302.07676.pdf
😎Code github.com/shengyuhao/DIVOTrack
πŸ”₯6πŸ‘2🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🦩 One-Shot Face via LSs of StyleGAN2 🦩

πŸ‘‰ Novel video generation framework with edits, facial motions, deformations & identity

😎Review https://bit.ly/3xuChhF
😎Paper arxiv.org/pdf/2302.07848.pdf
😎Project trevineoorloff.github.io/FaceVideoReenactment_HybridLatents.io/
🀯3😱2⚑1πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🌢️ 3D-aware conditional generative AI 🌢️

πŸ‘‰ Pix2Pix3D: 3D-aware conditional generative AI for controllable photorealistic synthesis

😎Review https://bit.ly/3I80MWS
😎Paper arxiv.org/pdf/2302.08509.pdf
😎Project www.cs.cmu.edu/~pix2pix3D
😎Code github.com/dunbar12138/pix2pix3D
πŸ”₯4πŸ‘2⚑1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ›‘οΈ TPV: Tesla's O-Net competitor πŸ›‘οΈ

πŸ‘‰From Beijing an open-source approach for vision-centric autonomous driving #3D perception

😎Review https://bit.ly/3lNvVYc
😎Paper arxiv.org/pdf/2302.07817.pdf
😎Code github.com/wzzheng/TPVFormer
πŸ‘7πŸ”₯3🀯3😱1
πŸ€ #NBA Mixed Reality is NUTS πŸ€

πŸ‘‰The premiere of the streaming app of the #NBA is totally INSANE. A mix of #AI, CG and much moreπŸ‘‡

πŸ€More: https://bit.ly/3IJ3uUp
🀯10πŸ‘5❀1😱1🀩1πŸ’©1
This media is not supported in your browser
VIEW IN TELEGRAM
🫳 Neural Relighting of Hands 🫴

πŸ‘‰#META unveil the first neural relighting for personalized hands in real-time under novel illumination

😎Review https://bit.ly/3SblmKC
😎Paper arxiv.org/pdf/2302.04866.pdf
😎Project sh8.io/#/relightable_hands
πŸ₯°4πŸ‘3😱3πŸ”₯2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺ VoxFormer: 2D->#3D Voxel ViTπŸͺ

πŸ‘‰#Nvidia VoxFormer: #3D volumetric semantics from 2D images

😎Review https://bit.ly/3Kw9Yab
😎Paper arxiv.org/pdf/2302.12251.pdf
😎Code github.com/NVlabs/VoxFormer
πŸ”₯11🀯3πŸ‘2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺžDisCO: Selfie Correction with 3D-GANπŸͺž

πŸ‘‰Snap (et al.) unveils a GAN-based method for correcting distortions in close-up faces

😎Review https://bit.ly/3StGGuX
😎Paper arxiv.org/pdf/2302.12253.pdf
😎Project https://portrait-disco.github.io
πŸ”₯8πŸ₯°3⚑1πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
⚽️ Vid2Avatar: 3D Avatar from Videos ⚽️

πŸ‘‰Vid2Avatar: detailed 3D avatar from monocular videos in the wild

😎Review https://bit.ly/3ISbceD
😎Paper arxiv.org/pdf/2302.11566.pdf
😎Project moygcc.github.io/vid2avatar
😎Code (soon) github.com/MoyGcc
🀯18πŸ‘11πŸ”₯8😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‰ SLAHMR: 4D People from Clip in-the-Wild πŸ‰

πŸ‘‰UC-Berkeley unveils SLAHMR: novel method to reconstruct global human trajectories from videos

😎Review https://bit.ly/3SzTIaj
😎Paper arxiv.org/pdf/2302.12827.pdf
😎Project vye16.github.io/slahmr/
😎Code github.com/vye16/slahmr
πŸ‘10πŸ”₯8❀2
πŸ‡ SplineCam: Neural Decision Boundary πŸ‡

πŸ‘‰#META -> SplineCam: a step towards neural visualization / interpretability

😎Review https://bit.ly/3mgoOaH
😎Paper arxiv.org/pdf/2302.12828.pdf
😎Project imtiazhumayun.github.io/splinecam
😎Code github.com/AhmedImtiazPrio/SplineCAM
🀯8πŸ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘‘ ControNet: Conditional Control of Diffusion πŸ‘‘

πŸ‘‰Controlling Stable Diffusion via conditional inputs like edges, segmentation, keypoints, etc. Extra: a super-nice tutorial.

😎Review https://bit.ly/3YgjrWt
😎Paper arxiv.org/pdf/2302.05543.pdf
😎Code github.com/lllyasviel/ControlNet
😎Tutorial https://github.com/Mikubill/sd-webui-controlnet/discussions/204
🀯15πŸ‘8πŸ”₯3❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ›Έ TAU: video traffic analytics via UAVs πŸ›Έ

πŸ‘‰ Prince Sultan University unveils TAU: AI-integrated video analytics framework from UAVs' POV

😎Review https://bit.ly/3EQIh8F
😎Paper arxiv.org/pdf/2303.00337.pdf
😎Project github.com/bilel-bj/TAU
πŸ”₯10πŸ‘3πŸ₯°1🍾1
This media is not supported in your browser
VIEW IN TELEGRAM
🩻 Independent Tokens for 3D Human 🩻

πŸ‘‰Tencent open-sourcing a novel method to estimate #3D human pose and shape from monocular videos

😎Review https://bit.ly/3Zz0uiH
😎Paper arxiv.org/pdf/2303.00298.pdf
😎Code github.com/yangsenius/INT_HMR_Model
😎Project yangsenius.github.io/INT_HMR_Model/index.html
πŸ”₯5πŸ‘1😒1
This media is not supported in your browser
VIEW IN TELEGRAM
🌸 3DGP: ImageNet in #3D 🌸

πŸ‘‰ Snap unveils 3DGP: a novel 3D generator with Generic Priors

😎Review https://bit.ly/3KWHUgG
😎Paper arxiv.org/pdf/2303.01416.pdf
😎Project snap-research.github.io/3dgp/
😎Code github.com/snap-research/3dgp
πŸ”₯8⚑1πŸ‘1
Media is too big
VIEW IN TELEGRAM
πŸ—ΊοΈ S-NeRF: NeRF for Street Views πŸ—ΊοΈ

πŸ‘‰S-NeRF: novel view synthesis of streets & foreground moving vehicles jointly

😎Review https://bit.ly/3KZUN9w
😎Paper arxiv.org/pdf/2303.00749.pdf
😎Project ziyang-xie.github.io/s-nerf/
😎Code (soon)
πŸ‘9πŸ”₯3🀯1