AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺ° EasyMocap: Open Neural Mocap πŸͺ°

πŸ‘‰EasyMocap: open-source marker-less mocap with novel view synthesis from RGB

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬 (of last paper added):
βœ…Editable free-viewpoint video
βœ…Layered neural representation of humans
βœ…Multi-pax -> instances, weakly-supervised
βœ…HQ neural representation of the humans
βœ…Addressing camera error by human poses

More: https://bit.ly/3p6lUDO
🀯6πŸ‘3πŸ‘3❀2
This media is not supported in your browser
VIEW IN TELEGRAM
🎰 Texturify: Neural Textures Generator 🎰

πŸ‘‰A step towards automated content creation. HQ textures directly on surface of 3D object

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…TUM + Max Planck + Apple 🍏
βœ…Realistic, HQ textures from 2D pics
βœ…3D shape geometry, no 3D supervision
βœ…3D-aware surface-based generation net

More: https://bit.ly/3BW7UUU
πŸ‘8
This media is not supported in your browser
VIEW IN TELEGRAM
🍨 Scaling Neural Indoor Scene 🍨

πŸ‘‰Neural scene rendering for indoor: scalable in both training/rendering

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Neural scene rendering for indoor
βœ…#3D into tiles with MLPs to scale up
βœ…Parallel training of tile-based MLPs
βœ…View-indep. components (via surf-MLP)

More: https://bit.ly/3bH94IX
πŸ”₯2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯Stable Diffusion on clips. INSANEπŸ”₯

πŸ‘‰The most advanced latent text-to-image DM. #RunwayML just announced is going to apply it on clips

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Latent DM on 512p from LAION-5B
βœ…Frozen CLIP ViT-L/14 text encoder
βœ…Lightweight, runs on a 10GB-GPU
βœ…Checkpoints only for research

More: https://bit.ly/3QfkRx3
🀯13😱12πŸ‘2πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
🐍 Implicitron: "democratizing" NeRF🐍

πŸ‘‰#META opens a novel framework for NeRF-world in #PyTorch3D #pytorch

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Implicit representations (NeRF) / Render
βœ…RaySampler/PointSampler & more
βœ…NeRF’s MLP, IDR’s FF, SRN, etc.
βœ…Renderers: MEAR, LSTMRenderer, etc.

More: https://bit.ly/3bPyJPJ
πŸ”₯4🀯2
This media is not supported in your browser
VIEW IN TELEGRAM
🧰 FGT: flow-guided inpainting 🧰

πŸ‘‰#Microsoft (+USTC) unveils FGT: flow-guided ViT for video inpainting 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…OF into transformer for attention++
βœ…Flow completion net w/ local feats.
βœ…Dual perspective spatial MHSA
βœ…Local attention with global content

More: https://bit.ly/3pk5J5S
❀11πŸ‘5
This media is not supported in your browser
VIEW IN TELEGRAM
🍏NeuMan: Human NeRF in the wild🍏

πŸ‘‰#Apple opens a novel human pose/view from just a single in-the-wild video

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…No extra devices/annotations
βœ…Both Human (novel poses) + Scene
βœ…E2E SMPL optimization + error-corr.
βœ…Applications such as "telegathering"

More: https://bit.ly/3K4iTO6
πŸ‘15
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯‘ CLIP-based Neural Style Transfer πŸ₯‘

πŸ‘‰From #Nvidia a novel method for transferring the style to a #3D object

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Texture style for 3D by CLIP-ResNet50
βœ…Nearest-neighbor feature matching loss
βœ…CLIP-based loss extraction of textures
βœ…NNFM for multiple style pics / control
βœ…No source code or models available πŸ˜’

More: https://bit.ly/3c32dK5
🀯12πŸ”₯5❀4πŸ‘2😱2😁1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ KeypointNeRF: code is out! πŸ”₯

πŸ‘‰KeypointNeRF by #Meta: "NeRF"-avatars

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Generalizable NeRF for virtual avatar
βœ…Sparse 3D keypoints for SOTA avatar
βœ…Novel unseen subjects from 2/3 views
βœ…"iPhone" captures for #metaverse

More: https://bit.ly/3pyl17e
πŸ”₯8πŸ‘3πŸ‘Ž1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯­Massive GTA-V human datasetπŸ₯­

πŸ‘‰GTA-Human: outperforming SOTA with a purely synthetic training.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…600+ gender, age, ethnicity & clothing
βœ…20,000+ clips, variety of human activities
βœ…6 categories of location, different BGs
βœ…Occlusions, lighting, and weather system

More: https://bit.ly/3wpZyRD
πŸ”₯14❀2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🍈DeepBillboards: old-school trick for #VR🍈

πŸ‘‰DeepBillboards models a 3D object implicitly using neural net on the user’s viewing direction

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…#Google Brain +Tsukuba + Tokyo
βœ…Rendering at higher res., improving #VR
βœ…NeRF into interactive VR with accuracy++
βœ…NeRF (or any others) directly in #Unity

More: https://bit.ly/3CsTQ5y
πŸ‘6πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🌐RelPose: Probabilistic Relative Pose🌐

πŸ‘‰A novel method for core component in #SLAM / NeRF-powered apps.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Core component of SfM/SLAM
βœ…Pre-processing for neural (NeRF)
βœ…Energy-based over rotations
βœ…SOTA on both seen/unseen objects

More: https://bit.ly/3T60TXw
πŸ”₯12πŸ‘2πŸ‘2❀1
This media is not supported in your browser
VIEW IN TELEGRAM
🍈 #StableDiffusion archive is out🍈

πŸ‘‰Lexica art is a Stable Diffusion prompt search engine. Real-time, countless #stablediffusion results for everyone. I had fun with the GOAT, #Maradona.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Maradona scoring against a capybara...
βœ…A poster of space jam with Maradona...
βœ…Painting of Maradona very detailed...
βœ…Painting of Maradona in heaven...

More: https://bit.ly/3PTXHLH
❀9πŸ‘5
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ¦‰PANDORA: Polarized Neural DecompositionπŸ¦‰

πŸ‘‰CIL lab unveils PANDORA: polarimetric inverse rendering approach via INR

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Geometry, reflectance & illumination
βœ…normal, signed distance field, mesh
βœ…Diffuse-specular separation
βœ…Hi-fI incident illumination

More https://bit.ly/3CzGp3F
πŸ‘3πŸ”₯3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯IDOL (#CVPR2022 winner): code is out!πŸ”₯

πŸ‘‰IDOL for VIS: outperforming all online/offline methods, the new SOTA!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Online usually inferior by >10AP
βœ…Online based on contrast-learning
βœ…Discriminative++ instance embeddings
βœ…Full exploiting history for stability

More https://bit.ly/3dXCDXw
🀯16πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ #AIwithPapers: we are 4,000+! πŸ”₯

πŸ’™πŸ’›Lot of people joined, and we talked about #StableDiffusion only twice! Can't believe it.πŸ’™πŸ’›

😈 Invite your friends -> https://t.me/AI_DeepLearning
πŸ”₯10
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”΅ Deep Saliency: driving the attention πŸ”΅

πŸ‘‰Google unveils a family of operators to "drive" human saliency

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Editing image to drive Saliency
βœ…Transforms to hide distractors
βœ…Warping operator for distractor
βœ…GAN-op for less-saliency altern.

More: https://bit.ly/3KoQQc2
πŸ‘9🀩4