AI with Papers - Artificial Intelligence & Deep Learning
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

👁 World-Object Detection via ViT 👁

👉Google unveils OWL-ViT: open-vocabulary detector based on ViTs 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ViTs for Open-World Localization
✅Img-level to open-vocabulary detection
✅SOTA one-shot (img.cond.) detection

More: https://bit.ly/3Sy3jOj

🎹🎹 Learning Piano in #AR 🎹🎹

👉PianoVision (on #META #Quest2) accelerates piano learning via Passthrough #AR & hand tracking

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Sheet Insight to learn sight-reading
✅MIDI keyboard connectivity
✅Air piano when no physical piano is available
✅Multiplayer Music Instruction
✅PianoVision Music Hall in #VR

More: https://bit.ly/3zYvwGX

🧊EPro-PnP: Persp-n-Points Detection🧊

👉EPro-PnP: probabilistic PnP layer for general e2e pose estimation

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Probabilistic PnP for general e2e pose
✅Top-tier in 6DoF by inserting into CDPN
✅Deformable accurate detection
✅2D-3D corresp. learned from scratch

More: https://bit.ly/3BNPXYr

🥇#NVIDIA wins SIGGRAPH's Best Paper🥇

👉Instant #NeRF wins a Best Paper award at SIGGRAPH 2022!

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Speed-up of several orders of magnitude
✅HQ neural primitives in a matter of secs
✅Render in tens of milliseconds at 1080p
✅Source code and resources available!

More: https://bit.ly/3Qt8c9D

🪰 EasyMocap: Open Neural Mocap 🪰

👉EasyMocap: open-source marker-less mocap with novel view synthesis from RGB

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬 (of last paper added):
✅Editable free-viewpoint video
✅Layered neural representation of humans
✅Multi-pax -> instances, weakly-supervised
✅HQ neural representation of the humans
✅Addressing camera error by human poses

More: https://bit.ly/3p6lUDO

🎰 Texturify: Neural Textures Generator 🎰

👉A step towards automated content creation. HQ textures directly on the surface of 3D objects

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅TUM + Max Planck + Apple 🍏
✅Realistic, HQ textures from 2D pics
✅3D shape geometry, no 3D supervision
✅3D-aware surface-based generation net

More: https://bit.ly/3BW7UUU

🍨 Scaling Neural Indoor Scene 🍨

👉Neural scene rendering for indoor: scalable in both training/rendering

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Neural scene rendering for indoor
✅#3D into tiles with MLPs to scale up
✅Parallel training of tile-based MLPs
✅View-indep. components (via surf-MLP)

More: https://bit.ly/3bH94IX

🔥Stable Diffusion on clips. INSANE🔥

👉The most advanced latent text-to-image DM. #RunwayML just announced it is going to apply it to clips

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Latent DM on 512p from LAION-5B
✅Frozen CLIP ViT-L/14 text encoder
✅Lightweight, runs on a 10GB-GPU
✅Checkpoints only for research

More: https://bit.ly/3QfkRx3

🐍 Implicitron: "democratizing" NeRF🐍

👉#META open-sources a novel framework for the NeRF world in #PyTorch3D #pytorch

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Implicit representations (NeRF) / Render
✅RaySampler/PointSampler & more
✅NeRF’s MLP, IDR’s FF, SRN, etc.
✅Renderers: MEAR, LSTMRenderer, etc.

More: https://bit.ly/3bPyJPJ

🧰 FGT: flow-guided inpainting 🧰

👉#Microsoft (+USTC) unveils FGT: flow-guided ViT for video inpainting 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅OF into transformer for attention++
✅Flow completion net w/ local feats.
✅Dual perspective spatial MHSA
✅Local attention with global content

More: https://bit.ly/3pk5J5S

🍏NeuMan: Human NeRF in the wild🍏

👉#Apple open-sources novel human pose/view synthesis from just a single in-the-wild video

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅No extra devices/annotations
✅Both Human (novel poses) + Scene
✅E2E SMPL optimization + error-corr.
✅Applications such as "telegathering"

More: https://bit.ly/3K4iTO6

🥑 CLIP-based Neural Style Transfer 🥑

👉From #Nvidia a novel method for transferring the style to a #3D object

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Texture style for 3D by CLIP-ResNet50
✅Nearest-neighbor feature matching loss
✅CLIP-based loss extraction of textures
✅NNFM for multiple style pics / control
✅No source code or models available 😒

More: https://bit.ly/3c32dK5

🔥 KeypointNeRF: code is out! 🔥

👉KeypointNeRF by #Meta: "NeRF"-avatars

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Generalizable NeRF for virtual avatar
✅Sparse 3D keypoints for SOTA avatar
✅Novel unseen subjects from 2/3 views
✅"iPhone" captures for #metaverse

More: https://bit.ly/3pyl17e

🥭Massive GTA-V human dataset🥭

👉GTA-Human: outperforming SOTA with purely synthetic training.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅600+ subjects varying in gender, age, ethnicity & clothing
✅20,000+ clips, variety of human activities
✅6 categories of location, different BGs
✅Occlusions, lighting, and weather system

More: https://bit.ly/3wpZyRD

🍈DeepBillboards: old-school trick for #VR🍈

👉DeepBillboards models a 3D object implicitly using a neural net, based on the user’s viewing direction

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅#Google Brain + Tsukuba + Tokyo
✅Rendering at higher res., improving #VR
✅NeRF into interactive VR with accuracy++
✅NeRF (or any others) directly in #Unity

More: https://bit.ly/3CsTQ5y

🌐RelPose: Probabilistic Relative Pose🌐

👉A novel method for a core component of #SLAM / NeRF-powered apps.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Core component of SfM/SLAM
✅Pre-processing for neural (NeRF)
✅Energy-based model over rotations
✅SOTA on both seen/unseen objects

More: https://bit.ly/3T60TXw

🍈 #StableDiffusion archive is out🍈

👉Lexica art is a Stable Diffusion prompt search engine. Real-time, countless #stablediffusion results for everyone. I had fun with the GOAT, #Maradona.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Maradona scoring against a capybara...
✅A poster of space jam with Maradona...
✅Painting of Maradona very detailed...
✅Painting of Maradona in heaven...

More: https://bit.ly/3PTXHLH

🦉PANDORA: Polarized Neural Decomposition🦉

👉CIL lab unveils PANDORA: polarimetric inverse rendering approach via INR

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Geometry, reflectance & illumination
✅Normal, signed distance field, mesh
✅Diffuse-specular separation
✅Hi-fi incident illumination

More: https://bit.ly/3CzGp3F