AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ“Ί NeRF-ing "The Big Bang Theory" πŸ“Ί

πŸ‘‰Berkeley unveils an approach for accurate estimation of actor’s 3D pose & location

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Input: images across the whole season
βœ…3D context (i.e. cams, structure, body)
βœ…Integrating context in 3D estimation
βœ…Re-ID, gaze, cinematography, pic editing
βœ…Knock, Knock, Penny!

More: https://bit.ly/3OLuaUb
πŸ”₯7🀯5πŸ₯°2❀1
This media is not supported in your browser
VIEW IN TELEGRAM
🎩ShAPO: SOTA in object understanding🎩

πŸ‘‰Joint multi-object detection, #3D texture, 6D object pose & size estimation.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Disentangled shape & appearance
βœ…Efficient octree-based differentiable
βœ…Object-centric understanding pipeline
βœ…Detection, reconstruction , 6D & size
βœ…SOTA in reconstruction & pose est.

More: https://bit.ly/3oHN5EQ
πŸ‘7🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ™οΈ CityNeRF: Neural Rendering of City Scenes πŸ™οΈ

πŸ‘‰Progressive NeRF model and training set on city-scenes

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…BungeeNeRF: novel progressive NeRF
βœ…Details on drastically varied scales
βœ…Growing with residual block structure
βœ…Inclusive multi-level data supervision

More: https://bit.ly/3cS9vk7
πŸ₯°7πŸ‘3🀯3😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🍦🍦 Rewriting Geometry of GAN 🍦🍦

πŸ‘‰Drive GAN synthesizing many unseen objects with the desired shape

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…User-friendly "warping" with geometry
βœ…Low-rank update to layer for editing
βœ…Latent augmentation based on style-mix
βœ…Endless objects with defined changes
βœ…Latent space interpolation, image editing

More: https://bit.ly/3zIfOj8
πŸ‘8😱7😁3πŸ‘Ž2❀1πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
🍏🍏 GAUDI: the Neural Architect 🍏🍏

πŸ‘‰Novel generative model for immersive 3D scenes from a moving camera

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Hundreds of thousands pics/scenes
βœ…Novel denoising optimization objective
βœ…New SOTA across multiple datasets
βœ…Un/conditional on images/text

More: https://bit.ly/3Bt65ye
πŸ”₯6
This media is not supported in your browser
VIEW IN TELEGRAM
🚜NeDDF: the NeRF evolution!🚜

πŸ‘‰Novel 3D representation that reciprocally constrains distance & density fields

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…NeRF provides no distance
βœ…Extending for arbitrary density
βœ…Density via dist-field & gradient
βœ…Alleviating the instability

More: https://bit.ly/3Bte8LC
πŸ‘7
Media is too big
VIEW IN TELEGRAM
πŸ”₯AND/OR: Composable Diffusion ModelsπŸ”₯

πŸ‘‰Novel neural compositional generation via Composable Diffusion Models

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…DM as energy-based models
βœ…Connecting diffusion models
βœ…Conjunction & negation, on top of DM
βœ…Zero-shot combinatorial generalization

More: https://bit.ly/3PYv1Cs
🀯5πŸ‘3❀2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ MobileNeRF is out -> Pure Fire! πŸ”₯

πŸ‘‰MobileNeRF is out: the mobile evolution of NeRF via textured polygons.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Same quality, 10x faster than SNeRG
βœ…Memory-- by storing surface textures
βœ…Integrated GPUs: less memory/power
βœ…Suitable for browser & viewer is HTML

More: https://bit.ly/3PUKPWy
πŸ”₯25πŸ‘5
This media is not supported in your browser
VIEW IN TELEGRAM
🧣NeRF for Outdoor Scene Relighting🧣

πŸ‘‰NeRF-OSR: the first neural radiance fields approach for outdoor scene relighting

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…NeRF-method for outdoor relighting
βœ…Simultaneous illumination/viewpoint
βœ…Control over shading, shadow, albedo
βœ…Self-Supervised training from outdoor
βœ…Dataset: 3240 viewpoints, 110+ times

More: https://bit.ly/3vBiH2G
πŸ”₯5πŸ‘3❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘©β€πŸ¦° Real-Time Neural Hair πŸ‘©β€πŸ¦°

πŸ‘‰Accurate hair geometry & appearance from multi-pics

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Bonn, CMU and Reality Labs
βœ…Photorealistic Real-Time render
βœ…HQ strand geometry/appearance
βœ…Novel scalp texture description
βœ…Intuitive manipulation of 3D hair

More: https://bit.ly/3vBiH2G
❀8πŸ‘6
This media is not supported in your browser
VIEW IN TELEGRAM
πŸš€ #VR by NASA - 1985 πŸš€

πŸ‘‰Q: is #VR the technology that developed least in the last 40 years? πŸ€”

Let's talk: https://bit.ly/3JxDZ7i
🀯7🀩2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ MinVIS, a new SOTA is out πŸ”₯

πŸ‘‰#Nvidia miniVIS: no video-based architectures nor training procedures🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Video architecture/train not required
βœ…MinVIS outperforms the previous SOTA
βœ…Occluded VIS (OVIS): >10% improvement
βœ…1% of labeled frames >> fully-supervised

More: https://bit.ly/3pcYzk1
πŸ”₯12
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯πŸ”₯MultiNeRF: three NeRFs are out!πŸ”₯πŸ”₯

πŸ‘‰Google opens the code of three #cvpr2022 papers: Mip-NeRF 360, Ref-NeRF, RawNeRF

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Paper_1: Mip-NeRF 360
βœ…Paper_2: Ref-NeRF
βœ…Paper_3: NeRF in the Dark

More: https://bit.ly/3QjpRRc
πŸ‘13❀4🀯4
This media is not supported in your browser
VIEW IN TELEGRAM
β˜€οΈLocoProp: Neural Layers Compositionβ˜€οΈ

πŸ‘‰Google AI unveils LocoProp: novel neural paradigm for modular composition of layers.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Backprop++ via Local Loss Optimization
βœ…Layer-based w-reg, target output, loss
βœ…Multiple local update via first-order opt.
βœ…Superior performance and efficiency

More: https://bit.ly/3Q40YJn
πŸ”₯13
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯PCVOS: clip-wise mask VOSπŸ”₯

πŸ‘‰PCVOS: new semi-supervised video object segmentation method

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Reformulating semi-supervised VOS
βœ…Novel per-clip inference perspective
βœ…Clip-wise operation on intra-clip
βœ…PCVOS: model for per-clip inference
βœ…New SOTA on multiple benchmarks

More: https://bit.ly/3vJtmbz
πŸ‘10😁2❀1🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘ World-Object Detection via ViT πŸ‘

πŸ‘‰Google unveils OWL-ViT: open-vocabulary detector based on ViTs 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…ViTs for Open-World Localization
βœ…Img-level to open-vocabulary detection
βœ…SOTA one-shot (img.cond.) detection

More: https://bit.ly/3Sy3jOj
🀯12πŸ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
🎹🎹 Learning Piano in #AR 🎹🎹

πŸ‘‰PianoVision (on #META #Quest2) accelerates the piano learning via Passthrough #AR & hand tracking

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Sheet Insight to learn sight-read
βœ…MIDI keyboard connectivity
βœ…Air piano for no physical pianos
βœ…Multiplayer Music Instruction
βœ…PianoVision Music Hall in #VR

More: https://bit.ly/3zYvwGX
❀15🀯6πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🧊EPro-PnP: Persp-n-Points Detection🧊

πŸ‘‰EPro-PnP: probabilistic PnP layer for general e2e pose estimation

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Probabilistic PnP for general e2e pose
βœ…Top-tier in 6DoF by inserting into CDPN
βœ…Deformable accurate detection
βœ…2D-3D corresp. learned from scratch

More: https://bit.ly/3BNPXYr
πŸ‘11
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯‡#NVIDIA wins SIGGRAPH's Best PaperπŸ₯‡

πŸ‘‰Instant #NeRF awarded as a best paper at SIGGRAPH 2022!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Speed-up of several orders of magnitude
βœ…HQ neural primitives in a matter of secs
βœ…Render in tens of milliseconds at 1080p
βœ…Source code and resources available!

More: https://bit.ly/3Qt8c9D
πŸ‘16πŸ”₯6❀3πŸ‘1