AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
🎃New SOTA in UDA Semantic Seg.🎃

👉HRDA: multi-res Unsupervised Domain Adaptive Semantic Seg. -> SOTA

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ETH + MPG + KU Leuven 🤯
✅HRDA: multi-res approach for UDA
✅Manageable GPU memory footprint
✅Small objects & fine segmentation detail
✅New SOTA on GTA and Synthia dataset

More: https://bit.ly/3cKtDEp
⚗️ SemAbs: 3D Scene Understanding ⚗️

👉Framework that equips 2D Vision-Language Models (VLMs) with new 3D spatial capabilities

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅2D VLMs with 3D reasoning skills
✅ViTs Efficient MS Relevancy Extraction
✅Novel Open-World understanding tasks
✅Completing partially observed objects
✅Finding hidden objects from language

More: https://bit.ly/3PYYk7d
🦚 TinyCD: Neural Change Detection 🦚

πŸ‘‰TinyCD: new SOTA in change detection with up to 150x fewer parameters.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…SOTA with up to 150X fewer params
βœ…Mixing blocks for s.t. cross-correlation
βœ…PW-MLP for pixel wise classification
βœ…MAMB: novel block for skip connection

More: https://bit.ly/3zFEngk
❀16πŸ‘2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🦊 3D-Aware "StyleGANv2" version 🦊

πŸ‘‰Upgrading StyleGANv2 into a novel 3D-aware GAN with just a minimal set of changes🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…MPI-like 3D-aware GAN w/ single-view
βœ…GMPI: generative multiplane image
βœ…2D GAN 3D-aware with a minimal changes
βœ…Encoding 3D-aware inductive biases

More: https://bit.ly/3OJ5gnS
📺 NeRF-ing "The Big Bang Theory" 📺

👉Berkeley unveils an approach for accurate estimation of actor’s 3D pose & location

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Input: images across the whole season
✅3D context (i.e. cams, structure, body)
✅Integrating context in 3D estimation
✅Re-ID, gaze, cinematography, pic editing
✅Knock, Knock, Penny!

More: https://bit.ly/3OLuaUb
🎩ShAPO: SOTA in object understanding🎩

πŸ‘‰Joint multi-object detection, #3D texture, 6D object pose & size estimation.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Disentangled shape & appearance
βœ…Efficient octree-based differentiable
βœ…Object-centric understanding pipeline
βœ…Detection, reconstruction , 6D & size
βœ…SOTA in reconstruction & pose est.

More: https://bit.ly/3oHN5EQ
🏙️ CityNeRF: Neural Rendering of City Scenes 🏙️

👉Progressive NeRF model and training set on city-scenes

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅BungeeNeRF: novel progressive NeRF
✅Details on drastically varied scales
✅Growing with residual block structure
✅Inclusive multi-level data supervision

More: https://bit.ly/3cS9vk7
🍦🍦 Rewriting Geometry of GAN 🍦🍦

πŸ‘‰Drive GAN synthesizing many unseen objects with the desired shape

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…User-friendly "warping" with geometry
βœ…Low-rank update to layer for editing
βœ…Latent augmentation based on style-mix
βœ…Endless objects with defined changes
βœ…Latent space interpolation, image editing

More: https://bit.ly/3zIfOj8
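
The "low-rank update to layer" highlight can be pictured as adding a rank-one matrix to a layer's weights, W' = W + u vᵀ. The sketch below is only an illustration in plain Python: the helper name `rank_one_update` and the toy 2x2 weights are assumptions, and the post does not describe how the actual method chooses u and v.

```python
def rank_one_update(weight, u, v):
    """Return weight + outer(u, v) as a new matrix (inputs are not modified).

    weight: list of rows (m x n), u: length-m list, v: length-n list.
    Hypothetical helper for illustration only.
    """
    if len(weight) != len(u) or any(len(row) != len(v) for row in weight):
        raise ValueError("shape mismatch between weight, u and v")
    return [
        [w_ij + u_i * v_j for w_ij, v_j in zip(row, v)]
        for row, u_i in zip(weight, u)
    ]


# Toy example: edit an identity "layer" with a single rank-one term.
W = [[1.0, 0.0], [0.0, 1.0]]
edited = rank_one_update(W, u=[1.0, 2.0], v=[0.5, 0.0])
```

Because the change is a single outer product, only len(u) + len(v) numbers describe the edit, regardless of how large the layer is.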
🍏🍏 GAUDI: the Neural Architect 🍏🍏

πŸ‘‰Novel generative model for immersive 3D scenes from a moving camera

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Hundreds of thousands pics/scenes
βœ…Novel denoising optimization objective
βœ…New SOTA across multiple datasets
βœ…Un/conditional on images/text

More: https://bit.ly/3Bt65ye
🚜NeDDF: the NeRF evolution!🚜

πŸ‘‰Novel 3D representation that reciprocally constrains distance & density fields

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…NeRF provides no distance
βœ…Extending for arbitrary density
βœ…Density via dist-field & gradient
βœ…Alleviating the instability

More: https://bit.ly/3Bte8LC
🔥AND/OR: Composable Diffusion Models🔥

👉Novel neural compositional generation via Composable Diffusion Models

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅DM as energy-based models
✅Connecting diffusion models
✅Conjunction & negation, on top of DM
✅Zero-shot combinatorial generalization

More: https://bit.ly/3PYv1Cs
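
One common way to realize "conjunction & negation on top of DM" is to combine per-concept noise predictions around the unconditional one: a positive weight pulls the sample toward a concept (AND), a negative weight pushes it away (NOT). The sketch below assumes that formulation; the function name `compose_noise`, the flat-list representation, and the toy numbers are illustrative assumptions, not the authors' code.

```python
def compose_noise(eps_uncond, eps_conds, weights):
    """Combine noise predictions: eps_uncond + sum_i w_i * (eps_cond_i - eps_uncond).

    eps_uncond: unconditional noise prediction (flat list of floats)
    eps_conds:  one conditional prediction per concept, same length
    weights:    w > 0 acts as conjunction (AND), w < 0 as negation (NOT)
    Hypothetical helper for illustration only.
    """
    if len(eps_conds) != len(weights):
        raise ValueError("need exactly one weight per concept")
    out = list(eps_uncond)
    for eps_c, w in zip(eps_conds, weights):
        for i, (c, u) in enumerate(zip(eps_c, eps_uncond)):
            out[i] += w * (c - u)
    return out


# Toy predictions standing in for a denoiser's output at one step.
eps_u = [0.0, 1.0]
eps_a = [1.0, 1.0]
eps_b = [0.0, 3.0]
both = compose_noise(eps_u, [eps_a, eps_b], [1.0, 1.0])      # "A AND B"
a_not_b = compose_noise(eps_u, [eps_a, eps_b], [1.0, -1.0])  # "A AND NOT B"
```

In a real sampler this combination would be applied at every denoising step, with the conditional predictions coming from the same model queried once per concept.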
🔥 MobileNeRF is out -> Pure Fire! 🔥

👉MobileNeRF is out: the mobile evolution of NeRF via textured polygons.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Same quality, 10x faster than SNeRG
✅Memory-- by storing surface textures
✅Integrated GPUs: less memory/power
✅Suitable for browser & viewer is HTML

More: https://bit.ly/3PUKPWy
🧣NeRF for Outdoor Scene Relighting🧣

πŸ‘‰NeRF-OSR: the first neural radiance fields approach for outdoor scene relighting

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…NeRF-method for outdoor relighting
βœ…Simultaneous illumination/viewpoint
βœ…Control over shading, shadow, albedo
βœ…Self-Supervised training from outdoor
βœ…Dataset: 3240 viewpoints, 110+ times

More: https://bit.ly/3vBiH2G
👩‍🦰 Real-Time Neural Hair 👩‍🦰

👉Accurate hair geometry & appearance from multi-pics

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Bonn, CMU and Reality Labs
✅Photorealistic Real-Time render
✅HQ strand geometry/appearance
✅Novel scalp texture description
✅Intuitive manipulation of 3D hair

More: https://bit.ly/3vBiH2G
❀8πŸ‘6
This media is not supported in your browser
VIEW IN TELEGRAM
🚀 #VR by NASA - 1985 🚀

👉Q: is #VR the technology that developed least in the last 40 years? 🤔

Let's talk: https://bit.ly/3JxDZ7i
🔥 MinVIS, a new SOTA is out 🔥

👉#Nvidia MinVIS: neither video-based architectures nor video training procedures required🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Video architecture/train not required
✅MinVIS outperforms the previous SOTA
✅Occluded VIS (OVIS): >10% improvement
✅1% of labeled frames >> fully-supervised

More: https://bit.ly/3pcYzk1
🔥🔥MultiNeRF: three NeRFs are out!🔥🔥

👉Google opens the code of three #cvpr2022 papers: Mip-NeRF 360, Ref-NeRF, RawNeRF

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Paper_1: Mip-NeRF 360
✅Paper_2: Ref-NeRF
✅Paper_3: NeRF in the Dark

More: https://bit.ly/3QjpRRc
☀️LocoProp: Neural Layers Composition☀️

👉Google AI unveils LocoProp: novel neural paradigm for modular composition of layers.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Backprop++ via Local Loss Optimization
✅Layer-based w-reg, target output, loss
✅Multiple local update via first-order opt.
✅Superior performance and efficiency

More: https://bit.ly/3Q40YJn
🔥PCVOS: clip-wise mask VOS🔥

👉PCVOS: new semi-supervised video object segmentation method

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Reformulating semi-supervised VOS
✅Novel per-clip inference perspective
✅Clip-wise operation on intra-clip
✅PCVOS: model for per-clip inference
✅New SOTA on multiple benchmarks

More: https://bit.ly/3vJtmbz