AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
🟦🟨 StyleGAN on Internet pics 🟦🟨

πŸ‘‰StyleGAN on raw uncurated images collected from Internet

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Outliers & multi-modal
βœ…Self-distillation approach
βœ…Self-filtering of outliers
βœ…Perceptual clustering

More: https://bit.ly/33Z1d5H
❀2πŸ‘1πŸ”₯1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🦜The new SOTA for Unsupervised 🦜

πŸ‘‰Self-supervised transformer to discover objects in images

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Visual tokens as nodes in graph
βœ…Edges as connectivity score
βœ…The second smallest eV = fg
βœ…Suitable for unsupervised saliency
βœ…Weakly supervised obj. detection
βœ…Code under MIT License


More: https://bit.ly/3sqbFg3
πŸ‘4πŸ”₯3🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯¦ GAN-generated CryptoPunks πŸ₯¦

πŸ‘‰A simple (and funny) SN-GAN to generate cryptopunks

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Spectral normalization (2018)
βœ…Easy to incorporate into training
βœ…A project by Teddy Koker 🎩

More: https://bit.ly/35C1rQI
❀3😁3πŸ‘1πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ€ͺSEER: self-AI from BILLIONS picπŸ€ͺ

πŸ‘‰META + INRIA trained models on billions of random images without any pre-processing or assumptions

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Self-supervised on pics from web
βœ…Discovering properties in datasets
βœ…More fair, less biased & less harmful
βœ…Better OOD generalization
βœ…Source code available!

More: https://bit.ly/3vy69dd
πŸ”₯4πŸ‘3🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🐲A novel AI-controllable synthesis🐲

πŸ‘‰Modeling local semantic parts separately and synthesizing images in a compositional way

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Structure & texture locally controlled
βœ…Disentanglement between areas
βœ…Fine-grained editing of images
βœ…Extendible via transfer learning
βœ…Just accepted to #CVPR2022

More: https://bit.ly/3IBgkBy
😱3🀯2❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯£ #AI-Generation with Dream Fields πŸ₯£

πŸ‘‰Neural rendering with multi-modal image and text representations

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Aligned image & text models
βœ…3D from natural language
βœ…No additional data
βœ…D.F. neural-scene

More: https://bit.ly/3Mhwm5D
πŸ‘10πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸŸͺ Mip-NeRF 360 for unbounded scenes πŸŸͺ

πŸ‘‰An extension of NeRF to overcome the challenges presented by unbounded scenes

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Realistic synthesized views
βœ…Intricate/unbounded scenes
βœ…Detailed depth maps
βœ…Mean-squared error -54%
βœ…No code provided πŸ˜₯

More: https://bit.ly/36ZxsD4
🀯4❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ“ PINA: personal Neural Avatar πŸ“

πŸ‘‰A novel method to acquire neural avatars from RGB-D videos

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…A virtual copy of themselves
βœ…Realistic clothing deformations
βœ…Shape & non-rigid deformation
βœ…Avatars from RGB-D sequences
βœ…Creative Commons Zero v1.0

More: https://bit.ly/3HAtRIh
πŸ‘4❀1πŸ‘1😁1
This media is not supported in your browser
VIEW IN TELEGRAM
🐦 EfficientVIS: new SOTA for VIS 🐦

πŸ‘‰Simultaneous classification, segmentation, and tracking multiple object instances in videos

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Efficient and fully end-to-end
βœ…Iterative query-video interaction
βœ…First RoI-wise clip-level RT-VIS
βœ…Requires 15Γ— fewer epochs

More: https://bit.ly/3KfqurN
πŸ‘10πŸ”₯3πŸ‘Ž1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🐠#AI-clips from single frame🐠

πŸ‘‰Moving objects in #3D while generating a video by a sequence of desired actions

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…A playable environments
βœ…A single starting image🀯
βœ…Controllable camera
βœ…Unsupervised learning

More: https://bit.ly/35VDrYO
❀3πŸ‘1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🧊Kubric: AI dataset generator🧊

πŸ‘‰Open-source #Python framework for photo-realistic scenes: full control, rich annotations, TBs of fresh data 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Synthetic datasets with GT
βœ…From NeRF to optical flow
βœ…Full control over data
βœ…Ok privacy & licensing
βœ…Apache License 2.0

More: https://bit.ly/3hQCaFs
πŸ”₯6πŸ‘1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺ‚Β΅Transfer for enormous NNs πŸͺ‚

πŸ‘‰Microsoft unveils how to tune enormous neural networks

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…New HP tuning: Β΅Transfer
βœ…Zero-shot transfer to full-model
βœ…Outperforming BERT-large
βœ…Outperforming 6.7B GPT-3
βœ…Code under MIT license

More: https://bit.ly/3qc37Ij
πŸ”₯2🀯2❀1
This media is not supported in your browser
VIEW IN TELEGRAM
🐧Semantic via only text supervision🐧

πŸ‘‰GroupViT with a text encoder on a large-scale image-text dataset: semantic with any pixel-level annotations in training!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Hierarc. Grouping Vision Transf.
βœ…Additional text encoder
βœ…NO pixel-level annotations
βœ…Semantic-seg task via zero-shot
βœ…Source code available soon

More:https://bit.ly/3hPGeWr
πŸ‘6πŸ₯°1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
⌚4D-Net: Lidar + RGB synchronization⌚

πŸ‘‰Google unveils 4D-Net to combine 3D LiDAR and onboard RGB camera

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Point clouds/images in time
βœ…Fusing multiple modalities in 4D
βœ…Novel sampling for 3D P.C. in time
βœ…New SOTA for 3D detection

More: https://bit.ly/3hZCFwN
πŸ‘12πŸ”₯2🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🐌 New SOTA in video synthesis! 🐌

πŸ‘‰Snap unveils a novel multimodal video generation framework via text/images

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Multimodal video generation
βœ…Bidirectional transformer
βœ…Video token with self-learn.
βœ…Text augmentation for robustness
βœ…Longer sequence synthesis

More: https://bit.ly/3hZLXsG
🀯4πŸ‘1πŸ”₯1πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🎁 StyelNeRF source code is out 🎁

πŸ‘‰3D consistent photo-realistic image synthesis

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…NeRF + style generator
βœ…3D consistency for HD image
βœ…Novel regularization loss
βœ…Camera control on styles

More: https://bit.ly/3t5xC49
πŸ”₯4πŸ₯°1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🦎CLD-based generative #AI by #Nvidia🦎

πŸ‘‰Nvidia unveils a novel critically-damped Langevin diffusion (CLD) for synthetic data

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…A novel diffusion process for SGMs
βœ…Novel score matching obj. for CLD
βœ…Hybrid denoising score matching
βœ…Efficient sampling from CLD model
βœ…Source code under a specific license

More: https://bit.ly/35MToBe
πŸ”₯2🀩2πŸ‘1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ›ΈUFO: segmentation @140+ FPSπŸ›Έ

πŸ‘‰Unified Transformer Framework for Co-Segmentation, Co-Saliency & Salient Object Detection. All in one!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Unified framework for co-segmentation
βœ…Co-segmentation, co-saliency, saliency
βœ…Block for long-range dependencies
βœ…Able to reach for 140 FPS in inference
βœ…The new SOTA on multiple datasets
βœ…Source code under MIT License

More: https://bit.ly/3KLd9b9
πŸ”₯6πŸ‘1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘— Multi-GANs fashion πŸ‘—

πŸ‘‰Global GAN blended with other GANs for faces, shoes, etc.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Multi-GAN framework
βœ…Several generators
βœ…Free of artifacts
βœ…Full-body generation
βœ…Humans, 1024x1024

More: https://bit.ly/37mfOte
πŸ”₯2πŸ‘2❀1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🚧 FLAG: #3D Avatar Generation 🚧

πŸ‘‰A flow-based generative model of the 3D human body from sparse observations.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…FLow-based Avatar Generative
βœ…Conditional distro of body pose
βœ…Exact pose likelihood process
βœ…Invertibility -> oracle latent code

More: https://bit.ly/3CQpk3p
πŸ‘2πŸ”₯1🀯1