AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
GAN-generated CryptoPunks

👉A simple (and fun) SN-GAN to generate CryptoPunks

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Spectral normalization (2018)
✅Easy to incorporate into training
✅A project by Teddy Koker

More: https://bit.ly/35C1rQI
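For readers new to spectral normalization, here is a minimal sketch of the idea in pure Python: estimate a weight matrix's largest singular value by power iteration, then divide the weights by it. The function names and the list-of-rows matrix layout are illustrative assumptions, not code from the linked project.

```python
import math
import random


def spectral_norm(weight, n_iters=50, seed=0):
    """Estimate the largest singular value of a matrix (list of rows) by power iteration."""
    rng = random.Random(seed)
    rows, cols = len(weight), len(weight[0])
    v = [rng.gauss(0.0, 1.0) for _ in range(cols)]
    u = [0.0] * rows
    for _ in range(n_iters):
        # u <- W v, normalized
        u = [sum(weight[i][j] * v[j] for j in range(cols)) for i in range(rows)]
        u_norm = math.sqrt(sum(x * x for x in u)) or 1.0
        u = [x / u_norm for x in u]
        # v <- W^T u, normalized
        v = [sum(weight[i][j] * u[i] for i in range(rows)) for j in range(cols)]
        v_norm = math.sqrt(sum(x * x for x in v)) or 1.0
        v = [x / v_norm for x in v]
    # sigma = u^T W v
    wv = [sum(weight[i][j] * v[j] for j in range(cols)) for i in range(rows)]
    return sum(u[i] * wv[i] for i in range(rows))


def spectrally_normalize(weight, n_iters=50, seed=0):
    """Rescale the matrix so its largest singular value is about 1 (assumes a non-zero matrix)."""
    sigma = spectral_norm(weight, n_iters=n_iters, seed=seed)
    return [[w / sigma for w in row] for row in weight]


if __name__ == "__main__":
    w = [[3.0, 0.0], [0.0, 1.0]]
    print(spectral_norm(w))
    print(spectrally_normalize(w))
```

In a GAN this rescaling is typically applied to each discriminator layer at every training step. PyTorch ships a ready-made wrapper, `torch.nn.utils.spectral_norm`; whether the linked project uses it is not stated in the post.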
SEER: self-AI from BILLIONS pic

👉META + INRIA trained models on billions of random images without any pre-processing or assumptions

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Self-supervised on pics from web
✅Discovering properties in datasets
✅More fair, less biased & less harmful
✅Better OOD generalization
✅Source code available!

More: https://bit.ly/3vy69dd
A novel AI-controllable synthesis

👉Modeling local semantic parts separately and synthesizing images in a compositional way

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Structure & texture locally controlled
✅Disentanglement between areas
✅Fine-grained editing of images
✅Extendible via transfer learning
✅Just accepted to #CVPR2022

More: https://bit.ly/3IBgkBy
#AI-Generation with Dream Fields

👉Neural rendering with multi-modal image and text representations

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Aligned image & text models
✅3D from natural language
✅No additional data
✅D.F. neural-scene

More: https://bit.ly/3Mhwm5D
🟊 Mip-NeRF 360 for unbounded scenes 🟊

👉An extension of NeRF to overcome the challenges presented by unbounded scenes

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Realistic synthesized views
✅Intricate/unbounded scenes
✅Detailed depth maps
✅Mean-squared error -54%
✅No code provided 😥

More: https://bit.ly/36ZxsD4
🐓 PINA: personal Neural Avatar 🐓

👉A novel method to acquire neural avatars from RGB-D videos

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅A virtual copy of themselves
✅Realistic clothing deformations
✅Shape & non-rigid deformation
✅Avatars from RGB-D sequences
✅Creative Commons Zero v1.0

More: https://bit.ly/3HAtRIh
EfficientVIS: new SOTA for VIS

👉Simultaneous classification, segmentation, and tracking of multiple object instances in videos

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Efficient and fully end-to-end
✅Iterative query-video interaction
✅First RoI-wise clip-level RT-VIS
✅Requires 15× fewer epochs

More: https://bit.ly/3KfqurN
🐠#AI-clips from single frame🐠

👉Moving objects in #3D while generating a video by a sequence of desired actions

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Playable environments
✅A single starting image 🤯
✅Controllable camera
✅Unsupervised learning

More: https://bit.ly/35VDrYO
🧊Kubric: AI dataset generator🧊

👉Open-source #Python framework for photo-realistic scenes: full control, rich annotations, TBs of fresh data 🤯

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Synthetic datasets with GT
✅From NeRF to optical flow
✅Full control over data
✅Ok privacy & licensing
✅Apache License 2.0

More: https://bit.ly/3hQCaFs
🊂µTransfer for enormous NNs 🊂

👉Microsoft unveils how to tune enormous neural networks

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅New HP tuning: µTransfer
✅Zero-shot transfer to full-model
✅Outperforming BERT-large
✅Outperforming 6.7B GPT-3
✅Code under MIT license

More: https://bit.ly/3qc37Ij
🐧Semantic via only text supervision🐧

👉GroupViT, trained with a text encoder on a large-scale image-text dataset: semantic segmentation without any pixel-level annotations in training!

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Hierarc. Grouping Vision Transf.
✅Additional text encoder
✅NO pixel-level annotations
✅Semantic-seg task via zero-shot
✅Source code available soon

More: https://bit.ly/3hPGeWr
⌚4D-Net: LiDAR + RGB synchronization⌚

👉Google unveils 4D-Net to combine 3D LiDAR and onboard RGB camera

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Point clouds/images in time
✅Fusing multiple modalities in 4D
✅Novel sampling for 3D P.C. in time
✅New SOTA for 3D detection

More: https://bit.ly/3hZCFwN
🐌 New SOTA in video synthesis! 🐌

👉Snap unveils a novel multimodal video generation framework via text/images

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Multimodal video generation
✅Bidirectional transformer
✅Video token with self-learn.
✅Text augmentation for robustness
✅Longer sequence synthesis

More: https://bit.ly/3hZLXsG
🎁 StyleNeRF source code is out 🎁

👉3D consistent photo-realistic image synthesis

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅NeRF + style generator
✅3D consistency for HD image
✅Novel regularization loss
✅Camera control on styles

More: https://bit.ly/3t5xC49
CLD-based generative #AI by #Nvidia

👉Nvidia unveils a novel critically-damped Langevin diffusion (CLD) for synthetic data

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅A novel diffusion process for SGMs
✅Novel score matching obj. for CLD
✅Hybrid denoising score matching
✅Efficient sampling from CLD model
✅Source code under a specific license

More: https://bit.ly/35MToBe
UFO: segmentation @140+ FPS

👉Unified Transformer Framework for Co-Segmentation, Co-Saliency & Salient Object Detection. All in one!

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Unified framework for co-segmentation
✅Co-segmentation, co-saliency, saliency
✅Block for long-range dependencies
✅Reaches 140 FPS at inference
✅The new SOTA on multiple datasets
✅Source code under MIT License

More: https://bit.ly/3KLd9b9
👗 Multi-GANs fashion 👗

👉Global GAN blended with other GANs for faces, shoes, etc.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Multi-GAN framework
✅Several generators
✅Free of artifacts
✅Full-body generation
✅Humans, 1024x1024

More: https://bit.ly/37mfOte
🚧 FLAG: #3D Avatar Generation 🚧

👉A flow-based generative model of the 3D human body from sparse observations.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅FLow-based Avatar Generation (FLAG)
✅Conditional distro of body pose
✅Exact pose likelihood process
✅Invertibility -> oracle latent code

More: https://bit.ly/3CQpk3p
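To see why a flow-based model gives an exact likelihood and an invertible latent code, here is a toy 1-D sketch. It uses a single affine map with a standard-normal base density; the class name and parameters are hypothetical, and the actual FLAG model is a conditional flow over full body poses, which this does not reproduce.

```python
import math


class AffineFlow:
    """Invertible 1-D map z = (x - shift) / scale with a standard-normal base density."""

    def __init__(self, shift, scale):
        if scale == 0:
            raise ValueError("scale must be non-zero for the map to be invertible")
        self.shift = shift
        self.scale = scale

    def forward(self, x):
        # Data -> latent code.
        return (x - self.shift) / self.scale

    def inverse(self, z):
        # Latent code -> data; exact inverse of forward().
        return z * self.scale + self.shift

    def log_prob(self, x):
        # Change of variables: log p(x) = log N(z; 0, 1) + log |dz/dx|.
        z = self.forward(x)
        log_base = -0.5 * (z * z + math.log(2.0 * math.pi))
        return log_base - math.log(abs(self.scale))


if __name__ == "__main__":
    flow = AffineFlow(shift=1.0, scale=2.0)
    z = flow.forward(0.3)
    print(z, flow.inverse(z), flow.log_prob(0.3))
```

Because the map is invertible, every observation has exactly one latent code, and its log-likelihood is computed in closed form rather than approximated.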
💃 Dancing in the wild with StyleGAN 💃

👉StyleGAN-based animations for AR/VR apps

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Video-based motion retargeting
✅StyleGAN-based architecture
✅Novel explicit motion representation
✅SOTA qualitatively & quantitatively

More: https://bit.ly/3CZbL1W
🊀TensoRF: the 4D evolution of NeRF 🊀

👉TensoRF, a novel radiance field representation via a 4D tensor: a 3D voxel grid with per-voxel multi-channel features.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅VM decomposition technique
✅Low-rank tensor factorization
✅Lower memory footprint (speed)
✅TensoRF is the new SOTA in R.F.
✅Code under the MIT License

More: https://bit.ly/3qffZgI
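A rough sketch of the vector-matrix (VM) idea, assuming a single-channel N×N×N grid stored as rank-R factors: each component pairs a vector along one axis with a matrix over the other two. The function names and the nested-list layout are illustrative assumptions, not the TensoRF code base, and the per-voxel feature channels are left out for brevity.

```python
def vm_value(i, j, k, vx, myz, vy, mxz, vz, mxy):
    """Value at voxel (i, j, k) of a scalar grid stored as rank-R vector-matrix factors.

    vx[r], vy[r], vz[r] are length-N vectors; myz[r], mxz[r], mxy[r] are N x N matrices.
    """
    total = 0.0
    for r in range(len(vx)):
        total += vx[r][i] * myz[r][j][k]  # X vector times YZ matrix
        total += vy[r][j] * mxz[r][i][k]  # Y vector times XZ matrix
        total += vz[r][k] * mxy[r][i][j]  # Z vector times XY matrix
    return total


def param_counts(n, rank):
    """Stored values for a dense N^3 grid versus its rank-R VM factors."""
    dense = n ** 3
    factored = 3 * rank * (n + n * n)
    return dense, factored


if __name__ == "__main__":
    print(param_counts(300, 16))
```

The factored storage grows with R·N² instead of N³, which is where the lower memory footprint comes from at high grid resolutions.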