AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ¦’ Text2LIVE: Text-Driven Neural Editing πŸ¦’

πŸ‘‰#Amazon unveils a novel #AI for text-driven edit of videos. Insane! 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Semantic edits of real-world videos
βœ…Edit layer–RGBA representing target
βœ…Edit layers synthesized on single input
βœ…No masks or a pre-trained generator

More: https://bit.ly/3NVP6aE
🀯18πŸ‘9πŸ”₯8❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ“ŸπŸ“ŸAI-Designed Circuits with Deep RLπŸ“ŸπŸ“Ÿ

πŸ‘‰#Nvidia unveils an #AI to design circuits from scratch, smaller and faster than SOTA ones

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Parallel prefix circuits for Hi-Perf
βœ…RL framework to explore the circuit space
βœ…Smaller, Faster, Power-- from the scratch

More: https://bit.ly/3yY9dk7
🀯13πŸ‘5πŸ”₯3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘½ Neural I2I with a few shoots πŸ‘½

πŸ‘‰#Alibaba unveils a novel portrait stylization. Limited samples (∼100) -> HD outputs

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Calibration first, translation later
βœ…Balanced distribution to calibrate bias
βœ…Spatially semantic constraints via geometry
βœ…Source code and models soon available!

More: https://bit.ly/3IwOmHO
❀10πŸ‘5😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ€Ήβ€β™‚οΈ K-Means Mask Transformer πŸ€Ήβ€β™‚οΈ

πŸ‘‰#Google AI unveils kMaX-DeepLab, novel E2E method for segmentation

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…kMaX-DeepLab: k-means Mask Xformer
βœ…Rethinking relationship pixels / object
βœ…Cross-attention -> k-means clustering
βœ…The new SOTA on several dataset

More: https://bit.ly/3O2QV5I
πŸ”₯11πŸ‘2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
β˜€οΈ 4D Neural Relightable Humans β˜€οΈ

πŸ‘‰Relighting4D: free-viewpoints relighting of humans under unknown illuminations

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Relight dynamic, free viewpoints
βœ…Disentangled reflectance/geometry
βœ…SOTA on synthetic/real datasets
βœ…Code/models under MIT License

More: https://bit.ly/3RF3yH9
πŸ”₯9πŸ‘2
This media is not supported in your browser
VIEW IN TELEGRAM
🍰 Long-Term Object Segmentation 🍰

πŸ‘‰XMem: object segmentation for long clips with unified feature memory stores

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Inspired by Atkinson–Shiffrin model
βœ…Stores with different temporal scales
βœ…Memory consolidation algorithm
βœ…Compact/powerful long-term memory
βœ…Source code and models available

More: https://bit.ly/3PP0EOn
🀯16πŸ‘5πŸ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯Grand Unification of Object TrackingπŸ”₯

πŸ‘‰UNICORN: unified method for SOT, MOT, VOS, & MOTS with a single neural net. 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Great unification for 4 tracking tasks
βœ…Bridging methods / pixel-wise corresp.
βœ…SOTA on 8 challenging benchmarks
βœ…Source code under MIT License

More: https://bit.ly/3o74h6g
πŸ‘13πŸ”₯3🀯1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯OmniBenchmark: CV beyond ImageNetπŸ”₯

πŸ‘‰ 21 realms, 7,000+ concepts and 1M+ images. Far beyond ImageNet!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…vs. ImageNet: 2.5x realms, 9x concepts
βœ…Conciseness: no concept overlapping
βœ…ReCo: Relational Contrastive Learning
βœ…New supervised contrastive learning SOTA

More: https://bit.ly/3RJRKU0
πŸ”₯11🀩3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ’£ HD Neural Avatar @130FPS πŸ’£

πŸ‘‰Samsung unveils MegaPortraits: novel one-shot creation of HD neural human avatar

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…One-shot neural avatars, SOTA up 512p
βœ…"Upgrading" to megapixel via more pics
βœ…First Neural Head Avatars in HD
βœ…Up to to 130 FPS via #GPU

More: https://bit.ly/3oboWWT
πŸ”₯22πŸ‘1πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🦚 TimeLens++: Event-based Interpolation 🦚

πŸ‘‰Novel event-based interpolation with non-linear flow & multi-scale fusion

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Novel motion spline estimator
βœ…Non-linear continuous event/frames flow
βœ…Multi-feature fusion, gated compression
βœ…Novel hybrid dataset with 100+ videos

More: https://bit.ly/3yJyY6g
πŸ”₯16πŸ‘4
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺ°NUWA-Infinity is out!πŸͺ°

πŸ‘‰βˆž generation by #Microsoft: arbitrarily-sized HD images and long videos 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Unconditional Image Gen.
βœ…Text-to-Image/Text-to-Clip
βœ…Animation / Out-painting
βœ…Hi-res, arbitrary long clip
βœ…NCP for patches caching

More: https://bit.ly/3zmBf9f
πŸ”₯7πŸ‘2❀1πŸ‘1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯ #AIwithPapers: we are 3,500+! πŸ”₯

πŸ’™πŸ’› Ready for YOLO 10, 11, Ο€, ∞, Ξ¨, and more? The more we are, the faster we catch'em all πŸ’™πŸ’›

😈 Invite your friends -> https://t.me/AI_DeepLearning
πŸ‘12❀10😁5πŸ”₯3
This media is not supported in your browser
VIEW IN TELEGRAM
🎷🎷OMNI3D: #3D Objects in the Wild🎷🎷

πŸ‘‰#3D detection: 234k images, 3M+ instances & 97 categories

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…OMNI3D from publicly released dataset
βœ…234k pics, 3M+ annotation with 3D box
βœ…97 categories such as sofa, table, cars
βœ…Fast (450x) and exact algorithm for IoU
βœ…Cube R-CNN: novel 3D object detector

More: https://bit.ly/3cznjzG
πŸ‘11
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘ΉMultiface Neural Rendering πŸ‘Ή

πŸ‘‰A new multi-view, Hi-Res data collected at #META Reality Labs for neural face

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Mugsy, large scale multi-cam apparatus
βœ…High-Res sync facial performance
βœ…Closing the gap in accessing HQ data
βœ…Suitable for #VR & #mixedreality

More: https://bit.ly/3b6XfeL
🀯8πŸ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ’„DEVIANT: SOTA in mono-3D detectionπŸ’„

πŸ‘‰A novel Depth EquiVarIAnt NeTwork for 3D monocular detection in the wild

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Michigan + #Meta + Ford 🀯
βœ…Depth-equi. + scale equiv. steerable
βœ…New SOTA on KITTI & Waymo
βœ…Ok cross-dataset -> generalization

More: https://bit.ly/3OEFtgK
πŸ”₯16πŸ‘2❀1
This media is not supported in your browser
VIEW IN TELEGRAM
🧱 Assembling #LEGO with #AI 🧱

πŸ‘‰Step-by-step assembly manual created by human into machine-interpretable instructions

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Stanford + MIT + #Google 🀯
βœ…MEPNet: Manual-to-Executable-Plan Net
βœ…Manual to machine-executable plan
βœ…2D manual - 3D geometric shape
βœ…Reasoning on 3D alignments of legos

More: https://bit.ly/3PCwn5C
πŸ”₯9❀3