AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
😍Animated hand in 1972, damn romantic😍

πŸ‘‰Q: is #VR the technology that developed least in the last 30 years? πŸ€”

More: https://bit.ly/3snxNaq
πŸ‘7❀3🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
⏏️Ensembling models for GAN training⏏️

πŸ‘‰Pretrained vision models to improve the GAN training. FID by 1.5 to 2Γ—!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…CV models as ensemble of discriminators
βœ…Improving GAN in limited / large-scale set
βœ…10k samples matches StyleGAN2 w/ 1.6M
βœ…Source code / models under MIT license

More: https://bit.ly/3wgUVsr
🀯6πŸ”₯2
This media is not supported in your browser
VIEW IN TELEGRAM
🀯Cooperative Driving + AUTOCASTSIM🀯

πŸ‘‰COOPERNAUT: cross-vehicle perception for vision-based cooperative driving

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…UTexas + #Stanford + #Sony #AI
βœ…LiDAR into compact point-based
βœ…Network-augmented simulator
βœ…Source code and models available

More: https://bit.ly/3sr5HLk
πŸ”₯6🀯3πŸ₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ’„NeuralHDHair: 3D Neural HairπŸ’„

πŸ‘‰NeuralHDHair: fully automatic system for modeling HD hair from a single image

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…IRHairNet for hair geometric features
βœ…GrowingNet: 3D hair strands in parallel
βœ…VIFu: novel voxel-aligned implicit function
βœ…SOTA in 3D hair modeling from single pic

More: https://bit.ly/38iR0mQ
πŸ‘5πŸ₯°3❀1
This media is not supported in your browser
VIEW IN TELEGRAM
🐑DyNeRF: Neural 3D Video Synthesis🐑

πŸ‘‰#Meta unveils DyNeRF, novel rendering HQ 3D video

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Novel NeRF-based on temp-latent codes
βœ…Novel training based on hierarchical step
βœ…Datasets of time-synch/calibrated clips
βœ…Attribution-NonCommercial 4.0 Int.

More: https://bit.ly/3MlBRA9
🀯8πŸ‘2πŸ”₯1🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‹GATO: agent for multiple tasksπŸ‹

πŸ‘‰The same network with the same weights can play Atari, caption pics, chat, and more🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…General-purpose agent, multiple tasks
βœ…Multi-modal-task, multi-embodiment
βœ…Inspired by large-scale language model

More: https://bit.ly/3LbBOWb
🀯10❀3πŸ‘2πŸ”₯2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺNeRF powered by keypointsπŸͺ

πŸ‘‰ETHZ + META unveil how to encode relative spatial #3D info via sparse 3D keypoints

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Sparse 3D keypoints for SOTA avatars
βœ…Unseen subjects from 2/3 views
βœ…Never-before-seen iPhone captures

More: https://bit.ly/39NQqhe
🀯5πŸ”₯2❀1πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
🐌Self-Supervised human co-evolution🐌

πŸ‘‰Self-supervised 3D by co-evolution of pose estimator, imitator, and hallucinator

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Novel self-supervised 3D pose
βœ…Co-evo of pose, imitator, hallucinator
βœ…Realist 3D pose and 2D-3D supervision
βœ…Source code / model under MIT license

More: https://bit.ly/37J5ImL
πŸ”₯4πŸ‘3❀1🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
🐲 Diff-SDF #3D Rendering 🐲

πŸ‘‰Reconstruction with no complex reg. or priors, using only a per-pixel RGB loss

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Diff-render to optimize geometry/albedo
βœ…No ad-hoc object mask or supervision
βœ…Extended sphere tracing algorithm

More: https://bit.ly/3yKWPnI
🀯10πŸ‘4πŸ”₯2❀1🀩1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘„LVD: new SOTA for #3D humanπŸ‘„

πŸ‘‰Corona et al. unveils a novel 3D human model fitting

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Solution via neural field
βœ…Not sensitive to initialization
βœ…SOTA in shape from single pic
βœ…SOTA in fitting 3D scans

More: https://bit.ly/3Ng4lLr
πŸ‘4πŸ”₯2🀯1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ³οΈβ€πŸŒˆDeep Clustering on ImageNet & Co.πŸ³οΈβ€πŸŒˆ

πŸ‘‰World's first deep nonparametric clustering on large dataset such as ImageNet

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Deep clustering that infers nr. of clusters
βœ…Loss: amortized inference in mixt-models
βœ…Deep nonparametric clustering on ImageNet
βœ…Code and model available under MIT license

More: https://bit.ly/38p62rn
πŸ”₯9🀯3πŸ‘2🀩2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ’₯HQ-EΒ²FGVI just releasedπŸ’₯πŸ’₯

πŸ‘‰Flow-Guided Video Inpainting through three trainable modules

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Flow, pixel-prop, content hallucination
βœ…Three stage-modules, jointly optimized
βœ…The new SOTA, promising efficiency
βœ…Code and Models under MIT license

More: https://bit.ly/3Ln0ICj
🀯10πŸ‘1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺ” AvatarCLIP: Text-Driven Avatar πŸͺ”

πŸ‘‰Zero-shot text-driven for #3D avatar in #metaverse

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…First text-driven synthesis
βœ…Shape, texture, and motion
βœ…Animation-ready, HQ texture/geometry
βœ…Zero-shot text-guided ref-based motion
βœ…Code and model under MIT license

More: https://bit.ly/3LjTWgB
πŸ”₯4πŸ‘2🀯2❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯#AIwithPapers: we are 2,500!πŸ”₯

πŸ’™πŸ’›Only 2 Billion papers remaining on arXiv. The more we are, the faster we readπŸ’™πŸ’›

😈 Invite your friends -> https://t.me/AI_DeepLearning
πŸ”₯9❀4πŸ‘2πŸ€”2πŸ‘1
πŸ’₯Podcasting AI & CVπŸ’₯

πŸ‘‰πŸΌFor people fluent in Italian: 1 hour podcast in which I talk about AI, CV, Startup and more (included this wonderful project).

More: https://bit.ly/38DtBwB
πŸ‘6❀3πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯Inpainting: new SOTA! INSANEπŸ”₯

πŸ‘‰Novel two-stream approach: inpainting at the next level!

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…High-freq locally, low-freq globally
βœ…Local to global -> error correction
βœ…44% / 26% improvements FID/scores
βœ…Source code, more clips available

More: https://bit.ly/3ltIX9R
πŸ‘8🀯3πŸ”₯1πŸ₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯Super-Human Crossword SolverπŸ”₯

πŸ‘‰Solving crosswords outperforming best humans

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Crossword solving based on NNs
βœ…Q&A, structured decoding, local search
βœ…Wide domains with perfect accuracy
βœ…Large question-answer dataset

More: https://bit.ly/3a3zzqQ
πŸ”₯4🀯3πŸ‘2πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯ΈImagen: far beyond DALLΒ·E 2πŸ₯Έ

πŸ‘‰#Google: unprecedented photorealism and deep level of language understanding

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Dynamic thresh diffusion sampling
βœ…Efficient U-Net, efficient++ variant
βœ…DrawBench, new text-to-image
βœ…The new SOTA, COCO FID of 7.27

More: https://bit.ly/3lVtkbz
πŸ”₯9🀯6πŸ‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺ€Tracking over SOTA detectorsπŸͺ€

πŸ‘‰Lightweight Python lib for real-time 2D object tracking πŸ’₯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Layer of tracking over SOTA detectors
βœ…Suitable for complex video processing
βœ…Source code under BSD 3-Clause
βœ…Maintained by Tryolabs team

More: https://bit.ly/3wKtGqg
πŸ‘7πŸ”₯3🀩3