AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
👋Forecasting interactions via attention👋

👉Predicting the hand motion trajectory and the future contact points on the next active object

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Object-Centric Transformer (OCT)
✅Self-attention Transformer mechanism
✅Framework to handle uncertainty
✅SOTA on Epic-Kitchens and EGTEA

More: https://bit.ly/3v3PpbI
SmeLU: Smooth Activation Function

👉Google unveils a new smooth activation function: easy to implement, cheap & less error-prone

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Smooth to mitigate irreproducibility
✅Cheap function, better than GELU/Swish
✅Slope from 0 to 1 through a quadratic middle region
✅SmeLU as convolution of ReLU with a box function
✅Best reproducibility-accuracy tradeoff

More: https://bit.ly/3xcskXm
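
A minimal sketch of the piecewise form behind the SmeLU highlights above, assuming a half-width parameter named `beta` (the function names, the parameter name, and the default of 1.0 are illustrative choices, not taken from the post). The function is 0 below -beta, the identity above beta, and quadratic in between, which is what convolving ReLU with a box of width 2*beta and unit area yields:

```python
def smelu(x: float, beta: float = 1.0) -> float:
    """Smooth ReLU sketch: 0 for x <= -beta, x for x >= beta, quadratic in between."""
    if beta <= 0:
        raise ValueError("beta must be positive")
    if x <= -beta:
        return 0.0
    if x >= beta:
        return float(x)
    # Quadratic middle region; matches 0 at x = -beta and beta at x = beta.
    return (x + beta) ** 2 / (4.0 * beta)


def smelu_grad(x: float, beta: float = 1.0) -> float:
    """Derivative of the sketch above: rises linearly from 0 to 1 on [-beta, beta]."""
    if beta <= 0:
        raise ValueError("beta must be positive")
    if x <= -beta:
        return 0.0
    if x >= beta:
        return 1.0
    return (x + beta) / (2.0 * beta)
```

With beta = 1, `smelu(0.0)` evaluates to 0.25 and `smelu_grad(0.0)` to 0.5. A smaller beta narrows the quadratic region, so the function approaches plain ReLU.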
📍Hyper-Dense Landmarks at 150FPS📍

👉#Microsoft unveils the SOTA in dense landmarking + #3D reconstruction. MAGIC.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Accurate, with 10× as many landmarks as usual
✅Synthetic data, perfect annotations
✅NO appearance, light, diff-rendering
✅#3D @150+FPS with a single CPU thread
✅SOTA in monocular 3D reconstruction

More: https://bit.ly/37pQS40
☀️SunStage: Selfie with the Sun☀️

👉Accurate/tailored reconstruction of facial geometry/reflectance

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Novel personalized scanning
✅Disentanglement of scene params
✅Geometry, materials, lighting, poses
✅Photorealistic with a single selfie video

More: https://bit.ly/36W1Oqx
📫 Generative Neural Avatars 📫

👉3D shapes of people in a variety of garments with corresponding skinning weights

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ETH + Uni-Tübingen + Max Planck
✅Animatable #3D human in garment
✅Directly from raw posed 3D scans
✅NO canonical, registration, manual w.
✅Geometric detail in clothing deformation


More: https://bit.ly/3M7mCdB
🗨️Conversational program synthesis🗨️

👉Conversational synthesis to translate English into executable code

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Conversational program synthesis
✅New multi-turn programming benchmark
✅Open custom library: JAXFORMER
✅Source code under BSD-3 license

More: https://bit.ly/3jjWWhk
🧯Long Video Diffusion Models🧯

πŸ‘‰#Google unveils a novel diffusion model for video generation

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Straightforward extension of 2D UNet
βœ…Longer by new conditional generation
βœ…SOTA in unconditional generation

More: https://bit.ly/35Y2rzg
🚙 AutoRF: #3D objects in-the-wild 🚙

👉From #Meta: #3D object from just a single, in-the-wild image

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Novel view synthesis from in-the-wild images
✅Normalized, object-centric representation
✅Disentangling shape, appearance & pose
✅Exploiting bounding boxes & panoptic segmentation
✅Shape/appearance properties for objects


More: https://bit.ly/3O4ONeQ
🌠GAN-based Darkest Dataset🌠

πŸ‘‰Berkeley + #Intel announce first photorealistic dataset under starlight (no moon, <0.001 lx)

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…"Darkest" dataset ever seen
βœ…Moonless, no external illumination
βœ…GAN-tuned physics-based model
βœ…Clips with dancing, volleyball, flags...

More: https://bit.ly/3LXxMkN
🤖Populating with digital humans🤖

👉ETHZ unveils GAMMA to populate the #3D scene with digital humans

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅GenerAtive Motion primitive MArkers
✅Realistic, controllable, infinite motions
✅Tree-based search to preserve quality
✅SOTA in realistic/controllable motion

More: https://bit.ly/3OgY4AG
🔥#AIwithPapers: we are ~2,000!🔥

💙💛 Simply amazing. Thank you all 💙💛

😈 Invite your friends -> https://t.me/AI_DeepLearning
😼GARF: Gaussian Activated NeRF😼

πŸ‘‰GARF: Gaussian Activated R.F. for Hi-Fi reconstruction/pose

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…NeRF from imperfect camera poses
βœ…NO hyper-parameter tuning/initialization
βœ…Theoretical insight on Gaussian activation
βœ…Unlocking NeRF for real-world application?

More: https://bit.ly/36bvdfU
🎭Novel pre-training strategy for #AI🎭

πŸ‘‰EPFL unveils the Multi-modal Multi-task Masked Autoencoders (MultiMAE)

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Multimodal: additional modal. over RGB
βœ…Multi-task: multiple outputs over RGB
βœ…General: MultiMAE by pseudo-labeling
βœ…Classification, segmentation, depth
βœ…Code under NonCommercial 4.0 Int.

More: https://bit.ly/3jRhNsN
🧪 A new SOTA in Dataset Distillation 🧪

👉A new approach by Matching Training Trajectories is out!

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Distilling data "to match" a bigger dataset
✅Distilled data to guide a network
✅Trajectories of experts from real data
✅SOTA + distilling higher-res visual data

More: https://bit.ly/3JwYOxW
🧀 Two-Hand tracking via GCN 🧀

πŸ‘‰The first-ever GCN for two interacting hands in single RGB image

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Reconstruction by GCN mesh regression
βœ…PIFA: pyramid attention for local occlusion
βœ…CHA: cross hand attention for interaction
βœ…SOTA + generalization in-the-wild scenario
βœ…Source code available under GNU 🀯

More: https://bit.ly/3KH5FWO
🕹️Video K-Net, SOTA in Segmentation🕹️

👉Simple, strong, and unified framework for fully end-to-end video panoptic segmentation

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Learnable kernels from K-Net
✅K-Net learns to segment & track
✅Appearance / cross-T kernel interaction
✅New SOTA without bells and whistles 🤷‍♂️

More: https://bit.ly/3uEEZQR
🐭DeepLabCut: tracking animals in the wild🐭

πŸ‘‰A toolbox for markerless pose estimation of animals performing various tasks

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Multi-animal pose estimation
βœ…Datasets for multi-animal pose
βœ…Key-points, limbs, animal identity
βœ…Optimal key-points without input

More: https://bit.ly/37L1mLE
🍑Neural Articulated Human Body🍑

πŸ‘‰Novel neural implicit representation for articulated body

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…COmpositional Articulated People
βœ…Large variety of shapes & poses
βœ…Novel encoder-decoder architecture

More: https://bit.ly/3xvn7dl
🦚 2K Resolution Generative #AI 🦚

πŸ‘‰Novel continuous-scale training with variable output resolutions

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Mixed-resolution data
βœ…Arbitrary scales during training
βœ…Generations beyond 1024Γ—1024
βœ…Variant of FID metric for scales
βœ…Source code under MIT license

More: https://bit.ly/3uNfVY6