AI with Papers - Artificial Intelligence & Deep Learning
14.9K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
πŸ”₯ #AIwithPapers: we are 2,900+! πŸ”₯

πŸ’™πŸ’› Cheers from "Black Metal Lady Gaga" plotted by DallE-mini πŸ’™πŸ’›

😈 Invite your friends -> https://t.me/AI_DeepLearning
😁8πŸ‘3❀2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ…Segmentation with INSANE OcclusionsπŸ…

πŸ‘‰CMU unveils WALT: segmenting in severe occlusion scenarios. Performance over human.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…WALT: Watch & Learn Time-lapse
βœ…4K/1080p cams on streets over a year
βœ…Performance over human-supervised
βœ…Object-occluder-occluded neural layers
βœ…Source code under MIT license

More: https://bit.ly/3n7pvjO
🀯14πŸ‘4πŸ”₯3
This media is not supported in your browser
VIEW IN TELEGRAM
🐠Largest Dataset for #autonomousdriving🐠

πŸ‘‰SHIFT: largest synthetic dataset for #selfdrivingcars. Shifts in cloud, rain, fog, time of day, vehicle & pedestrian density🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…4,800+ clips, multi-view sensor suite
βœ…Semantic/instance, M/stereo depth
βœ…2D/3D object detection, MOT
βœ…Optical flow, point cloud registration
βœ…Visual-Odo, trajectory & human pose

More: https://bit.ly/3HJBUUT
🀯9πŸ‘5❀2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ¦‘Big Egocentric Dataset by #Meta πŸ¦‘

πŸ‘‰Novel dataset to speed-up research on egocentric MR/AI

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…159 sequences, multiple sensors
βœ…Scenarios: cooking, exercising, etc.
βœ…β€˜Desktop Activities’ via multi-view mocap
βœ…Dataset available upon request

More: https://bit.ly/3QDccVW
πŸ”₯8πŸ‘3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ¦‹Transf-Codebook HD-Face RestorationπŸ¦‹

πŸ‘‰S-Lab unveils CodeFormer: hyper-datailed face restoration from degraded clips

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Face restoration as a code prediction
βœ…Discrete CB prior in small proxy space
βœ…Controllable transformation for LQ->HQ
βœ…Robustness and global coherence
βœ…Code and models soon available

More: https://bit.ly/3QEa9B5
πŸ”₯13πŸ‘7❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ” Fully Controllable "NeRF" Faces πŸ”

πŸ‘‰Neural control of pose/expressions from single portrait video

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…NeRF-control of the human head
βœ…Loss of rigidity by dynamic NeRF
βœ…3D full control/modelling of faces
βœ…No source code or models yet 😒

More: https://bit.ly/3OEjwi7
πŸ”₯8πŸ‘3❀2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ«€I M AVATAR: source code is out!πŸ«€

πŸ‘‰Neural implicit head avatars from monocular videos

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…#3D morphing-based implicit avatar
βœ…Detailed Geometry/appearance
βœ…D-Rendering e2e learning from clips
βœ…Novel synthetic dataset for evaluation

More: https://bit.ly/3A2yzy9
πŸ‘8πŸ‘4
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ—ΊοΈNeural Translation Image -> MapπŸ—ΊοΈ

πŸ‘‰A novel method for instantaneous mapping as a translation problem

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Bird’s-eye-view (BEV) map from image
βœ…A restricted data-efficient transformer
βœ…Monotonic attention from lang.domain
βœ…SOTA across several datasets

More: https://bit.ly/39MQ76Z
πŸ”₯20πŸ‘6😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ₯Ά E2V-SDE: biggest troll ever? πŸ₯Ά

πŸ‘‰E2V-SDE paper (accepted to #CVPR2022) consists of texts copied from 10+ previously published papers πŸ˜‚

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Latent ODEs for Irregularly-Sampled TS
βœ…Stochastic Adversarial Video Prediction
βœ…Continuous Latent Process Flows
βœ…More papers....


More: https://bit.ly/3bsL8Zw (AUDIO ON!)
πŸ‘9
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯πŸ”₯YOLOv6 is out: PURE FIRE!πŸ”₯πŸ”₯

πŸ‘‰YOLOv6 is a single-stage object detection framework for industrial applications

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Efficient Decoupled Head with SIoU Loss
βœ…Hardware-friendly for Backbone/Neck
βœ…520+ FPS on T4 + TensorRT FP16
βœ…Released under GNU General Public v3.0

More: https://bit.ly/3OLjncK
πŸ”₯37πŸ‘6
This media is not supported in your browser
VIEW IN TELEGRAM
πŸͺ BlazePose: Real-Time Human Tracking πŸͺ

πŸ‘‰Novel real-time #3D human landmarks from #google. Suitable for mobile.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…MoCap from single RGB on mobile
βœ…Avatar, Fitness, #Yoga & AR/VR
βœ…Full body pose from monocular
βœ…Novel 3D ground truth acquisition
βœ…Additional hand landmarks
βœ…Fully integrated in #MediaPipe

More: https://bit.ly/3uvyiAv
πŸ”₯14πŸ‘4
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯YOLOv7: YOLO for segmentationπŸ”₯

πŸ‘‰YOLOv7: adding a lot of newer skills to the YOLO architecture family.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…YOLOv7, not a successor of YOLO family!
βœ…Framework for detection & segmentation
βœ…Applications based on #META detectron2
βœ…DETR & ViT detection out-of-box
βœ…Easy support for pipeline thought #ONNX
βœ…YOLOv4 + InstanceSegm. via single stage
βœ…The latest YOLOv6 training is supported!
βœ…Source code under GPL license.

More: https://bit.ly/3ysSJAp
πŸ”₯22🀯9πŸ‘5😁2
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯πŸ”₯ HD Dichotomous Segmentation πŸ”₯πŸ”₯

πŸ‘‰ A new task to segment highly accurate objects from natural images.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…5,000+ HD images + accurate binary mask
βœ…IS-Net baseline in high-dim feature spaces
βœ…HCE: model vs. human interventions
βœ…Source code (should be) available soon

More: https://bit.ly/3ah2BDO
πŸ”₯13
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ”₯πŸ”₯ Neural Segmentation on fire πŸ”₯πŸ”₯

πŸ‘‰Novel methods for segmentation with mask calibration. Robustness++ in VOS.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Study: VOS robustness vs. perturbations
βœ…Adaptive object proxy (AOP) aggregation
βœ…Less errors due unstable pixel-level match
βœ…Code/models (should be) available soon

More: https://bit.ly/3yhIY6Q
πŸ‘15❀1πŸ”₯1
This media is not supported in your browser
VIEW IN TELEGRAM
😊😎 Seq-DeepFake via Transformers 😎😊

πŸ‘‰S-Lab opens Seq-DeepFake: Detecting Sequential DeepFake Manipulation

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Seq-DeepFake: sequences of facial edits
βœ…Dataset: 85k #deepfake manipulation
βœ…Powerful Seq-DeepFake Transformer
βœ…Code, dataset and models available!

More: https://bit.ly/3ACQXhi
πŸ‘15πŸ”₯2❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ¦’ Text2LIVE: Text-Driven Neural Editing πŸ¦’

πŸ‘‰#Amazon unveils a novel #AI for text-driven edit of videos. Insane! 🀯

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Semantic edits of real-world videos
βœ…Edit layer–RGBA representing target
βœ…Edit layers synthesized on single input
βœ…No masks or a pre-trained generator

More: https://bit.ly/3NVP6aE
🀯18πŸ‘9πŸ”₯8❀1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ“ŸπŸ“ŸAI-Designed Circuits with Deep RLπŸ“ŸπŸ“Ÿ

πŸ‘‰#Nvidia unveils an #AI to design circuits from scratch, smaller and faster than SOTA ones

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Parallel prefix circuits for Hi-Perf
βœ…RL framework to explore the circuit space
βœ…Smaller, Faster, Power-- from the scratch

More: https://bit.ly/3yY9dk7
🀯13πŸ‘5πŸ”₯3
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ‘½ Neural I2I with a few shoots πŸ‘½

πŸ‘‰#Alibaba unveils a novel portrait stylization. Limited samples (∼100) -> HD outputs

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Calibration first, translation later
βœ…Balanced distribution to calibrate bias
βœ…Spatially semantic constraints via geometry
βœ…Source code and models soon available!

More: https://bit.ly/3IwOmHO
❀10πŸ‘5😱1
This media is not supported in your browser
VIEW IN TELEGRAM
πŸ€Ήβ€β™‚οΈ K-Means Mask Transformer πŸ€Ήβ€β™‚οΈ

πŸ‘‰#Google AI unveils kMaX-DeepLab, novel E2E method for segmentation

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…kMaX-DeepLab: k-means Mask Xformer
βœ…Rethinking relationship pixels / object
βœ…Cross-attention -> k-means clustering
βœ…The new SOTA on several dataset

More: https://bit.ly/3O2QV5I
πŸ”₯11πŸ‘2πŸ‘1