AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
96 photos
241 videos
12 files
1.28K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
Download Telegram
This media is not supported in your browser
VIEW IN TELEGRAM
ðŸĨŽHW-Accelerated Neuro-EvolutionðŸĨŽ

👉Scalable, general purpose, hardware accelerated neuro-evolution toolkit by Google

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Parallel on multiple TPU/GPUs
✅Neuro-evo algorithms with NNs
✅WaterWorld, Abstract paint, more
✅From Google, not an official product
✅Code under Apache License 2.0

More: https://bit.ly/3szEi9w
👍3ðŸ”Ĩ2ðŸĪŊ1ðŸ˜ą1
This media is not supported in your browser
VIEW IN TELEGRAM
🚛 DeepETA: #Uber ETA via #AI🚛

👉Uber unveils the low-latency deep architecture for global ETA prediction

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Latency / Accuracy / Generality
✅7 NNs architectures tested
✅Encoder-decoder + Self-Attention
✅Linear transformer (kernel trick)
✅Feature sparsity for speed

More: https://bit.ly/3gFWmJh
👍3ðŸ”Ĩ1ðŸĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
✏ïļCLIPasso: Semantic Sketching via CLIP✏ïļ

👉Sketching method guided by geometric and semantic simplifications (CLIP)

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅EPFL, TAU and IDC Herzliya
✅CLIP image encoder for sketching
✅Sketching as a set of Bezier curves
✅Param-optimization on CLIP-loss
✅Source code and models available

More: https://bit.ly/3oLEDF4
ðŸ”Ĩ2ðŸĨ°2👍1
This media is not supported in your browser
VIEW IN TELEGRAM
🊂SAHI: slicing detection/segmentation🊂

👉An open-source lightweight library for large scale object detection & instance segmentation

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Slicing Aided Hyper Inference
✅Large-scale detection/segment.
✅Sliced inference and merging
✅Utils for conversion, slicing, etc.
✅Code licensed under MIT License

More: https://bit.ly/3uMJoBZ
ðŸ”Ĩ3âĪ2ðŸĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
🎁100,000,000 image-text pairs!🎁

👉Large-scale Chinese cross-modal dataset for benchmarking different multi-modal pre-training methods.

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅100 Million <image, text> pairs
✅>200px size, aspect ratio (1/3~3)
✅Models of ResNet, ViT & SwinT
✅Methods: CLIP, FILIP and LiT
✅Privacy/Sensitive words ðŸĪ”

More: https://bit.ly/34BqlzX
👍5ðŸĪ”1
This media is not supported in your browser
VIEW IN TELEGRAM
🧁33 Million synthetic pedestrians🧁

👉A novel large, fully synthetic dataset

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Exploiting the #gta5 engine
✅764 full-HD videos @20 fps
✅33M+ person instances
✅BBs & segmentation masks
✅2D/3D keypoints & depth

More: https://bit.ly/36njlY1
👍6ðŸĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
ðŸĨMarker-free 6D-point trackingðŸĨ

👉Full position and rotation of skeletal joints, with only a RGB frame

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Full 3-axis joint rotations
✅V-markers, emulating mocap
✅#3D from monocular with NN
✅Generalization, no retraining
✅SOTA rotation/position est.

More: https://bit.ly/34GdoF5
ðŸ”Ĩ12ðŸĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
🧞 Synthetic dataset for #Retail 🧞

👉A large-scale photorealistic synthetic dataset with annotations for semantic segmentation, instance segmentation, depth estimation, and object detection.

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Dataset from Standard.AI
✅2,134 unique scenes
✅25k+ annotated samples
✅Introducing the "change detection"
✅Multi-view representation learning
✅NonCommercial-ShareAlike 4.0

More: https://bit.ly/3uXqubB
ðŸĪŊ6ðŸĨ°3👍1ðŸ”Ĩ1👏1
This media is not supported in your browser
VIEW IN TELEGRAM
🌈 Graph Neural Nets Forecasting🌈

👉Data-driven approach for forecasting global weather using graph neural networks

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Data-driven forecasting via GNNs
✅Model: 6.7M parameters, float32
✅6-hours forecast in 0.04 secs.
✅A 5-day forecast in 0.8 secs.

More: https://bit.ly/3LH4CXR
👏4👍2ðŸĪ”1
Media is too big
VIEW IN TELEGRAM
ðŸĨŦWatch Those Words!ðŸĨŦ

👉Berkeley unveils a novel approach to discover cheap-fake and visually persuasive deep-fakes

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Regardless of falsification
✅Semantic person-specific
✅Word-conditioned analysis
✅Generalization across fakes

More: https://bit.ly/3oXWmcd
👍5ðŸ˜ą1
This media is not supported in your browser
VIEW IN TELEGRAM
🔋V2X-sim for #selfdriving is out!🔋

👉V2X: collaboration between a vehicle and any surrounding entity

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Suitable for #selfdrivingcars
✅Rec. from road & vehicles
✅Multi-streams/perception
✅Detection, tracking, & segmentation
✅RGB, depth, semantic, BEV & LiDAR

More: https://bit.ly/3H6veOI
ðŸ”Ĩ6ðŸĪĐ1
This media is not supported in your browser
VIEW IN TELEGRAM
🍏Infinite Synthetic dataset for Fitness🍏

👉Opensource synthetic images for fitness, single/multi-person, and realistic variation in lighting, camera angles, and occlusions

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅60k images, 1-5 avatars
✅15 categories, 21 variations
✅Blender and ray-tracing
✅SMPL-X + facial expression
✅Cloth/skin tone sampled
✅147 4K HDRI panoramas
✅Creative Commons 4.0

More: https://bit.ly/33B1R9q
ðŸĪĐ5âĪ1👍1
This media is not supported in your browser
VIEW IN TELEGRAM
♊ DITTO: Digital Twins from Interaction ♊

👉Digitizing objects for #metaverse through interactive perception

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅DIgital Twin of arTiculated Objects
✅Geometry & kinematic articulation
✅Articulation & 3D via perception
✅Source code under MIT License

More:https://bit.ly/3LMazCV
ðŸ”Ĩ5âĪ2👍1ðŸĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
ðŸĪ– Robotic Telekinesis from Youtube ðŸĪ–

👉CMU unveils a Robot that observes humans and imitates their actions in real-time

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Enabling robo-hand teleoperation
✅Suitable for untrained operator
✅Single uncalibrated RGB camera
✅Leveraging unlabeled #youtube
✅No active fine-tuning or setup
✅No collision via Adv-Training

More: https://bit.ly/3H7zUnh
ðŸ”Ĩ3ðŸĪŊ2👍1👏1
This media is not supported in your browser
VIEW IN TELEGRAM
💄DIGAN: #AI for video generation💄

👉A novel INR-based generative adversarial network for video generation

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Dynamics-aware generator
✅INR-based clip generator
✅Manipulating space/time
✅Identifying unnatural motion

More: https://bit.ly/3H6sHE4
ðŸ”Ĩ4ðŸĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
ðŸĶ„FILM Neural Frame InterpolationðŸĶ„

👉Frame interpolation that synthesizes multiple intermediate frames from two input images with large in-between motion

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Single unified network
✅High quality output
✅SOTA on the Xiph
✅Apache License 2.0

More: https://bit.ly/3pl4ZxH
ðŸ”Ĩ5👍2ðŸĨ°1
This media is not supported in your browser
VIEW IN TELEGRAM
🔈Neural Maintenance via listening🔈

👉Novel neural-method to detect whether a machine is "healthy" or requires maintenance

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Defects at an early stage
✅FDWT, fast discrete wavelet
✅Learnable wavelet/denoising
✅Unsupervised learnable FDWT
✅The new SOTA in PM

More: https://bit.ly/3hiKWeX
ðŸĪŊ6ðŸĪ”1
This media is not supported in your browser
VIEW IN TELEGRAM
ðŸŸĶðŸŸĻ StyleGAN on Internet pics ðŸŸĶðŸŸĻ

👉StyleGAN on raw uncurated images collected from Internet

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Outliers & multi-modal
✅Self-distillation approach
✅Self-filtering of outliers
✅Perceptual clustering

More: https://bit.ly/33Z1d5H
âĪ2👍1ðŸ”Ĩ1ðŸĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
ðŸĶœThe new SOTA for Unsupervised ðŸĶœ

👉Self-supervised transformer to discover objects in images

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Visual tokens as nodes in graph
✅Edges as connectivity score
✅The second smallest eV = fg
✅Suitable for unsupervised saliency
✅Weakly supervised obj. detection
✅Code under MIT License


More: https://bit.ly/3sqbFg3
👍4ðŸ”Ĩ3ðŸĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
ðŸĨĶ GAN-generated CryptoPunks ðŸĨĶ

👉A simple (and funny) SN-GAN to generate cryptopunks

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Spectral normalization (2018)
✅Easy to incorporate into training
✅A project by Teddy Koker ðŸŽĐ

More: https://bit.ly/35C1rQI
âĪ3😁3👍1👏1
This media is not supported in your browser
VIEW IN TELEGRAM
ðŸĪŠSEER: self-AI from BILLIONS picðŸĪŠ

👉META + INRIA trained models on billions of random images without any pre-processing or assumptions

𝐇ðĒð ðĄðĨðĒð ðĄð­ðŽ:
✅Self-supervised on pics from web
✅Discovering properties in datasets
✅More fair, less biased & less harmful
✅Better OOD generalization
✅Source code available!

More: https://bit.ly/3vy69dd
ðŸ”Ĩ4👍3ðŸĪŊ1