AI with Papers - Artificial Intelligence & Deep Learning
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/

🦕JoJoGAN: One-Shot Face Stylization🦕

👉UIUC researchers unveil a novel method for one-shot image stylization.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Stylization from a single input
✅Finetuning StyleGAN for stylization
✅No supervision, good generalization
✅MIT License (commercial use allowed)

More: https://bit.ly/3ASVzyb

🧦SOTA in OOD detection for safer #AI🧦

👉Networks tend to make wrong, overconfident predictions on out-of-distribution (OOD) inputs; this work targets detecting such inputs.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Novel framework for OOD detection
✅Synthesizing virtual outliers
✅Novel unknown-aware training
✅Code and model available

More: https://bit.ly/3JnFIL9

🌅StyleGAN-XL neural synthesis🌅

👉From Tübingen, StyleGAN-XL: a new SOTA on large, diverse datasets.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅First 1024p generation on large data
✅Growing strategy on StyleGAN3
✅Beyond the narrow domains
✅Pivotal Tuning Inversion (PTI)
✅SOTA vs. GAN & diffusion models

More: https://bit.ly/3HK9MQk

📌This keypoint is pure GLUE📌

👉Keypoints play a central role in computer vision.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Novel object-centric keypoints
✅Novel sim2real training method
✅Intra-salience / inter-distinctness
✅Enforcing semantic consistency
✅Close to fully-supervised methods!

More: https://bit.ly/3rth1qh

💡 LEDNet: seeing in the dark 💡

👉Researchers from NTU unveil LEDNet to see in the dark.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Novel data synthesis for low-light
✅Low-light/deblurring dataset
✅12k low-blur/normal-sharp pairs
✅LEDNet: low-light + deblurring

More: https://bit.ly/3HIyYqM

👩‍🦰Back in the '50s with GAN👩‍🦰

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅A few thousand vintage faces
✅Models available for download
✅Stylegan2-ffhqu-1024x1024
✅No commercial use allowed

More: https://bit.ly/3LlOyKX

🦠VNCA: bio-inspired generative model 🦠

👉A novel generative model loosely inspired by the biological processes of cellular growth and differentiation.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Variational Neural Cellular Automata
✅Probabilistic generative model
✅Learns from a common vector format
✅Learns a purely self-organising generative process
✅Far away from SOTA, but interesting

More: https://bit.ly/3oGb2wG

🍊Block-NeRF: Neural View Synthesis🍊

👉Large-scale scene reconstruction by multiple compact NeRFs that each fit into memory.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Berkeley + Google + Waymo = 🤯
✅Scaling NeRF to city-scale scenes
✅Trick: multiple simple NeRFs
✅Time decoupled, arbitrarily large scenes
✅Data over months & different conditions

More: https://bit.ly/3GGVHBV

🥬HW-Accelerated Neuro-Evolution🥬

👉Scalable, general-purpose, hardware-accelerated neuro-evolution toolkit by Google.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Parallel on multiple TPUs/GPUs
✅Neuro-evo algorithms with NNs
✅WaterWorld, abstract painting, and more
✅From Google, not an official product
✅Code under Apache License 2.0

More: https://bit.ly/3szEi9w

🚛 DeepETA: #Uber ETA via #AI🚛

👉Uber unveils its low-latency deep architecture for global ETA prediction.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Latency / Accuracy / Generality
✅7 NN architectures tested
✅Encoder-decoder + Self-Attention
✅Linear transformer (kernel trick)
✅Feature sparsity for speed
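
The "linear transformer (kernel trick)" highlight refers to replacing softmax attention with a kernel feature map, so the cost grows linearly with sequence length instead of quadratically. The sketch below is only an illustration of that idea: the `elu(x) + 1` feature map and all function names are assumptions, not details confirmed by this post or by Uber.

```python
import math

def phi(x):
    """Assumed feature map elu(x) + 1; it is always positive."""
    return x + 1.0 if x > 0 else math.exp(x)

def linear_attention(Q, K, V):
    """Kernel attention in O(n): accumulate key/value sums once, reuse per query."""
    d, dv = len(K[0]), len(V[0])
    kv = [[0.0] * dv for _ in range(d)]  # sum_j phi(k_j) v_j^T
    ksum = [0.0] * d                     # sum_j phi(k_j)
    for k, v in zip(K, V):
        fk = [phi(x) for x in k]
        for a in range(d):
            ksum[a] += fk[a]
            for b in range(dv):
                kv[a][b] += fk[a] * v[b]
    out = []
    for q in Q:
        fq = [phi(x) for x in q]
        norm = sum(fq[a] * ksum[a] for a in range(d))
        out.append([sum(fq[a] * kv[a][b] for a in range(d)) / norm
                    for b in range(dv)])
    return out

def quadratic_attention(Q, K, V):
    """Same kernel computed pairwise in O(n^2), for comparison only."""
    out = []
    for q in Q:
        w = [sum(phi(a) * phi(b) for a, b in zip(q, k)) for k in K]
        total = sum(w)
        out.append([sum(wi * v[b] for wi, v in zip(w, V)) / total
                    for b in range(len(V[0]))])
    return out
```

Both functions return the same values up to rounding; the linear version simply reorders the sums so that no n-by-n weight matrix is ever built.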

More: https://bit.ly/3gFWmJh

✏️CLIPasso: Semantic Sketching via CLIP✏️

👉Sketching method guided by geometric and semantic simplifications (CLIP).

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅EPFL, TAU and IDC Herzliya
✅CLIP image encoder for sketching
✅Sketching as a set of Bézier curves
✅Param-optimization on CLIP-loss
✅Source code and models available
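
To make "sketching as a set of Bézier curves" concrete, here is a minimal sketch that evaluates one stroke as a cubic Bézier curve. It assumes four control points per stroke; the function names and the example control points are hypothetical, and the CLIP-based optimization of those points is out of scope.

```python
def cubic_bezier(p0, p1, p2, p3, t):
    """Point on a cubic Bezier curve at parameter t in [0, 1]."""
    s = 1.0 - t
    b0, b1, b2, b3 = s ** 3, 3 * s * s * t, 3 * s * t * t, t ** 3
    return (b0 * p0[0] + b1 * p1[0] + b2 * p2[0] + b3 * p3[0],
            b0 * p0[1] + b1 * p1[1] + b2 * p2[1] + b3 * p3[1])

def sample_stroke(control_points, n=20):
    """Polyline approximation of one stroke, returned as n + 1 points."""
    return [cubic_bezier(*control_points, i / n) for i in range(n + 1)]

# Hypothetical control points for a single arch-shaped stroke.
stroke = [(0.0, 0.0), (0.0, 1.0), (1.0, 1.0), (1.0, 0.0)]
points = sample_stroke(stroke, n=4)
```

In a method of this kind the control points are the free parameters: a renderer draws the sampled strokes, and a loss on the rendered image drives the points toward a better sketch.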

More: https://bit.ly/3oLEDF4

🪂SAHI: slicing detection/segmentation🪂

👉An open-source lightweight library for large-scale object detection & instance segmentation.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Slicing Aided Hyper Inference
✅Large-scale detection/segment.
✅Sliced inference and merging
✅Utils for conversion, slicing, etc.
✅Code licensed under MIT License
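
The "sliced inference and merging" idea can be sketched without any detector: tile the image into overlapping windows, shift each window's detections back into full-image coordinates, then merge duplicates. This is a generic illustration, not SAHI's actual API; the function names, the 20% overlap, and the greedy NMS merge are all assumptions.

```python
def slice_windows(img_w, img_h, slice_w, slice_h, overlap=0.2):
    """Return (x0, y0, x1, y1) windows that cover the image with overlap."""
    step_x = max(1, int(slice_w * (1 - overlap)))
    step_y = max(1, int(slice_h * (1 - overlap)))
    windows, y = [], 0
    while True:
        y0 = min(y, max(img_h - slice_h, 0))  # clamp last row to the border
        x = 0
        while True:
            x0 = min(x, max(img_w - slice_w, 0))  # clamp last column
            windows.append((x0, y0, min(x0 + slice_w, img_w),
                            min(y0 + slice_h, img_h)))
            if x0 + slice_w >= img_w:
                break
            x += step_x
        if y0 + slice_h >= img_h:
            break
        y += step_y
    return windows

def to_full_image(det, window):
    """Shift a slice-local detection (x0, y0, x1, y1, score) to image coords."""
    x0, y0, x1, y1, score = det
    return (x0 + window[0], y0 + window[1], x1 + window[0], y1 + window[1], score)

def iou(a, b):
    """Intersection over union of two boxes."""
    ix = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union else 0.0

def merge(dets, thr=0.5):
    """Greedy NMS: keep the highest-scoring box among overlapping duplicates."""
    kept = []
    for d in sorted(dets, key=lambda d: d[4], reverse=True):
        if all(iou(d, k) < thr for k in kept):
            kept.append(d)
    return kept
```

An object that falls in the overlap between two windows is detected twice; after `to_full_image` the two boxes nearly coincide, so `merge` keeps only the higher-scoring one.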

More: https://bit.ly/3uMJoBZ

🎁100,000,000 image-text pairs!🎁

👉Large-scale Chinese cross-modal dataset for benchmarking different multi-modal pre-training methods.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅100 Million <image, text> pairs
✅Images >200 px, aspect ratio within 1/3~3
✅Models of ResNet, ViT & SwinT
✅Methods: CLIP, FILIP and LiT
✅Privacy/Sensitive words 🤔
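
The size and aspect-ratio rule above can be written as a small filter. This is a sketch under assumptions: it reads ">200 px" as applying to both sides, and the function name and defaults are illustrative rather than taken from the dataset's tooling.

```python
def keep_image(width, height, min_side=200, max_ratio=3.0):
    """True if both sides exceed min_side and width/height is within [1/3, 3]."""
    if width <= min_side or height <= min_side:
        return False  # too small on at least one side
    ratio = width / height
    return 1.0 / max_ratio <= ratio <= max_ratio

# Hypothetical (width, height) candidates.
candidates = [(640, 480), (150, 400), (1000, 250), (300, 800)]
kept = [size for size in candidates if keep_image(*size)]
```

Here the 150-pixel-wide image fails the size check and the 1000x250 image fails the aspect-ratio check, so only the first and last candidates remain.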

More: https://bit.ly/34BqlzX

🧁33 Million synthetic pedestrians🧁

👉A novel large, fully synthetic dataset.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Exploiting the #gta5 engine
✅764 full-HD videos @20 fps
✅33M+ person instances
✅BBs & segmentation masks
✅2D/3D keypoints & depth

More: https://bit.ly/36njlY1

Marker-free 6D-point tracking

👉Full position and rotation of skeletal joints, using only an RGB frame.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Full 3-axis joint rotations
✅Virtual markers, emulating mocap
✅#3D from monocular with NN
✅Generalization, no retraining
✅SOTA rotation/position est.

More: https://bit.ly/34GdoF5

🧼 Synthetic dataset for #Retail 🧼

👉A large-scale photorealistic synthetic dataset with annotations for semantic segmentation, instance segmentation, depth estimation, and object detection.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Dataset from Standard.AI
✅2,134 unique scenes
✅25k+ annotated samples
✅Introduces a "change detection" task
✅Multi-view representation learning
✅NonCommercial-ShareAlike 4.0

More: https://bit.ly/3uXqubB

🌈 Graph Neural Nets Forecasting🌈

👉Data-driven approach for forecasting global weather using graph neural networks.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Data-driven forecasting via GNNs
✅Model: 6.7M parameters, float32
✅6-hour forecast in 0.04 secs.
✅5-day forecast in 0.8 secs.
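
The two timings are consistent if the model is rolled out autoregressively in 6-hour steps: 5 days is 120 hours, or 20 steps, and 20 steps at 0.04 s each is about 0.8 s. The sketch below shows that arithmetic; the autoregressive rollout is an assumption inferred from the numbers, and the step function is a hypothetical stand-in for the GNN.

```python
STEP_HOURS = 6           # one model step, per the post
SECONDS_PER_STEP = 0.04  # runtime per step quoted in the post

def rollout(step_fn, state, horizon_hours):
    """Apply step_fn repeatedly; return the final state and the step count."""
    steps = horizon_hours // STEP_HOURS
    for _ in range(steps):
        state = step_fn(state)
    return state, steps

# Hypothetical stand-in for the GNN: it only tracks elapsed forecast hours.
final_hours, steps = rollout(lambda hours: hours + STEP_HOURS, 0, 5 * 24)
estimated_seconds = steps * SECONDS_PER_STEP  # 20 steps, roughly 0.8 s
```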

More: https://bit.ly/3LH4CXR

🥫Watch Those Words!🥫

👉Berkeley unveils a novel approach to detect cheap-fakes and visually persuasive deep-fakes.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Works regardless of falsification method
✅Semantic, person-specific cues
✅Word-conditioned analysis
✅Generalization across fakes

More: https://bit.ly/3oXWmcd

🔋V2X-sim for #selfdriving is out!🔋

👉V2X: collaboration between a vehicle and any surrounding entity.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅Suitable for #selfdrivingcars
✅Recordings from roadside & vehicles
✅Multi-streams/perception
✅Detection, tracking, & segmentation
✅RGB, depth, semantic, BEV & LiDAR

More: https://bit.ly/3H6veOI

🍏Infinite Synthetic dataset for Fitness🍏

👉Open-source synthetic images for fitness, single- and multi-person, with realistic variation in lighting, camera angles, and occlusions.

𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅60k images, 1-5 avatars
✅15 categories, 21 variations
✅Blender and ray-tracing
✅SMPL-X + facial expression
✅Cloth/skin tone sampled
✅147 4K HDRI panoramas
✅Creative Commons 4.0

More: https://bit.ly/33B1R9q