AI with Papers - Artificial Intelligence & Deep Learning
15K subscribers
95 photos
235 videos
11 files
1.26K links
All the AI with papers. Every day fresh updates on Deep Learning, Machine Learning, and Computer Vision (with Papers).

Curated by Alessandro Ferrari | https://www.linkedin.com/in/visionarynet/
🦖The new #MediaPipe is INSANE 🦖

👉Google just launched two new highly optimized body segmentation models

Highlights:
✅Full body 3D pose
✅Designed for yoga, fitness & dance
✅Measurements for virtual tailor
✅Selfie Segmentation on call

More: https://bit.ly/3s6sjjx
🥸 Clothed avatars for #metaverse 🥸

👉Telepresence, AR/VR, anthropometry, and virtual try-on.

Highlights:
✅Differential loss of explicit mesh
✅Details via neural rendering
✅Explicit mesh updating
✅Consistency loss for quality++
✅Hi-Fi surfaces by self-supervised optimization

More: https://bit.ly/3ohAN6d
🦕JoJoGAN: One Shot Face Stylization🦕

👉UIUC researchers unveil a novel method for one-shot image stylization.

Highlights:
✅Stylization from single input
✅Finetuning StyleGAN for stylization
✅No supervision, good generalization
✅MIT License (commercial use allowed)

More: https://bit.ly/3ASVzyb
❀5πŸ‘2πŸ‘2
This media is not supported in your browser
VIEW IN TELEGRAM
🧦SOTA in OOD detection for safer #AI🧦

πŸ‘‰Out-of-distribution (OOD) detection produces wrong/overconfident predictions.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Novel framework for OOD
βœ…Synthesizing virtual outliers
βœ…Novel unknown-aware training
βœ…Code and model available

More: https://bit.ly/3JnFIL9
🌅StyleGAN-XL neural synthesis🌅

👉From Tübingen, StyleGAN-XL: new SOTA for large, diverse datasets.

Highlights:
✅First 1024p-gen for large data
✅Growing strategy on StyleGAN3
✅Beyond the narrow domains
✅Pivotal Tuning Inversion (PTI)
✅SOTA vs. GAN & diffusion models

More: https://bit.ly/3HK9MQk
📌This keypoint is pure GLUE📌

👉Keypoints play a central role in computer vision.

Highlights:
✅Novel Object-centric keypoint
✅Novel sim2real training method
✅Intra-salience / inter-distinctness
✅Enforcing semantic consistency
✅Close to fully-supervised method!

More: https://bit.ly/3rth1qh
💡 LEDNet: seeing in the dark 💡

👉Researchers from NTU unveil LEDNet to see in the dark

Highlights:
✅Novel data synthesis for low-light
✅Low-light/deblurring dataset
✅12k low-blur/normal-sharp pairs
✅LEDNet: lowlight + deblurring

More: https://bit.ly/3HIyYqM
👩‍🦰Back in the 50's with GAN👩‍🦰

Highlights:
✅A few thousand vintage faces
✅Models available for download
✅Stylegan2-ffhqu-1024x1024
✅No commercial use allowed

More: https://bit.ly/3LlOyKX
🀯2❀1😱1
This media is not supported in your browser
VIEW IN TELEGRAM
🦠VNCA: bio-inspired generative model 🦠

πŸ‘‰A novel generative model loosely inspired by the biological processes of cellular growth and differentiation

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Variational Neural Cellular Automata
βœ…Probabilistic generative model
βœ…Learn from common vector format
βœ…Learn purely s.o. generative process
βœ…Far away from SOTA, but interesting

More: https://bit.ly/3oGb2wG
🍊Block-NeRF: Neural View Synthesis🍊

πŸ‘‰Large-scale scene reconstruction by multiple compact NeRFs that each fit into memory.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Berkeley + Google + Waymo = 🀯
βœ…Scaling NeRF to city-scale scenes
βœ…Trick: multiple simple NeRFs
βœ…Time decoupled, arbitrarily large scene
βœ…Data over months & different conditions

More: https://bit.ly/3GGVHBV
🥬HW-Accelerated Neuro-Evolution🥬

👉Scalable, general purpose, hardware accelerated neuro-evolution toolkit by Google

Highlights:
✅Parallel on multiple TPU/GPUs
✅Neuro-evo algorithms with NNs
✅WaterWorld, Abstract paint, more
✅From Google, not an official product
✅Code under Apache License 2.0

More: https://bit.ly/3szEi9w
🚛 DeepETA: #Uber ETA via #AI🚛

👉Uber unveils the low-latency deep architecture for global ETA prediction

Highlights:
✅Latency / Accuracy / Generality
✅7 NN architectures tested
✅Encoder-decoder + Self-Attention
✅Linear transformer (kernel trick)
✅Feature sparsity for speed

More: https://bit.ly/3gFWmJh
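
A rough sketch of the "linear transformer (kernel trick)" idea, not Uber's code. Assumptions: the feature map `phi` is elu(x) + 1 (a common choice, not confirmed by the post), and the helper names `quadratic_attention` and `linear_attention` are made up for illustration. The point: with a kernel in place of softmax, keys and values can be summarised once, so the n x n attention matrix is never built.

```python
import math

def phi(x):
    # Assumed positive feature map: elu(x) + 1
    return x + 1.0 if x > 0 else math.exp(x)

def quadratic_attention(Q, K, V):
    """Reference version: scores every query against every key (n x n work)."""
    out = []
    for q in Q:
        fq = [phi(x) for x in q]
        weights = [sum(a * phi(b) for a, b in zip(fq, k)) for k in K]
        total = sum(weights)
        out.append([sum(w * v[j] for w, v in zip(weights, V)) / total
                    for j in range(len(V[0]))])
    return out

def linear_attention(Q, K, V):
    """Same result, but keys/values are folded into one small summary first."""
    d, m = len(K[0]), len(V[0])
    S = [[0.0] * m for _ in range(d)]  # sum over keys of phi(k) * v
    z = [0.0] * d                      # sum over keys of phi(k)
    for k, v in zip(K, V):
        fk = [phi(x) for x in k]
        for i in range(d):
            z[i] += fk[i]
            for j in range(m):
                S[i][j] += fk[i] * v[j]
    out = []
    for q in Q:
        fq = [phi(x) for x in q]
        norm = sum(a * b for a, b in zip(fq, z))
        out.append([sum(fq[i] * S[i][j] for i in range(d)) / norm
                    for j in range(m)])
    return out
```

Both functions return the same values up to rounding; the second one costs time linear in the number of keys, which is what matters for a low-latency service.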
✏️CLIPasso: Semantic Sketching via CLIP✏️

πŸ‘‰Sketching method guided by geometric and semantic simplifications (CLIP)

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…EPFL, TAU and IDC Herzliya
βœ…CLIP image encoder for sketching
βœ…Sketching as a set of Bezier curves
βœ…Param-optimization on CLIP-loss
βœ…Source code and models available

More: https://bit.ly/3oLEDF4
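
To make "a sketch as a set of Bézier curves" concrete, here is a minimal illustration of how one stroke could be evaluated from its control points. Assumptions: cubic curves with four 2D control points, and the hypothetical helpers `cubic_bezier` and `sample_stroke`; the CLIP-loss optimization of those control points is not shown.

```python
def cubic_bezier(p0, p1, p2, p3, t):
    """Point on a cubic Bezier curve at parameter t in [0, 1]."""
    u = 1.0 - t
    # Bernstein weights of the four control points
    weights = (u ** 3, 3 * u * u * t, 3 * u * t * t, t ** 3)
    points = (p0, p1, p2, p3)
    return tuple(sum(w * p[i] for w, p in zip(weights, points))
                 for i in range(2))

def sample_stroke(ctrl, n=16):
    """Sample n points along one stroke given its 4 control points (n >= 2)."""
    return [cubic_bezier(*ctrl, t=i / (n - 1)) for i in range(n)]

# One stroke; a sketch would be a list of such control-point sets.
stroke = ((0.0, 0.0), (0.0, 1.0), (1.0, 1.0), (1.0, 0.0))
polyline = sample_stroke(stroke, n=5)
```

Since every sampled point is a smooth function of the control points, the control points are the natural parameters to optimize.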
🪂SAHI: slicing detection/segmentation🪂

👉An open-source lightweight library for large scale object detection & instance segmentation

Highlights:
✅Slicing Aided Hyper Inference
✅Large-scale detection/segment.
✅Sliced inference and merging
✅Utils for conversion, slicing, etc.
✅Code licensed under MIT License

More: https://bit.ly/3uMJoBZ
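
A minimal sketch of the slicing idea behind "sliced inference and merging". This is not the SAHI API: `slice_boxes` and `to_full_image` are hypothetical helpers, and the 512 px slice size and 0.2 overlap are assumed values. The image is cut into overlapping windows, a detector runs on each window, and its boxes are shifted back to full-image coordinates before duplicates are merged (for example with NMS, not shown).

```python
def slice_boxes(width, height, slice_size=512, overlap=0.2):
    """Return (x0, y0, x1, y1) windows that tile the image with overlap."""
    step = max(1, int(slice_size * (1 - overlap)))

    def starts(extent):
        if extent <= slice_size:
            return [0]
        pos = list(range(0, extent - slice_size, step))
        pos.append(extent - slice_size)  # last window sits flush with the border
        return pos

    return [(x, y, min(x + slice_size, width), min(y + slice_size, height))
            for y in starts(height) for x in starts(width)]

def to_full_image(box, window):
    """Shift a box predicted inside a window back to full-image coordinates."""
    x0, y0 = window[0], window[1]
    return (box[0] + x0, box[1] + y0, box[2] + x0, box[3] + y0)

windows = slice_boxes(1000, 1000)
```

Small objects cover more pixels relative to a slice than relative to the whole image, which is why slicing tends to help on large frames.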
🎁100,000,000 image-text pairs!🎁

πŸ‘‰Large-scale Chinese cross-modal dataset for benchmarking different multi-modal pre-training methods.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…100 Million <image, text> pairs
βœ…>200px size, aspect ratio (1/3~3)
βœ…Models of ResNet, ViT & SwinT
βœ…Methods: CLIP, FILIP and LiT
βœ…Privacy/Sensitive words πŸ€”

More: https://bit.ly/34BqlzX
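
The size and aspect-ratio rule above could look roughly like this as a filter. This is a sketch under assumptions, not the dataset's actual pipeline: `keep_image` is a hypothetical helper, and ">200px" is read here as "both sides larger than 200 px".

```python
def keep_image(width, height, min_side=200, max_ratio=3.0):
    """Keep images with both sides > min_side and aspect ratio within 1/3..3."""
    if min(width, height) <= min_side:
        return False
    # long side / short side <= 3 is the same as ratio in [1/3, 3]
    return max(width, height) / min(width, height) <= max_ratio

sizes = [(640, 480), (150, 800), (1000, 250), (900, 300)]
kept = [s for s in sizes if keep_image(*s)]
```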
🧁33 Million synthetic pedestrians🧁

πŸ‘‰A novel large, fully synthetic dataset

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Exploiting the #gta5 engine
βœ…764 full-HD videos @20 fps
βœ…33M+ person instances
βœ…BBs & segmentation masks
βœ…2D/3D keypoints & depth

More: https://bit.ly/36njlY1
Marker-free 6D-point tracking

👉Full position and rotation of skeletal joints, with only an RGB frame

Highlights:
✅Full 3-axis joint rotations
✅V-markers, emulating mocap
✅#3D from monocular with NN
✅Generalization, no retraining
✅SOTA rotation/position est.

More: https://bit.ly/34GdoF5
🧼 Synthetic dataset for #Retail 🧼

πŸ‘‰A large-scale photorealistic synthetic dataset with annotations for semantic segmentation, instance segmentation, depth estimation, and object detection.

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Dataset from Standard.AI
βœ…2,134 unique scenes
βœ…25k+ annotated samples
βœ…Introducing the "change detection"
βœ…Multi-view representation learning
βœ…NonCommercial-ShareAlike 4.0

More: https://bit.ly/3uXqubB
🌈 Graph Neural Nets Forecasting🌈

πŸ‘‰Data-driven approach for forecasting global weather using graph neural networks

𝐇𝐒𝐠𝐑π₯𝐒𝐠𝐑𝐭𝐬:
βœ…Data-driven forecasting via GNNs
βœ…Model: 6.7M parameters, float32
βœ…6-hours forecast in 0.04 secs.
βœ…A 5-day forecast in 0.8 secs.

More: https://bit.ly/3LH4CXR
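
The two timings in the post are consistent with each other if the model is rolled forward one 6-hour step at a time. A quick check, assuming an autoregressive rollout whose cost grows linearly with the number of steps (`rollout_seconds` is a hypothetical helper):

```python
STEP_HOURS = 6        # one model step covers 6 hours (from the post)
STEP_SECONDS = 0.04   # time per step (from the post)

def rollout_seconds(horizon_hours):
    """Number of 6-hour steps and total time, assuming linear cost per step."""
    steps = horizon_hours // STEP_HOURS
    return steps, steps * STEP_SECONDS

steps, seconds = rollout_seconds(5 * 24)
print(steps, round(seconds, 2))
```

Five days is 20 steps, and 20 x 0.04 s gives about 0.8 s, matching the quoted figure.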
🥫Watch Those Words!🥫

👉Berkeley unveils a novel approach to detect both cheap-fakes and visually persuasive deep-fakes

Highlights:
✅Works regardless of falsification type
✅Semantic, person-specific cues
✅Word-conditioned analysis
✅Generalization across fakes

More: https://bit.ly/3oXWmcd