YOLOv5 real-time logo detector
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ Based on YOLOv5 family
✅ Google Colab + Azure
✅ 90k pics, training: 2 weeks
✅ Pretrained models/code
✅ GNU License v3.0
More: https://bit.ly/3r8Qoa7
RelTR: #AI scene-graphs
👉One-stage method for predicting object relationships from visual appearance only.
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ RelTR, end-to-end framework
✅ Classifying dense relationships
✅ Scene graphs on appearance only
✅ No combining entities & labeling
✅ Superior performance, faster
More: https://bit.ly/3r8k86Y
🥶SOTA in crowd analysis is INSANE🥶
👉Tencent unveils P2PNet to predict heads in images
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ Pure point counting/detecting
✅ Normalized Average Precision
✅ VGG16-like architecture
✅ Simultaneous point/confidence
✅ License: only academic
More: https://bit.ly/33UjoK0
OLSO: Transformers Optimization
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ Automagical with Hugging Face
✅ GPU-based optimizations
✅ Easy installation with pip
✅ Apache License 2.0
More: https://bit.ly/3r8wY58
🦾SOTA in robotic manipulation🦾
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ VCD: Visible Connectivity Dynamics
✅ VCG: Visible Connectivity Graph
✅ Dynamics model over this VCG
✅ Handling material, geometry, color
✅ SOTA vs. model-based/model-free RL
✅ Source code and models available
More: https://bit.ly/3HhusiH
VRT: new SOTA in super resolution
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ Image restoration via Swin
✅ Residual Swin Transf. Blocks
✅ SOTA in Artifact Reduction
✅ SOTA in Super-resolution
✅ SOTA in Denoising
✅ Parameters -67%!
✅ Non-commercial 🥲
More: https://bit.ly/3rfAta1
The new #MediaPipe is INSANE
👉Google just launched two new highly optimized body segmentation models
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ Full body 3D pose
✅ Designed for yoga, fitness & dance
✅ Measurements for virtual tailor
✅ Selfie Segmentation on call
More: https://bit.ly/3s6sjjx
🥸 Clothed avatars for #metaverse 🥸
👉Telepresence, AR/VR, anthropometry, and virtual try-on.
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ Differential loss of explicit mesh
✅ Details via neural rendering
✅ Explicit mesh updating
✅ Consistency loss for quality++
✅ Hi-Fi surfaces by S.S. optimization
More: https://bit.ly/3ohAN6d
JoJoGAN: One Shot Face Stylization
👉UIUC researchers unveil a novel method for one-shot image stylization.
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ Stylization from single input
✅ Finetuning StyleGAN for stylization
✅ No supervision, good generalization
✅ MIT License (commercial allowed)
More: https://bit.ly/3ASVzyb
🧦SOTA in OOD detection for safer #AI🧦
👉Models often produce wrong/overconfident predictions on out-of-distribution (OOD) inputs; OOD detection aims to flag them.
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ Novel framework for OOD
✅ Synthesizing virtual outliers
✅ Novel unknown-aware training
✅ Code and model available
More: https://bit.ly/3JnFIL9
StyleGAN-XL neural synthesis
👉From Tübingen, StyleGAN-XL: new SOTA for large diverse datasets.
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ First 1024p-gen for large data
✅ Growing strategy on StyleGAN3
✅ Beyond the narrow domains
✅ Pivotal Tuning Inversion (PTI)
✅ SOTA vs. GAN & diffusion models
More: https://bit.ly/3HK9MQk
This keypoint is pure GLUE
👉Keypoints play a central role in computer vision.
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ Novel Object-centric keypoint
✅ Novel sim2real training method
✅ Intra-salience / inter-distinctness
✅ Enforcing semantic consistency
✅ Close to fully-supervised method!
More: https://bit.ly/3rth1qh
LEDNet: seeing in the dark
👉Researchers from NTU unveil LEDNet to see in the dark
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ Novel data synthesis for low-light
✅ Low-light/deblurring dataset
✅ 12k low-blur/normal-sharp pairs
✅ LEDNet: lowlight + deblurring
More: https://bit.ly/3HIyYqM
👩‍🦰Back in the 50's with GAN👩‍🦰
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ A few thousand vintage faces
✅ Models available for download
✅ Stylegan2-ffhqu-1024x1024
✅ No commercial use allowed
More: https://bit.ly/3LlOyKX
🦠 VNCA: bio-inspired generative model 🦠
👉A novel generative model loosely inspired by the biological processes of cellular growth and differentiation
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ Variational Neural Cellular Automata
✅ Probabilistic generative model
✅ Learns from a common vector format
✅ Learns a purely self-organising generative process
✅ Far away from SOTA, but interesting
More: https://bit.ly/3oGb2wG
Block-NeRF: Neural View Synthesis
👉Large-scale scene reconstruction by multiple compact NeRFs that each fit into memory.
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ Berkeley + Google + Waymo = 🤯
✅ Scaling NeRF to city-scale scenes
✅ Trick: multiple simple NeRFs
✅ Time decoupled, arbitrarily large scene
✅ Data over months & different conditions
More: https://bit.ly/3GGVHBV
🥬HW-Accelerated Neuro-Evolution🥬
👉Scalable, general-purpose, hardware-accelerated neuro-evolution toolkit by Google
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ Parallel on multiple TPU/GPUs
✅ Neuro-evo algorithms with NNs
✅ WaterWorld, Abstract paint, more
✅ From Google, not an official product
✅ Code under Apache License 2.0
More: https://bit.ly/3szEi9w
DeepETA: #Uber ETA via #AI
👉Uber unveils a low-latency deep architecture for global ETA prediction
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ Latency / Accuracy / Generality
✅ 7 NN architectures tested
✅ Encoder-decoder + Self-Attention
✅ Linear transformer (kernel trick)
✅ Feature sparsity for speed
More: https://bit.ly/3gFWmJh
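To make the "linear transformer (kernel trick)" point concrete, here is a minimal pure-Python sketch of kernelized linear attention. It is not Uber's code: the feature map elu(x) + 1 and the function names are assumptions taken from common linear-attention formulations. The idea is to accumulate the key/value summary once, so the cost grows linearly with the number of keys instead of quadratically.

```python
import math


def feature_map(x):
    # elu(x) + 1 keeps every feature positive (assumed choice, not from the post)
    return [v + 1.0 if v > 0 else math.exp(v) for v in x]


def linear_attention(queries, keys, values):
    """Attention with weights phi(q).phi(k), computed in O(n) over the keys."""
    d_k, d_v = len(keys[0]), len(values[0])
    kv = [[0.0] * d_v for _ in range(d_k)]  # sum_j phi(k_j) v_j^T
    k_sum = [0.0] * d_k                     # sum_j phi(k_j)
    for k, v in zip(keys, values):
        fk = feature_map(k)
        for a in range(d_k):
            k_sum[a] += fk[a]
            for b in range(d_v):
                kv[a][b] += fk[a] * v[b]
    out = []
    for q in queries:
        fq = feature_map(q)
        norm = sum(fq[a] * k_sum[a] for a in range(d_k))
        out.append([sum(fq[a] * kv[a][b] for a in range(d_k)) / norm
                    for b in range(d_v)])
    return out
```

Each output row is a weighted average of the value rows, so it matches the usual quadratic formulation with the same kernel weights while never building the full query-by-key matrix.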
CLIPasso: Semantic Sketching via CLIP
👉Sketching method guided by geometric and semantic simplifications (CLIP)
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ EPFL, TAU and IDC Herzliya
✅ CLIP image encoder for sketching
✅ Sketching as a set of Bézier curves
✅ Param-optimization on CLIP-loss
✅ Source code and models available
More: https://bit.ly/3oLEDF4
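For readers new to the "sketch as a set of Bézier curves" idea, here is a small sketch of how one stroke can be evaluated from its control points. It assumes cubic curves with four 2D control points per stroke; the function names are illustrative and not taken from the CLIPasso code. In the method described above, the control points would be the parameters optimized against the CLIP loss; this snippet only shows the curve evaluation.

```python
def cubic_bezier(p0, p1, p2, p3, t):
    """Point at parameter t in [0, 1] on a cubic Bézier curve (Bernstein form)."""
    s = 1.0 - t
    weights = (s ** 3, 3 * s * s * t, 3 * s * t * t, t ** 3)
    points = (p0, p1, p2, p3)
    return tuple(sum(w * p[axis] for w, p in zip(weights, points))
                 for axis in range(2))


def sample_stroke(control_points, n=16):
    """Sample n evenly spaced parameter values along one stroke."""
    return [cubic_bezier(*control_points, t=i / (n - 1)) for i in range(n)]


stroke = ((0.0, 0.0), (0.0, 1.0), (1.0, 1.0), (1.0, 0.0))
polyline = sample_stroke(stroke, n=5)  # starts at p0, ends at p3
```

A sketch is then just a list of such strokes, each rasterized from its sampled polyline.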
SAHI: slicing detection/segmentation
👉An open-source lightweight library for large-scale object detection & instance segmentation
𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
✅ Slicing Aided Hyper Inference
✅ Large-scale detection/segment.
✅ Sliced inference and merging
✅ Utils for conversion, slicing, etc.
✅ Code licensed under MIT License
More: https://bit.ly/3uMJoBZ
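The "sliced inference and merging" idea can be sketched in a few lines of plain Python. This is not the SAHI API: `detect` is a hypothetical callable that receives a window and returns boxes in tile-local coordinates, and the merge step is a simple greedy IoU suppression chosen for illustration.

```python
def slice_windows(width, height, tile, overlap):
    """Return (x1, y1, x2, y2) windows covering the image with overlapping tiles."""
    stride = max(1, int(tile * (1.0 - overlap)))

    def starts(size):
        pos = list(range(0, max(size - tile, 0) + 1, stride))
        if pos[-1] + tile < size:  # add a final tile flush with the border
            pos.append(size - tile)
        return pos

    return [(x, y, min(x + tile, width), min(y + tile, height))
            for y in starts(height) for x in starts(width)]


def iou(a, b):
    iw = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0


def merge(boxes, iou_thr=0.5):
    """Greedy suppression: keep the highest-scoring box among overlapping ones."""
    kept = []
    for box in sorted(boxes, key=lambda b: b[4], reverse=True):
        if all(iou(box, k) < iou_thr for k in kept):
            kept.append(box)
    return kept


def sliced_predict(width, height, detect, tile=64, overlap=0.25, iou_thr=0.5):
    """Run `detect` on every tile, shift boxes to image coordinates, merge."""
    boxes = []
    for wx1, wy1, wx2, wy2 in slice_windows(width, height, tile, overlap):
        for x1, y1, x2, y2, score in detect((wx1, wy1, wx2, wy2)):
            boxes.append((x1 + wx1, y1 + wy1, x2 + wx1, y2 + wy1, score))
    return merge(boxes, iou_thr)
```

The overlap matters because an object cut by one tile border is usually seen whole in a neighbouring tile, and the merge step then removes the duplicate detections.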