This media is not supported in your browser
VIEW IN TELEGRAM
π¦The new #MediaPipe is INSANE π¦
πGoogle just launched two new highly optimized body segmentation models
ππ’π π‘π₯π’π π‘ππ¬:
β Full body 3D pose
β Designed for yoga, fitness & dance
β Measurements for virtual tailor
β Selfie Segmentation on call
More: https://bit.ly/3s6sjjx
πGoogle just launched two new highly optimized body segmentation models
ππ’π π‘π₯π’π π‘ππ¬:
β Full body 3D pose
β Designed for yoga, fitness & dance
β Measurements for virtual tailor
β Selfie Segmentation on call
More: https://bit.ly/3s6sjjx
π5π₯4π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯Έ Clothed avatars for #metaverse π₯Έ
πTelepresence, AR/VR, anthropometry, and virtual try-on.
ππ’π π‘π₯π’π π‘ππ¬:
β Differential loss of explicit mesh
β Details via neural rendering
β Explicit mesh updating
β Consistency loss for quality++
β Hi-Fi surfaces by S.S. optimization
More: https://bit.ly/3ohAN6d
πTelepresence, AR/VR, anthropometry, and virtual try-on.
ππ’π π‘π₯π’π π‘ππ¬:
β Differential loss of explicit mesh
β Details via neural rendering
β Explicit mesh updating
β Consistency loss for quality++
β Hi-Fi surfaces by S.S. optimization
More: https://bit.ly/3ohAN6d
π₯6π2π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦JoJoGAN: One Shot Face Stylizationπ¦
πUIUC researchers unveil a novel method for one-shot image stylization.
ππ’π π‘π₯π’π π‘ππ¬:
β Stylization from single input
β Finetuning StyleGAN for stylization
β No supervision, good generalization
β MIT License (commercial allowed)
More: https://bit.ly/3ASVzyb
πUIUC researchers unveil a novel method for one-shot image stylization.
ππ’π π‘π₯π’π π‘ππ¬:
β Stylization from single input
β Finetuning StyleGAN for stylization
β No supervision, good generalization
β MIT License (commercial allowed)
More: https://bit.ly/3ASVzyb
β€5π2π2
This media is not supported in your browser
VIEW IN TELEGRAM
π§¦SOTA in OOD detection for safer #AIπ§¦
πOut-of-distribution (OOD) detection produces wrong/overconfident predictions.
ππ’π π‘π₯π’π π‘ππ¬:
β Novel framework for OOD
β Synthesizing virtual outliers
β Novel unknown-aware training
β Code and model available
More: https://bit.ly/3JnFIL9
πOut-of-distribution (OOD) detection produces wrong/overconfident predictions.
ππ’π π‘π₯π’π π‘ππ¬:
β Novel framework for OOD
β Synthesizing virtual outliers
β Novel unknown-aware training
β Code and model available
More: https://bit.ly/3JnFIL9
π₯3π2π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π
StyleGAN-XL neural synthesisπ
πFrom TΓΌbingen, StyleGAN-XL: new SOTA for large diverse dataset.
ππ’π π‘π₯π’π π‘ππ¬:
β First 1024p-gen for large data
β Growing strategy on StyleGAN3
β Beyond the narrow domains
β Pivotal Tuning Inversion (TPI)
β SOTA vs. GAN & diffusion models
More: https://bit.ly/3HK9MQk
πFrom TΓΌbingen, StyleGAN-XL: new SOTA for large diverse dataset.
ππ’π π‘π₯π’π π‘ππ¬:
β First 1024p-gen for large data
β Growing strategy on StyleGAN3
β Beyond the narrow domains
β Pivotal Tuning Inversion (TPI)
β SOTA vs. GAN & diffusion models
More: https://bit.ly/3HK9MQk
π₯6π1
This media is not supported in your browser
VIEW IN TELEGRAM
πThis keypoint is pure GLUEπ
πKeypoints play a central role in computer vision.
ππ’π π‘π₯π’π π‘ππ¬:
β Novel Object-centric keypoint
β Novel sim2real training method
β Intra-salience / inter-distinctness
β Enforcing semantic consistency
β Close to fully-supervised method!
More: https://bit.ly/3rth1qh
πKeypoints play a central role in computer vision.
ππ’π π‘π₯π’π π‘ππ¬:
β Novel Object-centric keypoint
β Novel sim2real training method
β Intra-salience / inter-distinctness
β Enforcing semantic consistency
β Close to fully-supervised method!
More: https://bit.ly/3rth1qh
π₯5π₯°1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π‘ LEDNet: seeing in the dark π‘
πResearchers from NTU unveil LEDNet to see in the dark
ππ’π π‘π₯π’π π‘ππ¬:
β Novel data synthesis for low-light
β Low-light/deblurring dataset
β 12k low-blur/normal-sharp pairs
β LEDNet: lowlight + deblurring
More: https://bit.ly/3HIyYqM
πResearchers from NTU unveil LEDNet to see in the dark
ππ’π π‘π₯π’π π‘ππ¬:
β Novel data synthesis for low-light
β Low-light/deblurring dataset
β 12k low-blur/normal-sharp pairs
β LEDNet: lowlight + deblurring
More: https://bit.ly/3HIyYqM
π6π4π₯3π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π©βπ¦°Back in the 50's with GANπ©βπ¦°
ππ’π π‘π₯π’π π‘ππ¬:
β A few thousand vintage faces
β Models available for download
β Stylegan2-ffhqu-1024x1024
β NO Commercial allowed
More: https://bit.ly/3LlOyKX
ππ’π π‘π₯π’π π‘ππ¬:
β A few thousand vintage faces
β Models available for download
β Stylegan2-ffhqu-1024x1024
β NO Commercial allowed
More: https://bit.ly/3LlOyKX
π€―2β€1π±1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦ VNCA: bio-inspired generative model π¦
πA novel generative model loosely inspired by the biological processes of cellular growth and differentiation
ππ’π π‘π₯π’π π‘ππ¬:
β Variational Neural Cellular Automata
β Probabilistic generative model
β Learn from common vector format
β Learn purely s.o. generative process
β Far away from SOTA, but interesting
More: https://bit.ly/3oGb2wG
πA novel generative model loosely inspired by the biological processes of cellular growth and differentiation
ππ’π π‘π₯π’π π‘ππ¬:
β Variational Neural Cellular Automata
β Probabilistic generative model
β Learn from common vector format
β Learn purely s.o. generative process
β Far away from SOTA, but interesting
More: https://bit.ly/3oGb2wG
π4π₯1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
πBlock-NeRF: Neural View Synthesisπ
πLarge-scale scene reconstruction by multiple compact NeRFs that each fit into memory.
ππ’π π‘π₯π’π π‘ππ¬:
β Berkeley + Google + Waymo = π€―
β Scaling NeRF to city-scale scenes
β Trick: multiple simple NeRFs
β Time decoupled, arbitrarily large scene
β Data over months & different conditions
More: https://bit.ly/3GGVHBV
πLarge-scale scene reconstruction by multiple compact NeRFs that each fit into memory.
ππ’π π‘π₯π’π π‘ππ¬:
β Berkeley + Google + Waymo = π€―
β Scaling NeRF to city-scale scenes
β Trick: multiple simple NeRFs
β Time decoupled, arbitrarily large scene
β Data over months & different conditions
More: https://bit.ly/3GGVHBV
π4π₯3π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯¬HW-Accelerated Neuro-Evolutionπ₯¬
πScalable, general purpose, hardware accelerated neuro-evolution toolkit by Google
ππ’π π‘π₯π’π π‘ππ¬:
β Parallel on multiple TPU/GPUs
β Neuro-evo algorithms with NNs
β WaterWorld, Abstract paint, more
β From Google, not an official product
β Code under Apache License 2.0
More: https://bit.ly/3szEi9w
πScalable, general purpose, hardware accelerated neuro-evolution toolkit by Google
ππ’π π‘π₯π’π π‘ππ¬:
β Parallel on multiple TPU/GPUs
β Neuro-evo algorithms with NNs
β WaterWorld, Abstract paint, more
β From Google, not an official product
β Code under Apache License 2.0
More: https://bit.ly/3szEi9w
π3π₯2π€―1π±1
This media is not supported in your browser
VIEW IN TELEGRAM
π DeepETA: #Uber ETA via #AIπ
πUber unveils the low-latency deep architecture for global ETA prediction
ππ’π π‘π₯π’π π‘ππ¬:
β Latency / Accuracy / Generality
β 7 NNs architectures tested
β Encoder-decoder + Self-Attention
β Linear transformer (kernel trick)
β Feature sparsity for speed
More: https://bit.ly/3gFWmJh
πUber unveils the low-latency deep architecture for global ETA prediction
ππ’π π‘π₯π’π π‘ππ¬:
β Latency / Accuracy / Generality
β 7 NNs architectures tested
β Encoder-decoder + Self-Attention
β Linear transformer (kernel trick)
β Feature sparsity for speed
More: https://bit.ly/3gFWmJh
π3π₯1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
βοΈCLIPasso: Semantic Sketching via CLIPβοΈ
πSketching method guided by geometric and semantic simplifications (CLIP)
ππ’π π‘π₯π’π π‘ππ¬:
β EPFL, TAU and IDC Herzliya
β CLIP image encoder for sketching
β Sketching as a set of Bezier curves
β Param-optimization on CLIP-loss
β Source code and models available
More: https://bit.ly/3oLEDF4
πSketching method guided by geometric and semantic simplifications (CLIP)
ππ’π π‘π₯π’π π‘ππ¬:
β EPFL, TAU and IDC Herzliya
β CLIP image encoder for sketching
β Sketching as a set of Bezier curves
β Param-optimization on CLIP-loss
β Source code and models available
More: https://bit.ly/3oLEDF4
π₯2π₯°2π1
This media is not supported in your browser
VIEW IN TELEGRAM
πͺSAHI: slicing detection/segmentationπͺ
πAn open-source lightweight library for large scale object detection & instance segmentation
ππ’π π‘π₯π’π π‘ππ¬:
β Slicing Aided Hyper Inference
β Large-scale detection/segment.
β Sliced inference and merging
β Utils for conversion, slicing, etc.
β Code licensed under MIT License
More: https://bit.ly/3uMJoBZ
πAn open-source lightweight library for large scale object detection & instance segmentation
ππ’π π‘π₯π’π π‘ππ¬:
β Slicing Aided Hyper Inference
β Large-scale detection/segment.
β Sliced inference and merging
β Utils for conversion, slicing, etc.
β Code licensed under MIT License
More: https://bit.ly/3uMJoBZ
π₯3β€2π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π100,000,000 image-text pairs!π
πLarge-scale Chinese cross-modal dataset for benchmarking different multi-modal pre-training methods.
ππ’π π‘π₯π’π π‘ππ¬:
β 100 Million <image, text> pairs
β >200px size, aspect ratio (1/3~3)
β Models of ResNet, ViT & SwinT
β Methods: CLIP, FILIP and LiT
β Privacy/Sensitive words π€
More: https://bit.ly/34BqlzX
πLarge-scale Chinese cross-modal dataset for benchmarking different multi-modal pre-training methods.
ππ’π π‘π₯π’π π‘ππ¬:
β 100 Million <image, text> pairs
β >200px size, aspect ratio (1/3~3)
β Models of ResNet, ViT & SwinT
β Methods: CLIP, FILIP and LiT
β Privacy/Sensitive words π€
More: https://bit.ly/34BqlzX
π5π€1
This media is not supported in your browser
VIEW IN TELEGRAM
π§33 Million synthetic pedestriansπ§
πA novel large, fully synthetic dataset
ππ’π π‘π₯π’π π‘ππ¬:
β Exploiting the #gta5 engine
β 764 full-HD videos @20 fps
β 33M+ person instances
β BBs & segmentation masks
β 2D/3D keypoints & depth
More: https://bit.ly/36njlY1
πA novel large, fully synthetic dataset
ππ’π π‘π₯π’π π‘ππ¬:
β Exploiting the #gta5 engine
β 764 full-HD videos @20 fps
β 33M+ person instances
β BBs & segmentation masks
β 2D/3D keypoints & depth
More: https://bit.ly/36njlY1
π6π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯Marker-free 6D-point trackingπ₯
πFull position and rotation of skeletal joints, with only a RGB frame
ππ’π π‘π₯π’π π‘ππ¬:
β Full 3-axis joint rotations
β V-markers, emulating mocap
β #3D from monocular with NN
β Generalization, no retraining
β SOTA rotation/position est.
More: https://bit.ly/34GdoF5
πFull position and rotation of skeletal joints, with only a RGB frame
ππ’π π‘π₯π’π π‘ππ¬:
β Full 3-axis joint rotations
β V-markers, emulating mocap
β #3D from monocular with NN
β Generalization, no retraining
β SOTA rotation/position est.
More: https://bit.ly/34GdoF5
π₯12π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π§Ό Synthetic dataset for #Retail π§Ό
πA large-scale photorealistic synthetic dataset with annotations for semantic segmentation, instance segmentation, depth estimation, and object detection.
ππ’π π‘π₯π’π π‘ππ¬:
β Dataset from Standard.AI
β 2,134 unique scenes
β 25k+ annotated samples
β Introducing the "change detection"
β Multi-view representation learning
β NonCommercial-ShareAlike 4.0
More: https://bit.ly/3uXqubB
πA large-scale photorealistic synthetic dataset with annotations for semantic segmentation, instance segmentation, depth estimation, and object detection.
ππ’π π‘π₯π’π π‘ππ¬:
β Dataset from Standard.AI
β 2,134 unique scenes
β 25k+ annotated samples
β Introducing the "change detection"
β Multi-view representation learning
β NonCommercial-ShareAlike 4.0
More: https://bit.ly/3uXqubB
π€―6π₯°3π1π₯1π1
This media is not supported in your browser
VIEW IN TELEGRAM
π Graph Neural Nets Forecastingπ
πData-driven approach for forecasting global weather using graph neural networks
ππ’π π‘π₯π’π π‘ππ¬:
β Data-driven forecasting via GNNs
β Model: 6.7M parameters, float32
β 6-hours forecast in 0.04 secs.
β A 5-day forecast in 0.8 secs.
More: https://bit.ly/3LH4CXR
πData-driven approach for forecasting global weather using graph neural networks
ππ’π π‘π₯π’π π‘ππ¬:
β Data-driven forecasting via GNNs
β Model: 6.7M parameters, float32
β 6-hours forecast in 0.04 secs.
β A 5-day forecast in 0.8 secs.
More: https://bit.ly/3LH4CXR
π4π2π€1
Media is too big
VIEW IN TELEGRAM
π₯«Watch Those Words!π₯«
πBerkeley unveils a novel approach to discover cheap-fake and visually persuasive deep-fakes
ππ’π π‘π₯π’π π‘ππ¬:
β Regardless of falsification
β Semantic person-specific
β Word-conditioned analysis
β Generalization across fakes
More: https://bit.ly/3oXWmcd
πBerkeley unveils a novel approach to discover cheap-fake and visually persuasive deep-fakes
ππ’π π‘π₯π’π π‘ππ¬:
β Regardless of falsification
β Semantic person-specific
β Word-conditioned analysis
β Generalization across fakes
More: https://bit.ly/3oXWmcd
π5π±1