This media is not supported in your browser
VIEW IN TELEGRAM
ðĨŽHW-Accelerated Neuro-EvolutionðĨŽ
ðScalable, general purpose, hardware accelerated neuro-evolution toolkit by Google
ððĒð ðĄðĨðĒð ðĄððŽ:
â Parallel on multiple TPU/GPUs
â Neuro-evo algorithms with NNs
â WaterWorld, Abstract paint, more
â From Google, not an official product
â Code under Apache License 2.0
More: https://bit.ly/3szEi9w
ðScalable, general purpose, hardware accelerated neuro-evolution toolkit by Google
ððĒð ðĄðĨðĒð ðĄððŽ:
â Parallel on multiple TPU/GPUs
â Neuro-evo algorithms with NNs
â WaterWorld, Abstract paint, more
â From Google, not an official product
â Code under Apache License 2.0
More: https://bit.ly/3szEi9w
ð3ðĨ2ðĪŊ1ðą1
This media is not supported in your browser
VIEW IN TELEGRAM
ð DeepETA: #Uber ETA via #AIð
ðUber unveils the low-latency deep architecture for global ETA prediction
ððĒð ðĄðĨðĒð ðĄððŽ:
â Latency / Accuracy / Generality
â 7 NNs architectures tested
â Encoder-decoder + Self-Attention
â Linear transformer (kernel trick)
â Feature sparsity for speed
More: https://bit.ly/3gFWmJh
ðUber unveils the low-latency deep architecture for global ETA prediction
ððĒð ðĄðĨðĒð ðĄððŽ:
â Latency / Accuracy / Generality
â 7 NNs architectures tested
â Encoder-decoder + Self-Attention
â Linear transformer (kernel trick)
â Feature sparsity for speed
More: https://bit.ly/3gFWmJh
ð3ðĨ1ðĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
âïļCLIPasso: Semantic Sketching via CLIPâïļ
ðSketching method guided by geometric and semantic simplifications (CLIP)
ððĒð ðĄðĨðĒð ðĄððŽ:
â EPFL, TAU and IDC Herzliya
â CLIP image encoder for sketching
â Sketching as a set of Bezier curves
â Param-optimization on CLIP-loss
â Source code and models available
More: https://bit.ly/3oLEDF4
ðSketching method guided by geometric and semantic simplifications (CLIP)
ððĒð ðĄðĨðĒð ðĄððŽ:
â EPFL, TAU and IDC Herzliya
â CLIP image encoder for sketching
â Sketching as a set of Bezier curves
â Param-optimization on CLIP-loss
â Source code and models available
More: https://bit.ly/3oLEDF4
ðĨ2ðĨ°2ð1
This media is not supported in your browser
VIEW IN TELEGRAM
ðŠSAHI: slicing detection/segmentationðŠ
ðAn open-source lightweight library for large scale object detection & instance segmentation
ððĒð ðĄðĨðĒð ðĄððŽ:
â Slicing Aided Hyper Inference
â Large-scale detection/segment.
â Sliced inference and merging
â Utils for conversion, slicing, etc.
â Code licensed under MIT License
More: https://bit.ly/3uMJoBZ
ðAn open-source lightweight library for large scale object detection & instance segmentation
ððĒð ðĄðĨðĒð ðĄððŽ:
â Slicing Aided Hyper Inference
â Large-scale detection/segment.
â Sliced inference and merging
â Utils for conversion, slicing, etc.
â Code licensed under MIT License
More: https://bit.ly/3uMJoBZ
ðĨ3âĪ2ðĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
ð100,000,000 image-text pairs!ð
ðLarge-scale Chinese cross-modal dataset for benchmarking different multi-modal pre-training methods.
ððĒð ðĄðĨðĒð ðĄððŽ:
â 100 Million <image, text> pairs
â >200px size, aspect ratio (1/3~3)
â Models of ResNet, ViT & SwinT
â Methods: CLIP, FILIP and LiT
â Privacy/Sensitive words ðĪ
More: https://bit.ly/34BqlzX
ðLarge-scale Chinese cross-modal dataset for benchmarking different multi-modal pre-training methods.
ððĒð ðĄðĨðĒð ðĄððŽ:
â 100 Million <image, text> pairs
â >200px size, aspect ratio (1/3~3)
â Models of ResNet, ViT & SwinT
â Methods: CLIP, FILIP and LiT
â Privacy/Sensitive words ðĪ
More: https://bit.ly/34BqlzX
ð5ðĪ1
This media is not supported in your browser
VIEW IN TELEGRAM
ð§33 Million synthetic pedestriansð§
ðA novel large, fully synthetic dataset
ððĒð ðĄðĨðĒð ðĄððŽ:
â Exploiting the #gta5 engine
â 764 full-HD videos @20 fps
â 33M+ person instances
â BBs & segmentation masks
â 2D/3D keypoints & depth
More: https://bit.ly/36njlY1
ðA novel large, fully synthetic dataset
ððĒð ðĄðĨðĒð ðĄððŽ:
â Exploiting the #gta5 engine
â 764 full-HD videos @20 fps
â 33M+ person instances
â BBs & segmentation masks
â 2D/3D keypoints & depth
More: https://bit.ly/36njlY1
ð6ðĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
ðĨMarker-free 6D-point trackingðĨ
ðFull position and rotation of skeletal joints, with only a RGB frame
ððĒð ðĄðĨðĒð ðĄððŽ:
â Full 3-axis joint rotations
â V-markers, emulating mocap
â #3D from monocular with NN
â Generalization, no retraining
â SOTA rotation/position est.
More: https://bit.ly/34GdoF5
ðFull position and rotation of skeletal joints, with only a RGB frame
ððĒð ðĄðĨðĒð ðĄððŽ:
â Full 3-axis joint rotations
â V-markers, emulating mocap
â #3D from monocular with NN
â Generalization, no retraining
â SOTA rotation/position est.
More: https://bit.ly/34GdoF5
ðĨ12ðĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
ð§ž Synthetic dataset for #Retail ð§ž
ðA large-scale photorealistic synthetic dataset with annotations for semantic segmentation, instance segmentation, depth estimation, and object detection.
ððĒð ðĄðĨðĒð ðĄððŽ:
â Dataset from Standard.AI
â 2,134 unique scenes
â 25k+ annotated samples
â Introducing the "change detection"
â Multi-view representation learning
â NonCommercial-ShareAlike 4.0
More: https://bit.ly/3uXqubB
ðA large-scale photorealistic synthetic dataset with annotations for semantic segmentation, instance segmentation, depth estimation, and object detection.
ððĒð ðĄðĨðĒð ðĄððŽ:
â Dataset from Standard.AI
â 2,134 unique scenes
â 25k+ annotated samples
â Introducing the "change detection"
â Multi-view representation learning
â NonCommercial-ShareAlike 4.0
More: https://bit.ly/3uXqubB
ðĪŊ6ðĨ°3ð1ðĨ1ð1
This media is not supported in your browser
VIEW IN TELEGRAM
ð Graph Neural Nets Forecastingð
ðData-driven approach for forecasting global weather using graph neural networks
ððĒð ðĄðĨðĒð ðĄððŽ:
â Data-driven forecasting via GNNs
â Model: 6.7M parameters, float32
â 6-hours forecast in 0.04 secs.
â A 5-day forecast in 0.8 secs.
More: https://bit.ly/3LH4CXR
ðData-driven approach for forecasting global weather using graph neural networks
ððĒð ðĄðĨðĒð ðĄððŽ:
â Data-driven forecasting via GNNs
â Model: 6.7M parameters, float32
â 6-hours forecast in 0.04 secs.
â A 5-day forecast in 0.8 secs.
More: https://bit.ly/3LH4CXR
ð4ð2ðĪ1
Media is too big
VIEW IN TELEGRAM
ðĨŦWatch Those Words!ðĨŦ
ðBerkeley unveils a novel approach to discover cheap-fake and visually persuasive deep-fakes
ððĒð ðĄðĨðĒð ðĄððŽ:
â Regardless of falsification
â Semantic person-specific
â Word-conditioned analysis
â Generalization across fakes
More: https://bit.ly/3oXWmcd
ðBerkeley unveils a novel approach to discover cheap-fake and visually persuasive deep-fakes
ððĒð ðĄðĨðĒð ðĄððŽ:
â Regardless of falsification
â Semantic person-specific
â Word-conditioned analysis
â Generalization across fakes
More: https://bit.ly/3oXWmcd
ð5ðą1
This media is not supported in your browser
VIEW IN TELEGRAM
ðV2X-sim for #selfdriving is out!ð
ðV2X: collaboration between a vehicle and any surrounding entity
ððĒð ðĄðĨðĒð ðĄððŽ:
â Suitable for #selfdrivingcars
â Rec. from road & vehicles
â Multi-streams/perception
â Detection, tracking, & segmentation
â RGB, depth, semantic, BEV & LiDAR
More: https://bit.ly/3H6veOI
ðV2X: collaboration between a vehicle and any surrounding entity
ððĒð ðĄðĨðĒð ðĄððŽ:
â Suitable for #selfdrivingcars
â Rec. from road & vehicles
â Multi-streams/perception
â Detection, tracking, & segmentation
â RGB, depth, semantic, BEV & LiDAR
More: https://bit.ly/3H6veOI
ðĨ6ðĪĐ1
This media is not supported in your browser
VIEW IN TELEGRAM
ðInfinite Synthetic dataset for Fitnessð
ðOpensource synthetic images for fitness, single/multi-person, and realistic variation in lighting, camera angles, and occlusions
ððĒð ðĄðĨðĒð ðĄððŽ:
â 60k images, 1-5 avatars
â 15 categories, 21 variations
â Blender and ray-tracing
â SMPL-X + facial expression
â Cloth/skin tone sampled
â 147 4K HDRI panoramas
â Creative Commons 4.0
More: https://bit.ly/33B1R9q
ðOpensource synthetic images for fitness, single/multi-person, and realistic variation in lighting, camera angles, and occlusions
ððĒð ðĄðĨðĒð ðĄððŽ:
â 60k images, 1-5 avatars
â 15 categories, 21 variations
â Blender and ray-tracing
â SMPL-X + facial expression
â Cloth/skin tone sampled
â 147 4K HDRI panoramas
â Creative Commons 4.0
More: https://bit.ly/33B1R9q
ðĪĐ5âĪ1ð1
This media is not supported in your browser
VIEW IN TELEGRAM
â DITTO: Digital Twins from Interaction â
ðDigitizing objects for #metaverse through interactive perception
ððĒð ðĄðĨðĒð ðĄððŽ:
â DIgital Twin of arTiculated Objects
â Geometry & kinematic articulation
â Articulation & 3D via perception
â Source code under MIT License
More:https://bit.ly/3LMazCV
ðDigitizing objects for #metaverse through interactive perception
ððĒð ðĄðĨðĒð ðĄððŽ:
â DIgital Twin of arTiculated Objects
â Geometry & kinematic articulation
â Articulation & 3D via perception
â Source code under MIT License
More:https://bit.ly/3LMazCV
ðĨ5âĪ2ð1ðĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
ðĪ Robotic Telekinesis from Youtube ðĪ
ðCMU unveils a Robot that observes humans and imitates their actions in real-time
ððĒð ðĄðĨðĒð ðĄððŽ:
â Enabling robo-hand teleoperation
â Suitable for untrained operator
â Single uncalibrated RGB camera
â Leveraging unlabeled #youtube
â No active fine-tuning or setup
â No collision via Adv-Training
More: https://bit.ly/3H7zUnh
ðCMU unveils a Robot that observes humans and imitates their actions in real-time
ððĒð ðĄðĨðĒð ðĄððŽ:
â Enabling robo-hand teleoperation
â Suitable for untrained operator
â Single uncalibrated RGB camera
â Leveraging unlabeled #youtube
â No active fine-tuning or setup
â No collision via Adv-Training
More: https://bit.ly/3H7zUnh
ðĨ3ðĪŊ2ð1ð1
This media is not supported in your browser
VIEW IN TELEGRAM
ðDIGAN: #AI for video generationð
ðA novel INR-based generative adversarial network for video generation
ððĒð ðĄðĨðĒð ðĄððŽ:
â Dynamics-aware generator
â INR-based clip generator
â Manipulating space/time
â Identifying unnatural motion
More: https://bit.ly/3H6sHE4
ðA novel INR-based generative adversarial network for video generation
ððĒð ðĄðĨðĒð ðĄððŽ:
â Dynamics-aware generator
â INR-based clip generator
â Manipulating space/time
â Identifying unnatural motion
More: https://bit.ly/3H6sHE4
ðĨ4ðĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
ðĶFILM Neural Frame InterpolationðĶ
ðFrame interpolation that synthesizes multiple intermediate frames from two input images with large in-between motion
ððĒð ðĄðĨðĒð ðĄððŽ:
â Single unified network
â High quality output
â SOTA on the Xiph
â Apache License 2.0
More: https://bit.ly/3pl4ZxH
ðFrame interpolation that synthesizes multiple intermediate frames from two input images with large in-between motion
ððĒð ðĄðĨðĒð ðĄððŽ:
â Single unified network
â High quality output
â SOTA on the Xiph
â Apache License 2.0
More: https://bit.ly/3pl4ZxH
ðĨ5ð2ðĨ°1
This media is not supported in your browser
VIEW IN TELEGRAM
ðNeural Maintenance via listeningð
ðNovel neural-method to detect whether a machine is "healthy" or requires maintenance
ððĒð ðĄðĨðĒð ðĄððŽ:
â Defects at an early stage
â FDWT, fast discrete wavelet
â Learnable wavelet/denoising
â Unsupervised learnable FDWT
â The new SOTA in PM
More: https://bit.ly/3hiKWeX
ðNovel neural-method to detect whether a machine is "healthy" or requires maintenance
ððĒð ðĄðĨðĒð ðĄððŽ:
â Defects at an early stage
â FDWT, fast discrete wavelet
â Learnable wavelet/denoising
â Unsupervised learnable FDWT
â The new SOTA in PM
More: https://bit.ly/3hiKWeX
ðĪŊ6ðĪ1
This media is not supported in your browser
VIEW IN TELEGRAM
ðĶðĻ StyleGAN on Internet pics ðĶðĻ
ðStyleGAN on raw uncurated images collected from Internet
ððĒð ðĄðĨðĒð ðĄððŽ:
â Outliers & multi-modal
â Self-distillation approach
â Self-filtering of outliers
â Perceptual clustering
More: https://bit.ly/33Z1d5H
ðStyleGAN on raw uncurated images collected from Internet
ððĒð ðĄðĨðĒð ðĄððŽ:
â Outliers & multi-modal
â Self-distillation approach
â Self-filtering of outliers
â Perceptual clustering
More: https://bit.ly/33Z1d5H
âĪ2ð1ðĨ1ðĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
ðĶThe new SOTA for Unsupervised ðĶ
ðSelf-supervised transformer to discover objects in images
ððĒð ðĄðĨðĒð ðĄððŽ:
â Visual tokens as nodes in graph
â Edges as connectivity score
â The second smallest eV = fg
â Suitable for unsupervised saliency
â Weakly supervised obj. detection
â Code under MIT License
More: https://bit.ly/3sqbFg3
ðSelf-supervised transformer to discover objects in images
ððĒð ðĄðĨðĒð ðĄððŽ:
â Visual tokens as nodes in graph
â Edges as connectivity score
â The second smallest eV = fg
â Suitable for unsupervised saliency
â Weakly supervised obj. detection
â Code under MIT License
More: https://bit.ly/3sqbFg3
ð4ðĨ3ðĪŊ1
This media is not supported in your browser
VIEW IN TELEGRAM
ðĨĶ GAN-generated CryptoPunks ðĨĶ
ðA simple (and funny) SN-GAN to generate cryptopunks
ððĒð ðĄðĨðĒð ðĄððŽ:
â Spectral normalization (2018)
â Easy to incorporate into training
â A project by Teddy Koker ðĐ
More: https://bit.ly/35C1rQI
ðA simple (and funny) SN-GAN to generate cryptopunks
ððĒð ðĄðĨðĒð ðĄððŽ:
â Spectral normalization (2018)
â Easy to incorporate into training
â A project by Teddy Koker ðĐ
More: https://bit.ly/35C1rQI
âĪ3ð3ð1ð1
This media is not supported in your browser
VIEW IN TELEGRAM
ðĪŠSEER: self-AI from BILLIONS picðĪŠ
ðMETA + INRIA trained models on billions of random images without any pre-processing or assumptions
ððĒð ðĄðĨðĒð ðĄððŽ:
â Self-supervised on pics from web
â Discovering properties in datasets
â More fair, less biased & less harmful
â Better OOD generalization
â Source code available!
More: https://bit.ly/3vy69dd
ðMETA + INRIA trained models on billions of random images without any pre-processing or assumptions
ððĒð ðĄðĨðĒð ðĄððŽ:
â Self-supervised on pics from web
â Discovering properties in datasets
â More fair, less biased & less harmful
â Better OOD generalization
â Source code available!
More: https://bit.ly/3vy69dd
ðĨ4ð3ðĪŊ1