This media is not supported in your browser
VIEW IN TELEGRAM
πV2X-sim for #selfdriving is out!π
πV2X: collaboration between a vehicle and any surrounding entity
ππ’π π‘π₯π’π π‘ππ¬:
β Suitable for #selfdrivingcars
β Rec. from road & vehicles
β Multi-streams/perception
β Detection, tracking, & segmentation
β RGB, depth, semantic, BEV & LiDAR
More: https://bit.ly/3H6veOI
πV2X: collaboration between a vehicle and any surrounding entity
ππ’π π‘π₯π’π π‘ππ¬:
β Suitable for #selfdrivingcars
β Rec. from road & vehicles
β Multi-streams/perception
β Detection, tracking, & segmentation
β RGB, depth, semantic, BEV & LiDAR
More: https://bit.ly/3H6veOI
π₯6π€©1
This media is not supported in your browser
VIEW IN TELEGRAM
πInfinite Synthetic dataset for Fitnessπ
πOpensource synthetic images for fitness, single/multi-person, and realistic variation in lighting, camera angles, and occlusions
ππ’π π‘π₯π’π π‘ππ¬:
β 60k images, 1-5 avatars
β 15 categories, 21 variations
β Blender and ray-tracing
β SMPL-X + facial expression
β Cloth/skin tone sampled
β 147 4K HDRI panoramas
β Creative Commons 4.0
More: https://bit.ly/33B1R9q
πOpensource synthetic images for fitness, single/multi-person, and realistic variation in lighting, camera angles, and occlusions
ππ’π π‘π₯π’π π‘ππ¬:
β 60k images, 1-5 avatars
β 15 categories, 21 variations
β Blender and ray-tracing
β SMPL-X + facial expression
β Cloth/skin tone sampled
β 147 4K HDRI panoramas
β Creative Commons 4.0
More: https://bit.ly/33B1R9q
π€©5β€1π1
This media is not supported in your browser
VIEW IN TELEGRAM
β DITTO: Digital Twins from Interaction β
πDigitizing objects for #metaverse through interactive perception
ππ’π π‘π₯π’π π‘ππ¬:
β DIgital Twin of arTiculated Objects
β Geometry & kinematic articulation
β Articulation & 3D via perception
β Source code under MIT License
More:https://bit.ly/3LMazCV
πDigitizing objects for #metaverse through interactive perception
ππ’π π‘π₯π’π π‘ππ¬:
β DIgital Twin of arTiculated Objects
β Geometry & kinematic articulation
β Articulation & 3D via perception
β Source code under MIT License
More:https://bit.ly/3LMazCV
π₯5β€2π1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π€ Robotic Telekinesis from Youtube π€
πCMU unveils a Robot that observes humans and imitates their actions in real-time
ππ’π π‘π₯π’π π‘ππ¬:
β Enabling robo-hand teleoperation
β Suitable for untrained operator
β Single uncalibrated RGB camera
β Leveraging unlabeled #youtube
β No active fine-tuning or setup
β No collision via Adv-Training
More: https://bit.ly/3H7zUnh
πCMU unveils a Robot that observes humans and imitates their actions in real-time
ππ’π π‘π₯π’π π‘ππ¬:
β Enabling robo-hand teleoperation
β Suitable for untrained operator
β Single uncalibrated RGB camera
β Leveraging unlabeled #youtube
β No active fine-tuning or setup
β No collision via Adv-Training
More: https://bit.ly/3H7zUnh
π₯3π€―2π1π1
This media is not supported in your browser
VIEW IN TELEGRAM
πDIGAN: #AI for video generationπ
πA novel INR-based generative adversarial network for video generation
ππ’π π‘π₯π’π π‘ππ¬:
β Dynamics-aware generator
β INR-based clip generator
β Manipulating space/time
β Identifying unnatural motion
More: https://bit.ly/3H6sHE4
πA novel INR-based generative adversarial network for video generation
ππ’π π‘π₯π’π π‘ππ¬:
β Dynamics-aware generator
β INR-based clip generator
β Manipulating space/time
β Identifying unnatural motion
More: https://bit.ly/3H6sHE4
π₯4π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦FILM Neural Frame Interpolationπ¦
πFrame interpolation that synthesizes multiple intermediate frames from two input images with large in-between motion
ππ’π π‘π₯π’π π‘ππ¬:
β Single unified network
β High quality output
β SOTA on the Xiph
β Apache License 2.0
More: https://bit.ly/3pl4ZxH
πFrame interpolation that synthesizes multiple intermediate frames from two input images with large in-between motion
ππ’π π‘π₯π’π π‘ππ¬:
β Single unified network
β High quality output
β SOTA on the Xiph
β Apache License 2.0
More: https://bit.ly/3pl4ZxH
π₯5π2π₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
πNeural Maintenance via listeningπ
πNovel neural-method to detect whether a machine is "healthy" or requires maintenance
ππ’π π‘π₯π’π π‘ππ¬:
β Defects at an early stage
β FDWT, fast discrete wavelet
β Learnable wavelet/denoising
β Unsupervised learnable FDWT
β The new SOTA in PM
More: https://bit.ly/3hiKWeX
πNovel neural-method to detect whether a machine is "healthy" or requires maintenance
ππ’π π‘π₯π’π π‘ππ¬:
β Defects at an early stage
β FDWT, fast discrete wavelet
β Learnable wavelet/denoising
β Unsupervised learnable FDWT
β The new SOTA in PM
More: https://bit.ly/3hiKWeX
π€―6π€1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦π¨ StyleGAN on Internet pics π¦π¨
πStyleGAN on raw uncurated images collected from Internet
ππ’π π‘π₯π’π π‘ππ¬:
β Outliers & multi-modal
β Self-distillation approach
β Self-filtering of outliers
β Perceptual clustering
More: https://bit.ly/33Z1d5H
πStyleGAN on raw uncurated images collected from Internet
ππ’π π‘π₯π’π π‘ππ¬:
β Outliers & multi-modal
β Self-distillation approach
β Self-filtering of outliers
β Perceptual clustering
More: https://bit.ly/33Z1d5H
β€2π1π₯1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦The new SOTA for Unsupervised π¦
πSelf-supervised transformer to discover objects in images
ππ’π π‘π₯π’π π‘ππ¬:
β Visual tokens as nodes in graph
β Edges as connectivity score
β The second smallest eV = fg
β Suitable for unsupervised saliency
β Weakly supervised obj. detection
β Code under MIT License
More: https://bit.ly/3sqbFg3
πSelf-supervised transformer to discover objects in images
ππ’π π‘π₯π’π π‘ππ¬:
β Visual tokens as nodes in graph
β Edges as connectivity score
β The second smallest eV = fg
β Suitable for unsupervised saliency
β Weakly supervised obj. detection
β Code under MIT License
More: https://bit.ly/3sqbFg3
π4π₯3π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯¦ GAN-generated CryptoPunks π₯¦
πA simple (and funny) SN-GAN to generate cryptopunks
ππ’π π‘π₯π’π π‘ππ¬:
β Spectral normalization (2018)
β Easy to incorporate into training
β A project by Teddy Koker π©
More: https://bit.ly/35C1rQI
πA simple (and funny) SN-GAN to generate cryptopunks
ππ’π π‘π₯π’π π‘ππ¬:
β Spectral normalization (2018)
β Easy to incorporate into training
β A project by Teddy Koker π©
More: https://bit.ly/35C1rQI
β€3π3π1π1
This media is not supported in your browser
VIEW IN TELEGRAM
π€ͺSEER: self-AI from BILLIONS picπ€ͺ
πMETA + INRIA trained models on billions of random images without any pre-processing or assumptions
ππ’π π‘π₯π’π π‘ππ¬:
β Self-supervised on pics from web
β Discovering properties in datasets
β More fair, less biased & less harmful
β Better OOD generalization
β Source code available!
More: https://bit.ly/3vy69dd
πMETA + INRIA trained models on billions of random images without any pre-processing or assumptions
ππ’π π‘π₯π’π π‘ππ¬:
β Self-supervised on pics from web
β Discovering properties in datasets
β More fair, less biased & less harmful
β Better OOD generalization
β Source code available!
More: https://bit.ly/3vy69dd
π₯4π3π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π²A novel AI-controllable synthesisπ²
πModeling local semantic parts separately and synthesizing images in a compositional way
ππ’π π‘π₯π’π π‘ππ¬:
β Structure & texture locally controlled
β Disentanglement between areas
β Fine-grained editing of images
β Extendible via transfer learning
β Just accepted to #CVPR2022
More: https://bit.ly/3IBgkBy
πModeling local semantic parts separately and synthesizing images in a compositional way
ππ’π π‘π₯π’π π‘ππ¬:
β Structure & texture locally controlled
β Disentanglement between areas
β Fine-grained editing of images
β Extendible via transfer learning
β Just accepted to #CVPR2022
More: https://bit.ly/3IBgkBy
π±3π€―2β€1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯£ #AI-Generation with Dream Fields π₯£
πNeural rendering with multi-modal image and text representations
ππ’π π‘π₯π’π π‘ππ¬:
β Aligned image & text models
β 3D from natural language
β No additional data
β D.F. neural-scene
More: https://bit.ly/3Mhwm5D
πNeural rendering with multi-modal image and text representations
ππ’π π‘π₯π’π π‘ππ¬:
β Aligned image & text models
β 3D from natural language
β No additional data
β D.F. neural-scene
More: https://bit.ly/3Mhwm5D
π10π1
This media is not supported in your browser
VIEW IN TELEGRAM
πͺ Mip-NeRF 360 for unbounded scenes πͺ
πAn extension of NeRF to overcome the challenges presented by unbounded scenes
ππ’π π‘π₯π’π π‘ππ¬:
β Realistic synthesized views
β Intricate/unbounded scenes
β Detailed depth maps
β Mean-squared error -54%
β No code provided π₯
More: https://bit.ly/36ZxsD4
πAn extension of NeRF to overcome the challenges presented by unbounded scenes
ππ’π π‘π₯π’π π‘ππ¬:
β Realistic synthesized views
β Intricate/unbounded scenes
β Detailed depth maps
β Mean-squared error -54%
β No code provided π₯
More: https://bit.ly/36ZxsD4
π€―4β€1
This media is not supported in your browser
VIEW IN TELEGRAM
π PINA: personal Neural Avatar π
πA novel method to acquire neural avatars from RGB-D videos
ππ’π π‘π₯π’π π‘ππ¬:
β A virtual copy of themselves
β Realistic clothing deformations
β Shape & non-rigid deformation
β Avatars from RGB-D sequences
β Creative Commons Zero v1.0
More: https://bit.ly/3HAtRIh
πA novel method to acquire neural avatars from RGB-D videos
ππ’π π‘π₯π’π π‘ππ¬:
β A virtual copy of themselves
β Realistic clothing deformations
β Shape & non-rigid deformation
β Avatars from RGB-D sequences
β Creative Commons Zero v1.0
More: https://bit.ly/3HAtRIh
π4β€1π1π1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦ EfficientVIS: new SOTA for VIS π¦
πSimultaneous classification, segmentation, and tracking multiple object instances in videos
ππ’π π‘π₯π’π π‘ππ¬:
β Efficient and fully end-to-end
β Iterative query-video interaction
β First RoI-wise clip-level RT-VIS
β Requires 15Γ fewer epochs
More: https://bit.ly/3KfqurN
πSimultaneous classification, segmentation, and tracking multiple object instances in videos
ππ’π π‘π₯π’π π‘ππ¬:
β Efficient and fully end-to-end
β Iterative query-video interaction
β First RoI-wise clip-level RT-VIS
β Requires 15Γ fewer epochs
More: https://bit.ly/3KfqurN
π10π₯3π1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π #AI-clips from single frameπ
πMoving objects in #3D while generating a video by a sequence of desired actions
ππ’π π‘π₯π’π π‘ππ¬:
β A playable environments
β A single starting imageπ€―
β Controllable camera
β Unsupervised learning
More: https://bit.ly/35VDrYO
πMoving objects in #3D while generating a video by a sequence of desired actions
ππ’π π‘π₯π’π π‘ππ¬:
β A playable environments
β A single starting imageπ€―
β Controllable camera
β Unsupervised learning
More: https://bit.ly/35VDrYO
β€3π1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π§Kubric: AI dataset generatorπ§
πOpen-source #Python framework for photo-realistic scenes: full control, rich annotations, TBs of fresh data π€―
ππ’π π‘π₯π’π π‘ππ¬:
β Synthetic datasets with GT
β From NeRF to optical flow
β Full control over data
β Ok privacy & licensing
β Apache License 2.0
More: https://bit.ly/3hQCaFs
πOpen-source #Python framework for photo-realistic scenes: full control, rich annotations, TBs of fresh data π€―
ππ’π π‘π₯π’π π‘ππ¬:
β Synthetic datasets with GT
β From NeRF to optical flow
β Full control over data
β Ok privacy & licensing
β Apache License 2.0
More: https://bit.ly/3hQCaFs
π₯6π1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
πͺΒ΅Transfer for enormous NNs πͺ
πMicrosoft unveils how to tune enormous neural networks
ππ’π π‘π₯π’π π‘ππ¬:
β New HP tuning: Β΅Transfer
β Zero-shot transfer to full-model
β Outperforming BERT-large
β Outperforming 6.7B GPT-3
β Code under MIT license
More: https://bit.ly/3qc37Ij
πMicrosoft unveils how to tune enormous neural networks
ππ’π π‘π₯π’π π‘ππ¬:
β New HP tuning: Β΅Transfer
β Zero-shot transfer to full-model
β Outperforming BERT-large
β Outperforming 6.7B GPT-3
β Code under MIT license
More: https://bit.ly/3qc37Ij
π₯2π€―2β€1
This media is not supported in your browser
VIEW IN TELEGRAM
π§Semantic via only text supervisionπ§
πGroupViT with a text encoder on a large-scale image-text dataset: semantic with any pixel-level annotations in training!
ππ’π π‘π₯π’π π‘ππ¬:
β Hierarc. Grouping Vision Transf.
β Additional text encoder
β NO pixel-level annotations
β Semantic-seg task via zero-shot
β Source code available soon
More:https://bit.ly/3hPGeWr
πGroupViT with a text encoder on a large-scale image-text dataset: semantic with any pixel-level annotations in training!
ππ’π π‘π₯π’π π‘ππ¬:
β Hierarc. Grouping Vision Transf.
β Additional text encoder
β NO pixel-level annotations
β Semantic-seg task via zero-shot
β Source code available soon
More:https://bit.ly/3hPGeWr
π6π₯°1π€―1