This media is not supported in your browser
VIEW IN TELEGRAM
π¦π¨ StyleGAN on Internet pics π¦π¨
πStyleGAN on raw uncurated images collected from Internet
ππ’π π‘π₯π’π π‘ππ¬:
β Outliers & multi-modal
β Self-distillation approach
β Self-filtering of outliers
β Perceptual clustering
More: https://bit.ly/33Z1d5H
πStyleGAN on raw uncurated images collected from Internet
ππ’π π‘π₯π’π π‘ππ¬:
β Outliers & multi-modal
β Self-distillation approach
β Self-filtering of outliers
β Perceptual clustering
More: https://bit.ly/33Z1d5H
β€2π1π₯1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦The new SOTA for Unsupervised π¦
πSelf-supervised transformer to discover objects in images
ππ’π π‘π₯π’π π‘ππ¬:
β Visual tokens as nodes in graph
β Edges as connectivity score
β The second smallest eV = fg
β Suitable for unsupervised saliency
β Weakly supervised obj. detection
β Code under MIT License
More: https://bit.ly/3sqbFg3
πSelf-supervised transformer to discover objects in images
ππ’π π‘π₯π’π π‘ππ¬:
β Visual tokens as nodes in graph
β Edges as connectivity score
β The second smallest eV = fg
β Suitable for unsupervised saliency
β Weakly supervised obj. detection
β Code under MIT License
More: https://bit.ly/3sqbFg3
π4π₯3π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯¦ GAN-generated CryptoPunks π₯¦
πA simple (and funny) SN-GAN to generate cryptopunks
ππ’π π‘π₯π’π π‘ππ¬:
β Spectral normalization (2018)
β Easy to incorporate into training
β A project by Teddy Koker π©
More: https://bit.ly/35C1rQI
πA simple (and funny) SN-GAN to generate cryptopunks
ππ’π π‘π₯π’π π‘ππ¬:
β Spectral normalization (2018)
β Easy to incorporate into training
β A project by Teddy Koker π©
More: https://bit.ly/35C1rQI
β€3π3π1π1
This media is not supported in your browser
VIEW IN TELEGRAM
π€ͺSEER: self-AI from BILLIONS picπ€ͺ
πMETA + INRIA trained models on billions of random images without any pre-processing or assumptions
ππ’π π‘π₯π’π π‘ππ¬:
β Self-supervised on pics from web
β Discovering properties in datasets
β More fair, less biased & less harmful
β Better OOD generalization
β Source code available!
More: https://bit.ly/3vy69dd
πMETA + INRIA trained models on billions of random images without any pre-processing or assumptions
ππ’π π‘π₯π’π π‘ππ¬:
β Self-supervised on pics from web
β Discovering properties in datasets
β More fair, less biased & less harmful
β Better OOD generalization
β Source code available!
More: https://bit.ly/3vy69dd
π₯4π3π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π²A novel AI-controllable synthesisπ²
πModeling local semantic parts separately and synthesizing images in a compositional way
ππ’π π‘π₯π’π π‘ππ¬:
β Structure & texture locally controlled
β Disentanglement between areas
β Fine-grained editing of images
β Extendible via transfer learning
β Just accepted to #CVPR2022
More: https://bit.ly/3IBgkBy
πModeling local semantic parts separately and synthesizing images in a compositional way
ππ’π π‘π₯π’π π‘ππ¬:
β Structure & texture locally controlled
β Disentanglement between areas
β Fine-grained editing of images
β Extendible via transfer learning
β Just accepted to #CVPR2022
More: https://bit.ly/3IBgkBy
π±3π€―2β€1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯£ #AI-Generation with Dream Fields π₯£
πNeural rendering with multi-modal image and text representations
ππ’π π‘π₯π’π π‘ππ¬:
β Aligned image & text models
β 3D from natural language
β No additional data
β D.F. neural-scene
More: https://bit.ly/3Mhwm5D
πNeural rendering with multi-modal image and text representations
ππ’π π‘π₯π’π π‘ππ¬:
β Aligned image & text models
β 3D from natural language
β No additional data
β D.F. neural-scene
More: https://bit.ly/3Mhwm5D
π10π1
This media is not supported in your browser
VIEW IN TELEGRAM
πͺ Mip-NeRF 360 for unbounded scenes πͺ
πAn extension of NeRF to overcome the challenges presented by unbounded scenes
ππ’π π‘π₯π’π π‘ππ¬:
β Realistic synthesized views
β Intricate/unbounded scenes
β Detailed depth maps
β Mean-squared error -54%
β No code provided π₯
More: https://bit.ly/36ZxsD4
πAn extension of NeRF to overcome the challenges presented by unbounded scenes
ππ’π π‘π₯π’π π‘ππ¬:
β Realistic synthesized views
β Intricate/unbounded scenes
β Detailed depth maps
β Mean-squared error -54%
β No code provided π₯
More: https://bit.ly/36ZxsD4
π€―4β€1
This media is not supported in your browser
VIEW IN TELEGRAM
π PINA: personal Neural Avatar π
πA novel method to acquire neural avatars from RGB-D videos
ππ’π π‘π₯π’π π‘ππ¬:
β A virtual copy of themselves
β Realistic clothing deformations
β Shape & non-rigid deformation
β Avatars from RGB-D sequences
β Creative Commons Zero v1.0
More: https://bit.ly/3HAtRIh
πA novel method to acquire neural avatars from RGB-D videos
ππ’π π‘π₯π’π π‘ππ¬:
β A virtual copy of themselves
β Realistic clothing deformations
β Shape & non-rigid deformation
β Avatars from RGB-D sequences
β Creative Commons Zero v1.0
More: https://bit.ly/3HAtRIh
π4β€1π1π1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦ EfficientVIS: new SOTA for VIS π¦
πSimultaneous classification, segmentation, and tracking multiple object instances in videos
ππ’π π‘π₯π’π π‘ππ¬:
β Efficient and fully end-to-end
β Iterative query-video interaction
β First RoI-wise clip-level RT-VIS
β Requires 15Γ fewer epochs
More: https://bit.ly/3KfqurN
πSimultaneous classification, segmentation, and tracking multiple object instances in videos
ππ’π π‘π₯π’π π‘ππ¬:
β Efficient and fully end-to-end
β Iterative query-video interaction
β First RoI-wise clip-level RT-VIS
β Requires 15Γ fewer epochs
More: https://bit.ly/3KfqurN
π10π₯3π1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π #AI-clips from single frameπ
πMoving objects in #3D while generating a video by a sequence of desired actions
ππ’π π‘π₯π’π π‘ππ¬:
β A playable environments
β A single starting imageπ€―
β Controllable camera
β Unsupervised learning
More: https://bit.ly/35VDrYO
πMoving objects in #3D while generating a video by a sequence of desired actions
ππ’π π‘π₯π’π π‘ππ¬:
β A playable environments
β A single starting imageπ€―
β Controllable camera
β Unsupervised learning
More: https://bit.ly/35VDrYO
β€3π1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π§Kubric: AI dataset generatorπ§
πOpen-source #Python framework for photo-realistic scenes: full control, rich annotations, TBs of fresh data π€―
ππ’π π‘π₯π’π π‘ππ¬:
β Synthetic datasets with GT
β From NeRF to optical flow
β Full control over data
β Ok privacy & licensing
β Apache License 2.0
More: https://bit.ly/3hQCaFs
πOpen-source #Python framework for photo-realistic scenes: full control, rich annotations, TBs of fresh data π€―
ππ’π π‘π₯π’π π‘ππ¬:
β Synthetic datasets with GT
β From NeRF to optical flow
β Full control over data
β Ok privacy & licensing
β Apache License 2.0
More: https://bit.ly/3hQCaFs
π₯6π1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
πͺΒ΅Transfer for enormous NNs πͺ
πMicrosoft unveils how to tune enormous neural networks
ππ’π π‘π₯π’π π‘ππ¬:
β New HP tuning: Β΅Transfer
β Zero-shot transfer to full-model
β Outperforming BERT-large
β Outperforming 6.7B GPT-3
β Code under MIT license
More: https://bit.ly/3qc37Ij
πMicrosoft unveils how to tune enormous neural networks
ππ’π π‘π₯π’π π‘ππ¬:
β New HP tuning: Β΅Transfer
β Zero-shot transfer to full-model
β Outperforming BERT-large
β Outperforming 6.7B GPT-3
β Code under MIT license
More: https://bit.ly/3qc37Ij
π₯2π€―2β€1
This media is not supported in your browser
VIEW IN TELEGRAM
π§Semantic via only text supervisionπ§
πGroupViT with a text encoder on a large-scale image-text dataset: semantic with any pixel-level annotations in training!
ππ’π π‘π₯π’π π‘ππ¬:
β Hierarc. Grouping Vision Transf.
β Additional text encoder
β NO pixel-level annotations
β Semantic-seg task via zero-shot
β Source code available soon
More:https://bit.ly/3hPGeWr
πGroupViT with a text encoder on a large-scale image-text dataset: semantic with any pixel-level annotations in training!
ππ’π π‘π₯π’π π‘ππ¬:
β Hierarc. Grouping Vision Transf.
β Additional text encoder
β NO pixel-level annotations
β Semantic-seg task via zero-shot
β Source code available soon
More:https://bit.ly/3hPGeWr
π6π₯°1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
β4D-Net: Lidar + RGB synchronizationβ
πGoogle unveils 4D-Net to combine 3D LiDAR and onboard RGB camera
ππ’π π‘π₯π’π π‘ππ¬:
β Point clouds/images in time
β Fusing multiple modalities in 4D
β Novel sampling for 3D P.C. in time
β New SOTA for 3D detection
More: https://bit.ly/3hZCFwN
πGoogle unveils 4D-Net to combine 3D LiDAR and onboard RGB camera
ππ’π π‘π₯π’π π‘ππ¬:
β Point clouds/images in time
β Fusing multiple modalities in 4D
β Novel sampling for 3D P.C. in time
β New SOTA for 3D detection
More: https://bit.ly/3hZCFwN
π12π₯2π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π New SOTA in video synthesis! π
πSnap unveils a novel multimodal video generation framework via text/images
ππ’π π‘π₯π’π π‘ππ¬:
β Multimodal video generation
β Bidirectional transformer
β Video token with self-learn.
β Text augmentation for robustness
β Longer sequence synthesis
More: https://bit.ly/3hZLXsG
πSnap unveils a novel multimodal video generation framework via text/images
ππ’π π‘π₯π’π π‘ππ¬:
β Multimodal video generation
β Bidirectional transformer
β Video token with self-learn.
β Text augmentation for robustness
β Longer sequence synthesis
More: https://bit.ly/3hZLXsG
π€―4π1π₯1π1
This media is not supported in your browser
VIEW IN TELEGRAM
π StyelNeRF source code is out π
π3D consistent photo-realistic image synthesis
ππ’π π‘π₯π’π π‘ππ¬:
β NeRF + style generator
β 3D consistency for HD image
β Novel regularization loss
β Camera control on styles
More: https://bit.ly/3t5xC49
π3D consistent photo-realistic image synthesis
ππ’π π‘π₯π’π π‘ππ¬:
β NeRF + style generator
β 3D consistency for HD image
β Novel regularization loss
β Camera control on styles
More: https://bit.ly/3t5xC49
π₯4π₯°1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦CLD-based generative #AI by #Nvidiaπ¦
πNvidia unveils a novel critically-damped Langevin diffusion (CLD) for synthetic data
ππ’π π‘π₯π’π π‘ππ¬:
β A novel diffusion process for SGMs
β Novel score matching obj. for CLD
β Hybrid denoising score matching
β Efficient sampling from CLD model
β Source code under a specific license
More: https://bit.ly/35MToBe
πNvidia unveils a novel critically-damped Langevin diffusion (CLD) for synthetic data
ππ’π π‘π₯π’π π‘ππ¬:
β A novel diffusion process for SGMs
β Novel score matching obj. for CLD
β Hybrid denoising score matching
β Efficient sampling from CLD model
β Source code under a specific license
More: https://bit.ly/35MToBe
π₯2π€©2π1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
πΈUFO: segmentation @140+ FPSπΈ
πUnified Transformer Framework for Co-Segmentation, Co-Saliency & Salient Object Detection. All in one!
ππ’π π‘π₯π’π π‘ππ¬:
β Unified framework for co-segmentation
β Co-segmentation, co-saliency, saliency
β Block for long-range dependencies
β Able to reach for 140 FPS in inference
β The new SOTA on multiple datasets
β Source code under MIT License
More: https://bit.ly/3KLd9b9
πUnified Transformer Framework for Co-Segmentation, Co-Saliency & Salient Object Detection. All in one!
ππ’π π‘π₯π’π π‘ππ¬:
β Unified framework for co-segmentation
β Co-segmentation, co-saliency, saliency
β Block for long-range dependencies
β Able to reach for 140 FPS in inference
β The new SOTA on multiple datasets
β Source code under MIT License
More: https://bit.ly/3KLd9b9
π₯6π1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π Multi-GANs fashion π
πGlobal GAN blended with other GANs for faces, shoes, etc.
ππ’π π‘π₯π’π π‘ππ¬:
β Multi-GAN framework
β Several generators
β Free of artifacts
β Full-body generation
β Humans, 1024x1024
More: https://bit.ly/37mfOte
πGlobal GAN blended with other GANs for faces, shoes, etc.
ππ’π π‘π₯π’π π‘ππ¬:
β Multi-GAN framework
β Several generators
β Free of artifacts
β Full-body generation
β Humans, 1024x1024
More: https://bit.ly/37mfOte
π₯2π2β€1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π§ FLAG: #3D Avatar Generation π§
πA flow-based generative model of the 3D human body from sparse observations.
ππ’π π‘π₯π’π π‘ππ¬:
β FLow-based Avatar Generative
β Conditional distro of body pose
β Exact pose likelihood process
β Invertibility -> oracle latent code
More: https://bit.ly/3CQpk3p
πA flow-based generative model of the 3D human body from sparse observations.
ππ’π π‘π₯π’π π‘ππ¬:
β FLow-based Avatar Generative
β Conditional distro of body pose
β Exact pose likelihood process
β Invertibility -> oracle latent code
More: https://bit.ly/3CQpk3p
π2π₯1π€―1