This media is not supported in your browser
VIEW IN TELEGRAM
๐ฅท๐ฟ FCA: #3D Neural Camouflage ๐ฅท๐ฟ
๐#3D full-camouflage adversarial patch to fool neural detectors
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Attack by diff-neural render
โ E2E physical adversarial attack
โ Envs, vehicles & detectors
โ Source code available!
More: https://bit.ly/38kKyfa
๐#3D full-camouflage adversarial patch to fool neural detectors
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Attack by diff-neural render
โ E2E physical adversarial attack
โ Envs, vehicles & detectors
โ Source code available!
More: https://bit.ly/38kKyfa
๐5๐ฅ3๐คฏ2๐1
Media is too big
VIEW IN TELEGRAM
๐ One-Shot Object Pose ๐
๐A novel one-shot object pose estimator
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Visual localization pipeline for object pose
โ Handling novel objects without CAD model
โ Novel graph attention for 2D-3D matching
โ Large dataset for one-shot object pose
More: https://bit.ly/3MTogjJ
๐A novel one-shot object pose estimator
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Visual localization pipeline for object pose
โ Handling novel objects without CAD model
โ Novel graph attention for 2D-3D matching
โ Large dataset for one-shot object pose
More: https://bit.ly/3MTogjJ
๐ฅ11โค4๐2๐คฏ2
This media is not supported in your browser
VIEW IN TELEGRAM
โ๏ธSTEVE: Slot-TransformEr for VidEosโ๏ธ
๐STEVE: unsupervised model for object-centric learning in videos
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Adoption of a slot decoder (SLATE)
โ SLATE with slot-level recurrence model
โ Complex and naturalistic videos
โ Significantly outperforms previous SOTA
More: https://bit.ly/3PNxxM3
๐STEVE: unsupervised model for object-centric learning in videos
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Adoption of a slot decoder (SLATE)
โ SLATE with slot-level recurrence model
โ Complex and naturalistic videos
โ Significantly outperforms previous SOTA
More: https://bit.ly/3PNxxM3
๐ฅ7๐1๐คฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ฆ CogVideo: insane text-to-clip ๐ฆ
๐CogVideo: 9B-parameters world's first large scale open-source text-to-video ๐ต
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Largest open-source T2C transformer
โ Finetuning of text-to-image model
โ Multi-frame-rate hierarchical training
โ From pretrained model CogView2
More: https://bit.ly/3Gzfl4n
๐CogVideo: 9B-parameters world's first large scale open-source text-to-video ๐ต
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Largest open-source T2C transformer
โ Finetuning of text-to-image model
โ Multi-frame-rate hierarchical training
โ From pretrained model CogView2
More: https://bit.ly/3Gzfl4n
๐ฅ9๐6
This media is not supported in your browser
VIEW IN TELEGRAM
๐ฆTime-Aware Neural Voxels๐ฆ
๐TiNeuVox: "NeRF" with time-aware voxel features ๐ต
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Dynamic scene w/ optimizable structure
โ Temporal information in radiance net
โ Small/large motion w/ single-res of feats
โ 192ร faster than previous Hyper-NeRF
More: https://bit.ly/3wR4O08
๐TiNeuVox: "NeRF" with time-aware voxel features ๐ต
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Dynamic scene w/ optimizable structure
โ Temporal information in radiance net
โ Small/large motion w/ single-res of feats
โ 192ร faster than previous Hyper-NeRF
More: https://bit.ly/3wR4O08
๐11๐ฅ2๐คฏ1
๐ซNeural Anomaly Detection by AWS๐ซ
๐Ultra-competitive inference and SOTA for both detection and localization
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Locally aggregated, mid-level feats patch
โ Maximizing nominal information at test time
โ Reducing biases towards ImageNet classes
โ Image-level anomaly AUROC of up to 99.6%
More: https://bit.ly/3t7Ndjg
๐Ultra-competitive inference and SOTA for both detection and localization
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Locally aggregated, mid-level feats patch
โ Maximizing nominal information at test time
โ Reducing biases towards ImageNet classes
โ Image-level anomaly AUROC of up to 99.6%
More: https://bit.ly/3t7Ndjg
๐ฅ7๐คฏ3๐2
This media is not supported in your browser
VIEW IN TELEGRAM
๐น Project Skate from Google #AI ๐น
๐#AI tool to analyze the skateboarder's tricks in real-time
More: https://bit.ly/3zbQS3M
๐#AI tool to analyze the skateboarder's tricks in real-time
More: https://bit.ly/3zbQS3M
๐ฅ15๐คฉ3๐1
This media is not supported in your browser
VIEW IN TELEGRAM
๐งฌNeural Text2Human Generation๐งฌ
๐Text-driven neural human generation
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Full-body from a given human pose
โ Hierarchical texture-aware codebook
โ DeepFashion -> 44k Hi-Res images
โ Code and models available!
More: https://bit.ly/3Mdnpt0
๐Text-driven neural human generation
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Full-body from a given human pose
โ Hierarchical texture-aware codebook
โ DeepFashion -> 44k Hi-Res images
โ Code and models available!
More: https://bit.ly/3Mdnpt0
๐ฅ15๐1
๐งจEfficientFormers: 1.6ms inference ๐งจ
๐Transformers fast as MobileNet? Snap shows that on #iphone!
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Low latency on mobile, high performance!
โ Revisiting the design of ViT through latency
โ New dimension-consistent design paradigm
โ EfficientFormers: a new ViT for mobile!
More: https://bit.ly/3MdgW15
๐Transformers fast as MobileNet? Snap shows that on #iphone!
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Low latency on mobile, high performance!
โ Revisiting the design of ViT through latency
โ New dimension-consistent design paradigm
โ EfficientFormers: a new ViT for mobile!
More: https://bit.ly/3MdgW15
๐ฅ16๐1๐คฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ข Transformer-Based Sens-Fusion ๐ข
๐Updating TransFuser (CVPR21): image + LiDAR representations with self-attention
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Existing approach can't handle traffic ๐ข
โ Novel multi-modal fusion transformer
โ The new SOTA in driving performance
โ Reducing avg collisions per KM by 48%
โ Insights on current limitations of E2E
More: https://bit.ly/391dmd6
๐Updating TransFuser (CVPR21): image + LiDAR representations with self-attention
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Existing approach can't handle traffic ๐ข
โ Novel multi-modal fusion transformer
โ The new SOTA in driving performance
โ Reducing avg collisions per KM by 48%
โ Insights on current limitations of E2E
More: https://bit.ly/391dmd6
๐11๐ฅ2
๐ง๐ปโโ๏ธYogNet: neural yoga assistant๐ง๐ปโโ๏ธ
๐Multi-person yoga neural expert for 20 asanas
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ CNNs & reg.LSTMs + 3D-CNNs
โ Multi-person asanas in real-time
โ YAR: dataset for yoga & posture
โ 1206 videos, 2D RGB camera
More: https://bit.ly/3NncVbE
๐Multi-person yoga neural expert for 20 asanas
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ CNNs & reg.LSTMs + 3D-CNNs
โ Multi-person asanas in real-time
โ YAR: dataset for yoga & posture
โ 1206 videos, 2D RGB camera
More: https://bit.ly/3NncVbE
โค13๐1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ด Geogram: geometric algos in C++ ๐ด
๐Novel open-source programming library with (research) geometric algorithms in C++
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Geometry Processing from #INRIA
โ 30+ papers from SIGGRAPH, etc.
โ Grants: GOODSHAPE & VORPALINE
โ Code (mostly C++) under BSD 3
More: https://bit.ly/3mhS4L7
๐Novel open-source programming library with (research) geometric algorithms in C++
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Geometry Processing from #INRIA
โ 30+ papers from SIGGRAPH, etc.
โ Grants: GOODSHAPE & VORPALINE
โ Code (mostly C++) under BSD 3
More: https://bit.ly/3mhS4L7
๐ฅ6๐3โค1
๐ Open Source Vision from #Apple ๐
๐CVNets: open-source (not a joke) lib for neural vision.
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ PyTorch-based neural lib. for vision
โ Train 2โ4ร longer w/ augmentations
โ Plug-and-play components for CV
โ Source code under a custom license
More: https://bit.ly/39d1dSj
๐CVNets: open-source (not a joke) lib for neural vision.
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ PyTorch-based neural lib. for vision
โ Train 2โ4ร longer w/ augmentations
โ Plug-and-play components for CV
โ Source code under a custom license
More: https://bit.ly/39d1dSj
๐9
This media is not supported in your browser
VIEW IN TELEGRAM
๐๐ปNeural Clips by #Nvidia: INSANE ๐๐ป
๐Neural generation with changes in camera viewpoint & content that arises over time ๐คฏ
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Novel hierarchical generator architecture
โ Temp. receptive field + temporal embed.
โ Multi-res. with super-resolution network
โ SOTA in long clip with motion & changes
โ Code, data & models in August 2022 ๐๏ธ
More: https://bit.ly/3zroWsC
๐Neural generation with changes in camera viewpoint & content that arises over time ๐คฏ
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Novel hierarchical generator architecture
โ Temp. receptive field + temporal embed.
โ Multi-res. with super-resolution network
โ SOTA in long clip with motion & changes
โ Code, data & models in August 2022 ๐๏ธ
More: https://bit.ly/3zroWsC
๐คฏ9๐2โค1
This media is not supported in your browser
VIEW IN TELEGRAM
โฝ Zero to #Messi with #deeplearning โฝ
๐EA unveils a neural system to learn multiple soccer juggling skills ๐
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Learning difficult soccer juggling skills
โ Layer-wise mixture-of-experts architecture
โ Specialization arises naturally
โ Adaptive random walk training strategy
More: https://bit.ly/3mwRaL2
๐EA unveils a neural system to learn multiple soccer juggling skills ๐
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Learning difficult soccer juggling skills
โ Layer-wise mixture-of-experts architecture
โ Specialization arises naturally
โ Adaptive random walk training strategy
More: https://bit.ly/3mwRaL2
๐ฅ7๐3
This media is not supported in your browser
VIEW IN TELEGRAM
๐๏ธ HumanNeRF: source code is out! ๐๏ธ
๐Pausing the video at any frame and rendering the subject from arbitrary views!
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Synthesizing photorealistic humans
โ Synthesizing details, ie. cloth & face
โ Volumetric canonical T-pose
โ Skeletal rigid/non-rigid decomposition
More: https://bit.ly/3NEkTNY
๐Pausing the video at any frame and rendering the subject from arbitrary views!
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Synthesizing photorealistic humans
โ Synthesizing details, ie. cloth & face
โ Volumetric canonical T-pose
โ Skeletal rigid/non-rigid decomposition
More: https://bit.ly/3NEkTNY
๐คฏ17๐ฅ5๐2
This media is not supported in your browser
VIEW IN TELEGRAM
๐ EG3D: source code is out! ๐
๐#Nvidia just opened EG3D: real time multi-view faces w/ HQ #3D geometry!
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Tri-plane-based 3D GAN framework
โ Pose-correlated attribute (expression)
โ SOTA in uncond. 3D-aware synthesis
โ Source code & models NOW available!
More: https://bit.ly/3aOfHs0
๐#Nvidia just opened EG3D: real time multi-view faces w/ HQ #3D geometry!
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Tri-plane-based 3D GAN framework
โ Pose-correlated attribute (expression)
โ SOTA in uncond. 3D-aware synthesis
โ Source code & models NOW available!
More: https://bit.ly/3aOfHs0
๐ฅ7๐คฏ6๐4โค2
๐ฅOne Millisecond Backbone. Fire!๐ฅ
๐MobileOne by #Apple: efficient mobile backbone with inference <1 ms on #iPhone12!
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ 75.9% top-1 accuracy on ImageNet
โ 38ร faster than MobileFormer net
โ Classification, detection & segmentation
โ Source code & model soon available!
More: https://bit.ly/3tsT7f2
๐MobileOne by #Apple: efficient mobile backbone with inference <1 ms on #iPhone12!
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ 75.9% top-1 accuracy on ImageNet
โ 38ร faster than MobileFormer net
โ Classification, detection & segmentation
โ Source code & model soon available!
More: https://bit.ly/3tsT7f2
โค24๐2
This media is not supported in your browser
VIEW IN TELEGRAM
๐งจ Scaling Transformers to GigaPixels!๐งจ
๐Novel ViT called Hierarchical Image Pyramid Transformer (HIPT) -> Scaling to GigaPixels!
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Gigapixel whole-slide imaging (WSI)
โ Leveraging natural hier. structure of WSI
โ Self-supervised Hi-Res representations
โ Source code and models available!
More: https://bit.ly/3xLuzkg
๐Novel ViT called Hierarchical Image Pyramid Transformer (HIPT) -> Scaling to GigaPixels!
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ Gigapixel whole-slide imaging (WSI)
โ Leveraging natural hier. structure of WSI
โ Self-supervised Hi-Res representations
โ Source code and models available!
More: https://bit.ly/3xLuzkg
๐คฏ16๐1
This media is not supported in your browser
VIEW IN TELEGRAM
๐BodyMap: Hyper-Detailed Humans๐
๐#META unveils 1st-ever dense continuous correspondence for clothed humans
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ 1st-ever dense continuous corresp.
โ HQ fingers, hair, and clothes
โ Novel ViT-based architecture
โ SOTA on DensePose COCO
More: https://bit.ly/39nEPps
๐#META unveils 1st-ever dense continuous correspondence for clothed humans
๐๐ข๐ ๐ก๐ฅ๐ข๐ ๐ก๐ญ๐ฌ:
โ 1st-ever dense continuous corresp.
โ HQ fingers, hair, and clothes
โ Novel ViT-based architecture
โ SOTA on DensePose COCO
More: https://bit.ly/39nEPps
๐13โค2