This media is not supported in your browser
VIEW IN TELEGRAM
πΈ Virtual Occlusions in #AR πΈ
πNiantic (#pokemongo) on a novel approach for virtual assets to appear βsitting amongβ the real world objects
πReview https://bit.ly/3o04wn6
πPaper arxiv.org/pdf/2305.07014.pdf
πProject nianticlabs.github.io/implicit-depth
πCode github.com/nianticlabs/implicit-depth
πNiantic (#pokemongo) on a novel approach for virtual assets to appear βsitting amongβ the real world objects
πReview https://bit.ly/3o04wn6
πPaper arxiv.org/pdf/2305.07014.pdf
πProject nianticlabs.github.io/implicit-depth
πCode github.com/nianticlabs/implicit-depth
π₯11π€―5π3β‘1π€©1
This media is not supported in your browser
VIEW IN TELEGRAM
πΏ De-Aging Harrison Ford via SD πΏ
πStable Diffusion for Hollywood: preview of the next autotune of entertainment industry. A discussionπ
π More: https://bit.ly/41EzaQK
πStable Diffusion for Hollywood: preview of the next autotune of entertainment industry. A discussionπ
π More: https://bit.ly/41EzaQK
π€―19π₯9π6π©3β‘1π±1
This media is not supported in your browser
VIEW IN TELEGRAM
πͺ° #3D Auto-Reconstruction πͺ°
πAutoRecon: automated discovery & reconstruction of objects from multi-view pics.
πReview https://bit.ly/3MxI0f4
πPaper arxiv.org/pdf/2305.08810.pdf
πProject zju3dv.github.io/autorecon/
πCode github.com/zju3dv/AutoRecon
πAutoRecon: automated discovery & reconstruction of objects from multi-view pics.
πReview https://bit.ly/3MxI0f4
πPaper arxiv.org/pdf/2305.08810.pdf
πProject zju3dv.github.io/autorecon/
πCode github.com/zju3dv/AutoRecon
π₯11β€4π€―3π₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
π Multi-Layered 3D Garments Animation π
πS-Lab unveils LayersNet: animating multi-layered garments driven by various external forces, such as human bodies & wind
πReview https://bit.ly/435b42F
πPaper arxiv.org/pdf/2305.10418.pdf
πProject mmlab-ntu.github.io/project/layersnet
πS-Lab unveils LayersNet: animating multi-layered garments driven by various external forces, such as human bodies & wind
πReview https://bit.ly/435b42F
πPaper arxiv.org/pdf/2305.10418.pdf
πProject mmlab-ntu.github.io/project/layersnet
π₯6π±2β€1π1
This media is not supported in your browser
VIEW IN TELEGRAM
π« 100% Mask-Free VIS π«
πETH Z unveils MaskFreeVIS: novel high-performing VIS without any mask annotations.
πReview https://bit.ly/3Wg7CQB
πPaper arxiv.org/pdf/2303.15904.pdf
πProject www.vis.xyz/pub/maskfreevis/
πCode github.com/SysCV/maskfreevis
πETH Z unveils MaskFreeVIS: novel high-performing VIS without any mask annotations.
πReview https://bit.ly/3Wg7CQB
πPaper arxiv.org/pdf/2303.15904.pdf
πProject www.vis.xyz/pub/maskfreevis/
πCode github.com/SysCV/maskfreevis
π₯6π4π€―2β€1π±1
This media is not supported in your browser
VIEW IN TELEGRAM
π Drag-GAN: user-friendly image-manipulation π
π Manual deforming of (real and generated) images over pose, shape, expression and layout.
πReview https://bit.ly/3BFyXlR
πPaper arxiv.org/pdf/2305.10973.pdf
πProject vcai.mpi-inf.mpg.de/projects/DragGAN
πCode github.com/XingangPan/DragGAN
π Manual deforming of (real and generated) images over pose, shape, expression and layout.
πReview https://bit.ly/3BFyXlR
πPaper arxiv.org/pdf/2305.10973.pdf
πProject vcai.mpi-inf.mpg.de/projects/DragGAN
πCode github.com/XingangPan/DragGAN
π₯34π€―18β€6π4π±1
This media is not supported in your browser
VIEW IN TELEGRAM
πΊοΈ AI-generated stereotypical men πΊοΈ
πA thread about generating stereotypical person from 15 countries all around the world. And yes, Italian love Pizza.
π More https://bit.ly/3oo0t4c
πA thread about generating stereotypical person from 15 countries all around the world. And yes, Italian love Pizza.
π More https://bit.ly/3oo0t4c
π€£6β€3π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
πΆ AVOS Multiscale Encoder-Decoder ViT πΆ
π MED-VT, world's first Multiscale Encoder Decoder Video Transformer for AVOS
πReview https://bit.ly/3MohFi1
πPaper arxiv.org/pdf/2304.05930.pdf
πProject rkyuca.github.io/medvt
πCode github.com/rkyuca/medvt
π MED-VT, world's first Multiscale Encoder Decoder Video Transformer for AVOS
πReview https://bit.ly/3MohFi1
πPaper arxiv.org/pdf/2304.05930.pdf
πProject rkyuca.github.io/medvt
πCode github.com/rkyuca/medvt
π13π₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
π Neural Dynamic Image-Based Rendering π
π DynIBaR: synthesizing novel views from monocular video depicting a complex dynamic scene.
πReview https://t.ly/90Kw
πPaper arxiv.org/pdf/2211.11082.pdf
πProject https://dynibar.github.io/
πCode github.com/google/dynibar
π DynIBaR: synthesizing novel views from monocular video depicting a complex dynamic scene.
πReview https://t.ly/90Kw
πPaper arxiv.org/pdf/2211.11082.pdf
πProject https://dynibar.github.io/
πCode github.com/google/dynibar
β€9π3π₯°1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦ Open Semantic Segmentation π¦
πSSSegmentation: open source supervised semantic segmentation toolbox based on #PyTorch
πReview https://t.ly/ZE9q
πPaper arxiv.org/pdf/2305.17091.pdf
πCode github.com/SegmentationBLWX/sssegmentation
πSSSegmentation: open source supervised semantic segmentation toolbox based on #PyTorch
πReview https://t.ly/ZE9q
πPaper arxiv.org/pdf/2305.17091.pdf
πCode github.com/SegmentationBLWX/sssegmentation
π₯10β€4β‘1π1π€―1π€©1πΎ1
This media is not supported in your browser
VIEW IN TELEGRAM
ποΈ 4D Humans with Transformers ποΈ
πNovel approach to reconstruct and track humans (even in unusual poses)
πReview https://t.ly/XGv_
πPaper arxiv.org/pdf/2305.20091.pdf
πProject shubham-goel.github.io/4dhumans/#
πCode github.com/shubham-goel/4D-Humans
πNovel approach to reconstruct and track humans (even in unusual poses)
πReview https://t.ly/XGv_
πPaper arxiv.org/pdf/2305.20091.pdf
πProject shubham-goel.github.io/4dhumans/#
πCode github.com/shubham-goel/4D-Humans
π€―10π7π₯5β€2β‘1
This media is not supported in your browser
VIEW IN TELEGRAM
π½ Neuralangelo Digital Twins. INSANEπ½
π A novel framework from #Nvidia for Hi-Fi 3D Digital twins.
πReview https://t.ly/rxoF4
πProject research.nvidia.com/labs/dir/neuralangelo
πPaper research.nvidia.com/labs/dir/neuralangelo/paper.pdf
π A novel framework from #Nvidia for Hi-Fi 3D Digital twins.
πReview https://t.ly/rxoF4
πProject research.nvidia.com/labs/dir/neuralangelo
πPaper research.nvidia.com/labs/dir/neuralangelo/paper.pdf
π₯15π4π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦ ColorDiffuser: Text-to-Video Colorization π¦
πHK University unveils ColorDiffuser: adapting pre-trained text-to-image latent diffusion model for video colorization
πReview https://t.ly/XGv_
πPaper arxiv.org/pdf/2306.01732.pdf
πProject colordiffuser.github.io/
πCode github.com/ColorDiffuser/ColorDiffuser
πHK University unveils ColorDiffuser: adapting pre-trained text-to-image latent diffusion model for video colorization
πReview https://t.ly/XGv_
πPaper arxiv.org/pdf/2306.01732.pdf
πProject colordiffuser.github.io/
πCode github.com/ColorDiffuser/ColorDiffuser
π€―8β€2π€©1
This media is not supported in your browser
VIEW IN TELEGRAM
π» Extending Mona Lisa with AI π»
π A guy on Reddit extends Mona Lisa Painting with #Photoshop AI. The result is surprising.
πMore https://t.ly/j_2r
π A guy on Reddit extends Mona Lisa Painting with #Photoshop AI. The result is surprising.
πMore https://t.ly/j_2r
π€―20π5π€©4π₯3π±2π€£2β‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πΈ Segment Anything in HQ πΈ
πHQ-SAM: SAM with the ability to accurately segment objects, maintaining promptable design, efficiency, zero-shot generalizability
πReview https://t.ly/GxX5B
πPaper arxiv.org/pdf/2306.01567.pdf
πModels github.com/SysCV/SAM-HQ
πHQ-SAM: SAM with the ability to accurately segment objects, maintaining promptable design, efficiency, zero-shot generalizability
πReview https://t.ly/GxX5B
πPaper arxiv.org/pdf/2306.01567.pdf
πModels github.com/SysCV/SAM-HQ
π₯18π4π€―1π±1π1
This media is not supported in your browser
VIEW IN TELEGRAM
π Track Everything Everywhere π
π#Google unveils OmniMotion: full-length motion tracking for every pixel in every frame of video.
πReview https://t.ly/Krvw
πPaper arxiv.org/pdf/2306.05422.pdf
πProject omnimotion.github.io/
πDemo omnimotion.github.io/#interactive_demo
πCode github.com/qianqianwang68/omnimotion
π#Google unveils OmniMotion: full-length motion tracking for every pixel in every frame of video.
πReview https://t.ly/Krvw
πPaper arxiv.org/pdf/2306.05422.pdf
πProject omnimotion.github.io/
πDemo omnimotion.github.io/#interactive_demo
πCode github.com/qianqianwang68/omnimotion
π₯23β€5π€―3π€©1π©1
This media is not supported in your browser
VIEW IN TELEGRAM
ποΈ Scene Five: Through Her Eyes ποΈ
π #3D scene reconstruction of what a person is observing using only the reflections of their eyes
πReview https://t.ly/uBO6
πPaper arxiv.org/pdf/2306.09348.pdf
πProject https://world-from-eyes.github.io/
π #3D scene reconstruction of what a person is observing using only the reflections of their eyes
πReview https://t.ly/uBO6
πPaper arxiv.org/pdf/2306.09348.pdf
πProject https://world-from-eyes.github.io/
π€―28π₯12π©2π€©1
This media is not supported in your browser
VIEW IN TELEGRAM
π§Ώ NeRF-Supervised Deep Stereo π§Ώ
πA novel pioneering pipeline for training deep stereo networks WITH NO ground-truth
πReview https://t.ly/c7j-
πProject nerfstereo.github.io/
πDataset https://amsacta.unibo.it/id/eprint/7218/
πCode github.com/fabiotosi92/NeRF-Supervised-Deep-Stereo
πPaper https://openaccess.thecvf.com/content/CVPR2023/papers/Tosi_NeRF-Supervised_Deep_Stereo_CVPR_2023_paper.pdf
πA novel pioneering pipeline for training deep stereo networks WITH NO ground-truth
πReview https://t.ly/c7j-
πProject nerfstereo.github.io/
πDataset https://amsacta.unibo.it/id/eprint/7218/
πCode github.com/fabiotosi92/NeRF-Supervised-Deep-Stereo
πPaper https://openaccess.thecvf.com/content/CVPR2023/papers/Tosi_NeRF-Supervised_Deep_Stereo_CVPR_2023_paper.pdf
π₯°8π€©3β€1π1π©1π1
This media is not supported in your browser
VIEW IN TELEGRAM
π«£ Text-Guided Adversarial Makeup π«£
πNovel facial privacy protection via adversarial latent codes. Makeup vs Face Recognition.
πReview https://t.ly/pBCP
πPaper arxiv.org/pdf/2306.10008.pdf
πCode github.com/fahadshamshad/Clip2Protect
πNovel facial privacy protection via adversarial latent codes. Makeup vs Face Recognition.
πReview https://t.ly/pBCP
πPaper arxiv.org/pdf/2306.10008.pdf
πCode github.com/fahadshamshad/Clip2Protect
β€6π1π₯1π₯°1π©1
Media is too big
VIEW IN TELEGRAM
π¦· Few-Shot Geometry-Aware Keypoints π¦·
πUBC (+Flawless AI) unveils the new SOTA in semantic keypoints localization. Suitable for faces, animals, cars, mouth, teeth & more
πReview https://t.ly/-0qN
πPaper arxiv.org/pdf/2303.17216.pdf
πProject xingzhehe.github.io/FewShot3DKP/
πUBC (+Flawless AI) unveils the new SOTA in semantic keypoints localization. Suitable for faces, animals, cars, mouth, teeth & more
πReview https://t.ly/-0qN
πPaper arxiv.org/pdf/2303.17216.pdf
πProject xingzhehe.github.io/FewShot3DKP/
π€―10π4β€2β‘2π2π€©2π₯1