This media is not supported in your browser
VIEW IN TELEGRAM
π₯ Track Anything: SAM-powered tracking π₯
π SUSTech VIP Lab proposes TAM, a "novel" video tracker powered by SAM
πReview https://bit.ly/44jwI4W
πPaper arxiv.org/pdf/2304.11968.pdf
πCode github.com/gaomingqi/Track-Anything
π SUSTech VIP Lab proposes TAM, a "novel" video tracker powered by SAM
πReview https://bit.ly/44jwI4W
πPaper arxiv.org/pdf/2304.11968.pdf
πCode github.com/gaomingqi/Track-Anything
π₯17π4π€―2π±2π₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
π± Segment Everything Everywhere π±
π Segmenting everything using visual/language prompts (BBs, scribbles, text & audio)
πReview https://bit.ly/3LEiOmx
πPaper arxiv.org/pdf/2304.06718.pdf
πDemo huggingface.co/spaces/xdecoder/SEEM
πCode github.com/UX-Decoder/Segment-Everything-Everywhere-All-At-Once
π Segmenting everything using visual/language prompts (BBs, scribbles, text & audio)
πReview https://bit.ly/3LEiOmx
πPaper arxiv.org/pdf/2304.06718.pdf
πDemo huggingface.co/spaces/xdecoder/SEEM
πCode github.com/UX-Decoder/Segment-Everything-Everywhere-All-At-Once
π₯13β€4π€―1π€©1
π¦ Look mom, I'm a giraffe π¦
π A patent to transpose adversarial patches onto a knitted fabric. Be undetectable or associated with incorrect category such as "animal" (giraffe, zebra, etc)
π More: https://bit.ly/3LzjSGV
π A patent to transpose adversarial patches onto a knitted fabric. Be undetectable or associated with incorrect category such as "animal" (giraffe, zebra, etc)
π More: https://bit.ly/3LzjSGV
β€20π4π€©4π₯3π©3π1π±1
This media is not supported in your browser
VIEW IN TELEGRAM
π RelPose++: SOTA 6D from 2-8 pics π
πCMU unveils a novel neural method for 6D camera poses from only 2-8 images
πReview https://bit.ly/42ioJ6K
πPaper arxiv.org/pdf/2305.04926.pdf
πProject amyxlase.github.io/relpose-plus-plus
πCode github.com/amyxlase/relpose-plus-plus
πCMU unveils a novel neural method for 6D camera poses from only 2-8 images
πReview https://bit.ly/42ioJ6K
πPaper arxiv.org/pdf/2305.04926.pdf
πProject amyxlase.github.io/relpose-plus-plus
πCode github.com/amyxlase/relpose-plus-plus
π₯16π€©1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦ 6D Non-Prehensile Manipulation π¦
π#META (+CMU) unveils HACMan, novel 6D non-prehensile manipulation of objects
πReview https://bit.ly/3NP1jl1
πPaper arxiv.org/pdf/2305.03942.pdf
πProject hacman-2023.github.io
π#META (+CMU) unveils HACMan, novel 6D non-prehensile manipulation of objects
πReview https://bit.ly/3NP1jl1
πPaper arxiv.org/pdf/2305.03942.pdf
πProject hacman-2023.github.io
π6π₯4π€―3π±1
This media is not supported in your browser
VIEW IN TELEGRAM
πΈ Virtual Occlusions in #AR πΈ
πNiantic (#pokemongo) on a novel approach for virtual assets to appear βsitting amongβ the real world objects
πReview https://bit.ly/3o04wn6
πPaper arxiv.org/pdf/2305.07014.pdf
πProject nianticlabs.github.io/implicit-depth
πCode github.com/nianticlabs/implicit-depth
πNiantic (#pokemongo) on a novel approach for virtual assets to appear βsitting amongβ the real world objects
πReview https://bit.ly/3o04wn6
πPaper arxiv.org/pdf/2305.07014.pdf
πProject nianticlabs.github.io/implicit-depth
πCode github.com/nianticlabs/implicit-depth
π₯11π€―5π3β‘1π€©1
This media is not supported in your browser
VIEW IN TELEGRAM
πΏ De-Aging Harrison Ford via SD πΏ
πStable Diffusion for Hollywood: preview of the next autotune of entertainment industry. A discussionπ
π More: https://bit.ly/41EzaQK
πStable Diffusion for Hollywood: preview of the next autotune of entertainment industry. A discussionπ
π More: https://bit.ly/41EzaQK
π€―19π₯9π6π©3β‘1π±1
This media is not supported in your browser
VIEW IN TELEGRAM
πͺ° #3D Auto-Reconstruction πͺ°
πAutoRecon: automated discovery & reconstruction of objects from multi-view pics.
πReview https://bit.ly/3MxI0f4
πPaper arxiv.org/pdf/2305.08810.pdf
πProject zju3dv.github.io/autorecon/
πCode github.com/zju3dv/AutoRecon
πAutoRecon: automated discovery & reconstruction of objects from multi-view pics.
πReview https://bit.ly/3MxI0f4
πPaper arxiv.org/pdf/2305.08810.pdf
πProject zju3dv.github.io/autorecon/
πCode github.com/zju3dv/AutoRecon
π₯11β€4π€―3π₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
π Multi-Layered 3D Garments Animation π
πS-Lab unveils LayersNet: animating multi-layered garments driven by various external forces, such as human bodies & wind
πReview https://bit.ly/435b42F
πPaper arxiv.org/pdf/2305.10418.pdf
πProject mmlab-ntu.github.io/project/layersnet
πS-Lab unveils LayersNet: animating multi-layered garments driven by various external forces, such as human bodies & wind
πReview https://bit.ly/435b42F
πPaper arxiv.org/pdf/2305.10418.pdf
πProject mmlab-ntu.github.io/project/layersnet
π₯6π±2β€1π1
This media is not supported in your browser
VIEW IN TELEGRAM
π« 100% Mask-Free VIS π«
πETH Z unveils MaskFreeVIS: novel high-performing VIS without any mask annotations.
πReview https://bit.ly/3Wg7CQB
πPaper arxiv.org/pdf/2303.15904.pdf
πProject www.vis.xyz/pub/maskfreevis/
πCode github.com/SysCV/maskfreevis
πETH Z unveils MaskFreeVIS: novel high-performing VIS without any mask annotations.
πReview https://bit.ly/3Wg7CQB
πPaper arxiv.org/pdf/2303.15904.pdf
πProject www.vis.xyz/pub/maskfreevis/
πCode github.com/SysCV/maskfreevis
π₯6π4π€―2β€1π±1
This media is not supported in your browser
VIEW IN TELEGRAM
π Drag-GAN: user-friendly image-manipulation π
π Manual deforming of (real and generated) images over pose, shape, expression and layout.
πReview https://bit.ly/3BFyXlR
πPaper arxiv.org/pdf/2305.10973.pdf
πProject vcai.mpi-inf.mpg.de/projects/DragGAN
πCode github.com/XingangPan/DragGAN
π Manual deforming of (real and generated) images over pose, shape, expression and layout.
πReview https://bit.ly/3BFyXlR
πPaper arxiv.org/pdf/2305.10973.pdf
πProject vcai.mpi-inf.mpg.de/projects/DragGAN
πCode github.com/XingangPan/DragGAN
π₯34π€―18β€6π4π±1
This media is not supported in your browser
VIEW IN TELEGRAM
πΊοΈ AI-generated stereotypical men πΊοΈ
πA thread about generating stereotypical person from 15 countries all around the world. And yes, Italian love Pizza.
π More https://bit.ly/3oo0t4c
πA thread about generating stereotypical person from 15 countries all around the world. And yes, Italian love Pizza.
π More https://bit.ly/3oo0t4c
π€£6β€3π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
πΆ AVOS Multiscale Encoder-Decoder ViT πΆ
π MED-VT, world's first Multiscale Encoder Decoder Video Transformer for AVOS
πReview https://bit.ly/3MohFi1
πPaper arxiv.org/pdf/2304.05930.pdf
πProject rkyuca.github.io/medvt
πCode github.com/rkyuca/medvt
π MED-VT, world's first Multiscale Encoder Decoder Video Transformer for AVOS
πReview https://bit.ly/3MohFi1
πPaper arxiv.org/pdf/2304.05930.pdf
πProject rkyuca.github.io/medvt
πCode github.com/rkyuca/medvt
π13π₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
π Neural Dynamic Image-Based Rendering π
π DynIBaR: synthesizing novel views from monocular video depicting a complex dynamic scene.
πReview https://t.ly/90Kw
πPaper arxiv.org/pdf/2211.11082.pdf
πProject https://dynibar.github.io/
πCode github.com/google/dynibar
π DynIBaR: synthesizing novel views from monocular video depicting a complex dynamic scene.
πReview https://t.ly/90Kw
πPaper arxiv.org/pdf/2211.11082.pdf
πProject https://dynibar.github.io/
πCode github.com/google/dynibar
β€9π3π₯°1π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦ Open Semantic Segmentation π¦
πSSSegmentation: open source supervised semantic segmentation toolbox based on #PyTorch
πReview https://t.ly/ZE9q
πPaper arxiv.org/pdf/2305.17091.pdf
πCode github.com/SegmentationBLWX/sssegmentation
πSSSegmentation: open source supervised semantic segmentation toolbox based on #PyTorch
πReview https://t.ly/ZE9q
πPaper arxiv.org/pdf/2305.17091.pdf
πCode github.com/SegmentationBLWX/sssegmentation
π₯10β€4β‘1π1π€―1π€©1πΎ1
This media is not supported in your browser
VIEW IN TELEGRAM
ποΈ 4D Humans with Transformers ποΈ
πNovel approach to reconstruct and track humans (even in unusual poses)
πReview https://t.ly/XGv_
πPaper arxiv.org/pdf/2305.20091.pdf
πProject shubham-goel.github.io/4dhumans/#
πCode github.com/shubham-goel/4D-Humans
πNovel approach to reconstruct and track humans (even in unusual poses)
πReview https://t.ly/XGv_
πPaper arxiv.org/pdf/2305.20091.pdf
πProject shubham-goel.github.io/4dhumans/#
πCode github.com/shubham-goel/4D-Humans
π€―10π7π₯5β€2β‘1
This media is not supported in your browser
VIEW IN TELEGRAM
π½ Neuralangelo Digital Twins. INSANEπ½
π A novel framework from #Nvidia for Hi-Fi 3D Digital twins.
πReview https://t.ly/rxoF4
πProject research.nvidia.com/labs/dir/neuralangelo
πPaper research.nvidia.com/labs/dir/neuralangelo/paper.pdf
π A novel framework from #Nvidia for Hi-Fi 3D Digital twins.
πReview https://t.ly/rxoF4
πProject research.nvidia.com/labs/dir/neuralangelo
πPaper research.nvidia.com/labs/dir/neuralangelo/paper.pdf
π₯15π4π€―1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦ ColorDiffuser: Text-to-Video Colorization π¦
πHK University unveils ColorDiffuser: adapting pre-trained text-to-image latent diffusion model for video colorization
πReview https://t.ly/XGv_
πPaper arxiv.org/pdf/2306.01732.pdf
πProject colordiffuser.github.io/
πCode github.com/ColorDiffuser/ColorDiffuser
πHK University unveils ColorDiffuser: adapting pre-trained text-to-image latent diffusion model for video colorization
πReview https://t.ly/XGv_
πPaper arxiv.org/pdf/2306.01732.pdf
πProject colordiffuser.github.io/
πCode github.com/ColorDiffuser/ColorDiffuser
π€―8β€2π€©1
This media is not supported in your browser
VIEW IN TELEGRAM
π» Extending Mona Lisa with AI π»
π A guy on Reddit extends Mona Lisa Painting with #Photoshop AI. The result is surprising.
πMore https://t.ly/j_2r
π A guy on Reddit extends Mona Lisa Painting with #Photoshop AI. The result is surprising.
πMore https://t.ly/j_2r
π€―20π5π€©4π₯3π±2π€£2β‘1
This media is not supported in your browser
VIEW IN TELEGRAM
πΈ Segment Anything in HQ πΈ
πHQ-SAM: SAM with the ability to accurately segment objects, maintaining promptable design, efficiency, zero-shot generalizability
πReview https://t.ly/GxX5B
πPaper arxiv.org/pdf/2306.01567.pdf
πModels github.com/SysCV/SAM-HQ
πHQ-SAM: SAM with the ability to accurately segment objects, maintaining promptable design, efficiency, zero-shot generalizability
πReview https://t.ly/GxX5B
πPaper arxiv.org/pdf/2306.01567.pdf
πModels github.com/SysCV/SAM-HQ
π₯18π4π€―1π±1π1