This media is not supported in your browser
VIEW IN TELEGRAM
π₯ "Segmenting Anything". CRAZY! π₯
π#Meta unveils a novel model and (1B+) dataset for neural segmentation π€―
πReview https://bit.ly/3nM2uXx
πPaper https://bit.ly/43788DC
πProject https://segment-anything.com
πCode github.com/facebookresearch/segment-anything
π#Meta unveils a novel model and (1B+) dataset for neural segmentation π€―
πReview https://bit.ly/3nM2uXx
πPaper https://bit.ly/43788DC
πProject https://segment-anything.com
πCode github.com/facebookresearch/segment-anything
π€―36β€16π±3π2π₯1π₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦ SegGPT: Segmenting Everything (In Context) π¦
πBAAI unveils SegGPT, a generalist model for segmenting everything in context
πReview https://bit.ly/3zFkUf2
πPaper arxiv.org/pdf/2304.03284.pdf
πCode github.com/baaivision/Painter
πDemo huggingface.co/spaces/BAAI/SegGPT
πBAAI unveils SegGPT, a generalist model for segmenting everything in context
πReview https://bit.ly/3zFkUf2
πPaper arxiv.org/pdf/2304.03284.pdf
πCode github.com/baaivision/Painter
πDemo huggingface.co/spaces/BAAI/SegGPT
π₯15π7β€3π±1
This media is not supported in your browser
VIEW IN TELEGRAM
πDreamPose: Fashion I-2-V Diffusionπ
π Turning fashion photos into realistic videos via driving pose sequence
πReview https://bit.ly/3AdNtAN
πPaper arxiv.org/pdf/2304.06025.pdf
πCode github.com/johannakarras/DreamPose
πProject grail.cs.washington.edu/projects/dreampose
π Turning fashion photos into realistic videos via driving pose sequence
πReview https://bit.ly/3AdNtAN
πPaper arxiv.org/pdf/2304.06025.pdf
πCode github.com/johannakarras/DreamPose
πProject grail.cs.washington.edu/projects/dreampose
π€―11π₯3β€2π2π₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯¦ Zip-NeRF: the Anti-Aliasing NeRF π₯¦
π#Google unveils a novel version of NeRF able to fix the aliasing problem being 22x faster in training than SOTA.
πReview https://bit.ly/3L1hZ6M
πPaper arxiv.org/pdf/2304.06706.pdf
πProject https://jonbarron.info/zipnerf
π#Google unveils a novel version of NeRF able to fix the aliasing problem being 22x faster in training than SOTA.
πReview https://bit.ly/3L1hZ6M
πPaper arxiv.org/pdf/2304.06706.pdf
πProject https://jonbarron.info/zipnerf
π€―13π₯4π3
This media is not supported in your browser
VIEW IN TELEGRAM
π₯ ALERT: Stable Diffusion XL is out! π₯
πSDXL the new generative AI by Stability.AI for images from text. Up to 1024x1024 resolution, for free.
πMore https://bit.ly/41wrh0j
πSDXL the new generative AI by Stability.AI for images from text. Up to 1024x1024 resolution, for free.
πMore https://bit.ly/41wrh0j
π€―10β€7π±1
This media is not supported in your browser
VIEW IN TELEGRAM
πͺ¬ META's Animated Drawings is out! πͺ¬
π#META unveils an easy-to-use method for animating human-like figures drawn by children.
πReview https://bit.ly/3mGeQQv
πPaper arxiv.org/pdf/2303.12741.pdf
πProject fairanimateddrawings.com
π#META unveils an easy-to-use method for animating human-like figures drawn by children.
πReview https://bit.ly/3mGeQQv
πPaper arxiv.org/pdf/2303.12741.pdf
πProject fairanimateddrawings.com
π±16π₯°5π4π2π€©2β‘1π₯1πΎ1
This media is not supported in your browser
VIEW IN TELEGRAM
π»DDS: diffusive text-based image editingπ»
πGoogle unveils a novel text-based image editing for modifications of an input image towards a text description.
πReview https://bit.ly/3L52UBl
πPaper arxiv.org/pdf/2304.07090.pdf
πProject delta-denoising-score.github.io
πGoogle unveils a novel text-based image editing for modifications of an input image towards a text description.
πReview https://bit.ly/3L52UBl
πPaper arxiv.org/pdf/2304.07090.pdf
πProject delta-denoising-score.github.io
π₯12β€2π2
This media is not supported in your browser
VIEW IN TELEGRAM
πͺ
Inpaint Anything: Segmentation + Inpainting πͺ
πRemove / Fill /Replace anything (also via prompt). "Inpaint Anything", a new paradigm of βclicking & filling"
πReview https://bit.ly/43JNREE
πPaper arxiv.org/pdf/2304.06790.pdf
πCode github.com/geekyutao/Inpaint-Anything
πRemove / Fill /Replace anything (also via prompt). "Inpaint Anything", a new paradigm of βclicking & filling"
πReview https://bit.ly/43JNREE
πPaper arxiv.org/pdf/2304.06790.pdf
πCode github.com/geekyutao/Inpaint-Anything
π16π€―8β€3π’1
Hi friends,
right now I'm flying to NY for a business trip!
π Is there anyone studying/working @NYU? I'd love to visit the campus and (eventually) attend to a few lessons about AI/CV/MATH on Monday (or this Friday)
Send me a DM -> @argovision
right now I'm flying to NY for a business trip!
π Is there anyone studying/working @NYU? I'd love to visit the campus and (eventually) attend to a few lessons about AI/CV/MATH on Monday (or this Friday)
Send me a DM -> @argovision
β€15π6πΎ5π€―3π€©1
This media is not supported in your browser
VIEW IN TELEGRAM
π₯ Track Anything: SAM-powered tracking π₯
π SUSTech VIP Lab proposes TAM, a "novel" video tracker powered by SAM
πReview https://bit.ly/44jwI4W
πPaper arxiv.org/pdf/2304.11968.pdf
πCode github.com/gaomingqi/Track-Anything
π SUSTech VIP Lab proposes TAM, a "novel" video tracker powered by SAM
πReview https://bit.ly/44jwI4W
πPaper arxiv.org/pdf/2304.11968.pdf
πCode github.com/gaomingqi/Track-Anything
π₯17π4π€―2π±2π₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
π± Segment Everything Everywhere π±
π Segmenting everything using visual/language prompts (BBs, scribbles, text & audio)
πReview https://bit.ly/3LEiOmx
πPaper arxiv.org/pdf/2304.06718.pdf
πDemo huggingface.co/spaces/xdecoder/SEEM
πCode github.com/UX-Decoder/Segment-Everything-Everywhere-All-At-Once
π Segmenting everything using visual/language prompts (BBs, scribbles, text & audio)
πReview https://bit.ly/3LEiOmx
πPaper arxiv.org/pdf/2304.06718.pdf
πDemo huggingface.co/spaces/xdecoder/SEEM
πCode github.com/UX-Decoder/Segment-Everything-Everywhere-All-At-Once
π₯13β€4π€―1π€©1
π¦ Look mom, I'm a giraffe π¦
π A patent to transpose adversarial patches onto a knitted fabric. Be undetectable or associated with incorrect category such as "animal" (giraffe, zebra, etc)
π More: https://bit.ly/3LzjSGV
π A patent to transpose adversarial patches onto a knitted fabric. Be undetectable or associated with incorrect category such as "animal" (giraffe, zebra, etc)
π More: https://bit.ly/3LzjSGV
β€20π4π€©4π₯3π©3π1π±1
This media is not supported in your browser
VIEW IN TELEGRAM
π RelPose++: SOTA 6D from 2-8 pics π
πCMU unveils a novel neural method for 6D camera poses from only 2-8 images
πReview https://bit.ly/42ioJ6K
πPaper arxiv.org/pdf/2305.04926.pdf
πProject amyxlase.github.io/relpose-plus-plus
πCode github.com/amyxlase/relpose-plus-plus
πCMU unveils a novel neural method for 6D camera poses from only 2-8 images
πReview https://bit.ly/42ioJ6K
πPaper arxiv.org/pdf/2305.04926.pdf
πProject amyxlase.github.io/relpose-plus-plus
πCode github.com/amyxlase/relpose-plus-plus
π₯16π€©1
This media is not supported in your browser
VIEW IN TELEGRAM
π¦ 6D Non-Prehensile Manipulation π¦
π#META (+CMU) unveils HACMan, novel 6D non-prehensile manipulation of objects
πReview https://bit.ly/3NP1jl1
πPaper arxiv.org/pdf/2305.03942.pdf
πProject hacman-2023.github.io
π#META (+CMU) unveils HACMan, novel 6D non-prehensile manipulation of objects
πReview https://bit.ly/3NP1jl1
πPaper arxiv.org/pdf/2305.03942.pdf
πProject hacman-2023.github.io
π6π₯4π€―3π±1
This media is not supported in your browser
VIEW IN TELEGRAM
πΈ Virtual Occlusions in #AR πΈ
πNiantic (#pokemongo) on a novel approach for virtual assets to appear βsitting amongβ the real world objects
πReview https://bit.ly/3o04wn6
πPaper arxiv.org/pdf/2305.07014.pdf
πProject nianticlabs.github.io/implicit-depth
πCode github.com/nianticlabs/implicit-depth
πNiantic (#pokemongo) on a novel approach for virtual assets to appear βsitting amongβ the real world objects
πReview https://bit.ly/3o04wn6
πPaper arxiv.org/pdf/2305.07014.pdf
πProject nianticlabs.github.io/implicit-depth
πCode github.com/nianticlabs/implicit-depth
π₯11π€―5π3β‘1π€©1
This media is not supported in your browser
VIEW IN TELEGRAM
πΏ De-Aging Harrison Ford via SD πΏ
πStable Diffusion for Hollywood: preview of the next autotune of entertainment industry. A discussionπ
π More: https://bit.ly/41EzaQK
πStable Diffusion for Hollywood: preview of the next autotune of entertainment industry. A discussionπ
π More: https://bit.ly/41EzaQK
π€―19π₯9π6π©3β‘1π±1
This media is not supported in your browser
VIEW IN TELEGRAM
πͺ° #3D Auto-Reconstruction πͺ°
πAutoRecon: automated discovery & reconstruction of objects from multi-view pics.
πReview https://bit.ly/3MxI0f4
πPaper arxiv.org/pdf/2305.08810.pdf
πProject zju3dv.github.io/autorecon/
πCode github.com/zju3dv/AutoRecon
πAutoRecon: automated discovery & reconstruction of objects from multi-view pics.
πReview https://bit.ly/3MxI0f4
πPaper arxiv.org/pdf/2305.08810.pdf
πProject zju3dv.github.io/autorecon/
πCode github.com/zju3dv/AutoRecon
π₯11β€4π€―3π₯°1
This media is not supported in your browser
VIEW IN TELEGRAM
π Multi-Layered 3D Garments Animation π
πS-Lab unveils LayersNet: animating multi-layered garments driven by various external forces, such as human bodies & wind
πReview https://bit.ly/435b42F
πPaper arxiv.org/pdf/2305.10418.pdf
πProject mmlab-ntu.github.io/project/layersnet
πS-Lab unveils LayersNet: animating multi-layered garments driven by various external forces, such as human bodies & wind
πReview https://bit.ly/435b42F
πPaper arxiv.org/pdf/2305.10418.pdf
πProject mmlab-ntu.github.io/project/layersnet
π₯6π±2β€1π1
This media is not supported in your browser
VIEW IN TELEGRAM
π« 100% Mask-Free VIS π«
πETH Z unveils MaskFreeVIS: novel high-performing VIS without any mask annotations.
πReview https://bit.ly/3Wg7CQB
πPaper arxiv.org/pdf/2303.15904.pdf
πProject www.vis.xyz/pub/maskfreevis/
πCode github.com/SysCV/maskfreevis
πETH Z unveils MaskFreeVIS: novel high-performing VIS without any mask annotations.
πReview https://bit.ly/3Wg7CQB
πPaper arxiv.org/pdf/2303.15904.pdf
πProject www.vis.xyz/pub/maskfreevis/
πCode github.com/SysCV/maskfreevis
π₯6π4π€―2β€1π±1
This media is not supported in your browser
VIEW IN TELEGRAM
π Drag-GAN: user-friendly image-manipulation π
π Manual deforming of (real and generated) images over pose, shape, expression and layout.
πReview https://bit.ly/3BFyXlR
πPaper arxiv.org/pdf/2305.10973.pdf
πProject vcai.mpi-inf.mpg.de/projects/DragGAN
πCode github.com/XingangPan/DragGAN
π Manual deforming of (real and generated) images over pose, shape, expression and layout.
πReview https://bit.ly/3BFyXlR
πPaper arxiv.org/pdf/2305.10973.pdf
πProject vcai.mpi-inf.mpg.de/projects/DragGAN
πCode github.com/XingangPan/DragGAN
π₯34π€―18β€6π4π±1