๐ฆ MagiCapture: HD Multi-Concept Portrait ๐ฆ
๐KAIST unveils MagiCapture: integrating subject and style concepts to generate high-resolution portrait images using just a few subject and style references
๐Review https://t.ly/c9rOo
๐Paper https://arxiv.org/pdf/2309.06895.pdf
๐KAIST unveils MagiCapture: integrating subject and style concepts to generate high-resolution portrait images using just a few subject and style references
๐Review https://t.ly/c9rOo
๐Paper https://arxiv.org/pdf/2309.06895.pdf
โค5๐ฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
โฝ Dynamic NeRFs for Soccer โฝ
๐SoccerNeRF: first attempt of "cheap" NeRF applied to football for reconstructing soccer replays in space and time.
๐Review https://t.ly/Ywcvk
๐Paper arxiv.org/pdf/2309.06802.pdf
๐Project https://soccernerfs.isach.be/
๐Code github.com/iSach/SoccerNeRFs
๐SoccerNeRF: first attempt of "cheap" NeRF applied to football for reconstructing soccer replays in space and time.
๐Review https://t.ly/Ywcvk
๐Paper arxiv.org/pdf/2309.06802.pdf
๐Project https://soccernerfs.isach.be/
๐Code github.com/iSach/SoccerNeRFs
๐ฅ8โค4๐3๐คฉ2๐ฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
โข๏ธ GlueStick: Graph Neural Matching โข๏ธ
๐GlueStick is joint deep matcher for points and lines that leverages the connectivity information between nodes to better glue them together
๐Review https://t.ly/Atxqo
๐Paper arxiv.org/pdf/2304.02008.pdf
๐Code https://github.com/cvg/GlueStick
๐GlueStick is joint deep matcher for points and lines that leverages the connectivity information between nodes to better glue them together
๐Review https://t.ly/Atxqo
๐Paper arxiv.org/pdf/2304.02008.pdf
๐Code https://github.com/cvg/GlueStick
๐ฅ11๐4โค1๐คฏ1๐คฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ซCPR-Coach: Neural Cardiopulmonary Resuscitation๐ซ
๐CPR-Coach: fine-grained action recognition in cardiopulmonary resuscitation
๐Review https://t.ly/Qbg4K
๐Paper arxiv.org/pdf/2309.11718.pdf
๐Code github.com/Shunli-Wang/CPR-Coach
๐Project shunli-wang.github.io/CPR-Coach
๐CPR-Coach: fine-grained action recognition in cardiopulmonary resuscitation
๐Review https://t.ly/Qbg4K
๐Paper arxiv.org/pdf/2309.11718.pdf
๐Code github.com/Shunli-Wang/CPR-Coach
๐Project shunli-wang.github.io/CPR-Coach
โค7๐ฅ3๐1
๐งช NeuralLabeling with NeRF ๐งช
๐Annotating a scene by generating segmentation masks, affordance maps, 2D bounding boxes, 3D BB, 6DOF poses, depth & meshes.
๐Review https://t.ly/1GPsj
๐Paper arxiv.org/pdf/2309.11966.pdf
๐Code github.com/FlorisE/neural-labeling
๐Project florise.github.io/neural_labeling_web
๐Annotating a scene by generating segmentation masks, affordance maps, 2D bounding boxes, 3D BB, 6DOF poses, depth & meshes.
๐Review https://t.ly/1GPsj
๐Paper arxiv.org/pdf/2309.11966.pdf
๐Code github.com/FlorisE/neural-labeling
๐Project florise.github.io/neural_labeling_web
๐5๐คฏ3๐ฅ2โค1๐ฅฐ1
๐ DE-ViT: detecting everything via DINOv2 ๐
๐DE-ViT: open-set object detector based on DINOv2 backbone. It's the new SOTA on COCO & LVIS dataset
๐Review https://t.ly/_DAmt
๐Paper arxiv.org/pdf/2309.12969.pdf
๐Code https://github.com/mlzxy/devit
๐DE-ViT: open-set object detector based on DINOv2 backbone. It's the new SOTA on COCO & LVIS dataset
๐Review https://t.ly/_DAmt
๐Paper arxiv.org/pdf/2309.12969.pdf
๐Code https://github.com/mlzxy/devit
๐ฅ8๐4โค1๐คฏ1๐ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ตCoTracker: fast transformer-tracker๐ต
๐META's CoTracker is a fast transformer-based model that can track any point in a video
๐Review https://t.ly/M36A_
๐Paper arxiv.org/pdf/2307.07635.pdf
๐Project https://co-tracker.github.io/
๐Code github.com/facebookresearch/co-tracker
๐META's CoTracker is a fast transformer-based model that can track any point in a video
๐Review https://t.ly/M36A_
๐Paper arxiv.org/pdf/2307.07635.pdf
๐Project https://co-tracker.github.io/
๐Code github.com/facebookresearch/co-tracker
โค7๐4๐คฏ2๐ฅ1๐ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ฌ๏ธ Neural Blowing in Still Photos ๐ฌ๏ธ
๐ A novel approach to animate human hair (and clothes) in a still portraits
๐Review https://t.ly/HKG0t
๐Paper arxiv.org/pdf/2309.14207.pdf
๐Project nevergiveu.github.io/AutomaticHairBlowing
๐ A novel approach to animate human hair (and clothes) in a still portraits
๐Review https://t.ly/HKG0t
๐Paper arxiv.org/pdf/2309.14207.pdf
๐Project nevergiveu.github.io/AutomaticHairBlowing
๐6๐คฏ3๐ฅ1๐1๐1๐คฃ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ฎ OW Indoor Segmentation ๐ฎ
๐3D-OWIS is a novel open-world 3D indoor instance segmentation method (with auto-labeling scheme) to separate known/unknown category labels
๐Review https://t.ly/-7ALf
๐Paper arxiv.org/pdf/2309.14338.pdf
๐Code github.com/aminebdj/3D-OWIS
๐3D-OWIS is a novel open-world 3D indoor instance segmentation method (with auto-labeling scheme) to separate known/unknown category labels
๐Review https://t.ly/-7ALf
๐Paper arxiv.org/pdf/2309.14338.pdf
๐Code github.com/aminebdj/3D-OWIS
๐6๐ฅ1๐ฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐งฑ Generating Scenes from Touch ๐งฑ
๐#AI for synthesizing images from tactile signals (and vice versa) and apply it to a number of visuo-tactile synthesis tasks
๐Review https://t.ly/Gxr0L
๐Paper https://arxiv.org/pdf/2309.15117.pdf
๐Project https://fredfyyang.github.io/vision-from-touch
๐Code https://github.com/fredfyyang/vision-from-touch
๐#AI for synthesizing images from tactile signals (and vice versa) and apply it to a number of visuo-tactile synthesis tasks
๐Review https://t.ly/Gxr0L
๐Paper https://arxiv.org/pdf/2309.15117.pdf
๐Project https://fredfyyang.github.io/vision-from-touch
๐Code https://github.com/fredfyyang/vision-from-touch
๐คฏ9๐6โค1๐ฅ1๐1๐ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
โDecaf: 3D Face-Hand Interactionsโ
๐The first learning-based MoCap to track human hands interacting with human faces in #3D from single monocular RGB videos
๐Review https://t.ly/070Tj
๐Paper arxiv.org/pdf/2309.16670.pdf
๐Project vcai.mpi-inf.mpg.de/projects/Decaf
๐The first learning-based MoCap to track human hands interacting with human faces in #3D from single monocular RGB videos
๐Review https://t.ly/070Tj
๐Paper arxiv.org/pdf/2309.16670.pdf
๐Project vcai.mpi-inf.mpg.de/projects/Decaf
๐8๐คฏ8๐ฅ3โค1๐1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ฑ Making LLaMA See and Draw ๐ฑ
๐Tencent #AI planted a SEED of Vision in Large Language Model. Making LLaMA see 'n' draw stuff.
๐Review https://t.ly/QiCAv
๐Paper arxiv.org/pdf/2310.01218.pdf
๐Code github.com/AILab-CVC/SEED
๐Tencent #AI planted a SEED of Vision in Large Language Model. Making LLaMA see 'n' draw stuff.
๐Review https://t.ly/QiCAv
๐Paper arxiv.org/pdf/2310.01218.pdf
๐Code github.com/AILab-CVC/SEED
โค8๐4๐คฏ3๐ฅ1
๐ฅVisual-Math Q&A: MathVista is out! ๐ฅ
๐ MathVista is the ultimate benchmark designed to amalgamate challenges from diverse mathematical and visual tasks
๐Review https://t.ly/yfqHZ
๐Paper https://arxiv.org/pdf/2310.02255.pdf
๐Project https://mathvista.github.io/
๐Code github.com/lupantech/MathVista
๐ MathVista is the ultimate benchmark designed to amalgamate challenges from diverse mathematical and visual tasks
๐Review https://t.ly/yfqHZ
๐Paper https://arxiv.org/pdf/2310.02255.pdf
๐Project https://mathvista.github.io/
๐Code github.com/lupantech/MathVista
โค8๐3๐ฅ3๐พ2๐1๐คฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐๐ Where Is OpenCV 5? ๐๐
๐On October 24th, the organization is launching a crowdfunding campaign to raise funds for #OpenCV 5 development.
๐me in 2008 during my thesis work about face tracking; up to 50x faster than the previous SOTA. No chance to did it without OpenCV library and support from the community.
๐ฅSupport #OpenCV 5 to create the next-gen of researchers and scientists. Spread the voice: https://t.ly/UTukV
๐On October 24th, the organization is launching a crowdfunding campaign to raise funds for #OpenCV 5 development.
๐me in 2008 during my thesis work about face tracking; up to 50x faster than the previous SOTA. No chance to did it without OpenCV library and support from the community.
๐ฅSupport #OpenCV 5 to create the next-gen of researchers and scientists. Spread the voice: https://t.ly/UTukV
โค22๐8๐ฅ3๐ฉ1
๐SwimXYZ: Synthetic Swim๐
๐SwimXYZ: synthetic dataset for swimming, monocular videos annotated with ground truth 2D and 3D joints
๐Review https://t.ly/F-rdF
๐Paper arxiv.org/pdf/2310.04360.pdf
๐Data g-fiche.github.io/research-pages/swimxyz
๐SwimXYZ: synthetic dataset for swimming, monocular videos annotated with ground truth 2D and 3D joints
๐Review https://t.ly/F-rdF
๐Paper arxiv.org/pdf/2310.04360.pdf
๐Data g-fiche.github.io/research-pages/swimxyz
๐ฅ4๐2โค1๐ฑ1๐คฉ1
๐ TextPSG: PSG from Text ๐
๐A novel problem in #AI: Panoptic Scene Graph Generation from Purely Textual Descriptions (Caption-toPSG)
๐Review https://t.ly/UXEmk
๐Paper arxiv.org/pdf/2310.07056.pdf
๐Project vis-www.cs.umass.edu/TextPSG
๐Code github.com/chengyzhao/TextPSG
๐A novel problem in #AI: Panoptic Scene Graph Generation from Purely Textual Descriptions (Caption-toPSG)
๐Review https://t.ly/UXEmk
๐Paper arxiv.org/pdf/2310.07056.pdf
๐Project vis-www.cs.umass.edu/TextPSG
๐Code github.com/chengyzhao/TextPSG
๐ฅ9โค5๐3๐คฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ Full Human Motion ๐
๐OmniControl by Google is novel framework for text-conditioned human motion generation model based on diffusion process
๐Review https://t.ly/F_0Ov
๐Paper arxiv.org/pdf/2310.08580.pdf
๐Project neu-vi.github.io/omnicontrol/
๐OmniControl by Google is novel framework for text-conditioned human motion generation model based on diffusion process
๐Review https://t.ly/F_0Ov
๐Paper arxiv.org/pdf/2310.08580.pdf
๐Project neu-vi.github.io/omnicontrol/
๐5๐คฏ3๐ฅ2๐1๐ฑ1
๐ฆนโโ๏ธ Snap's Hyper-Realistic Human ๐ฆนโโ๏ธ
๐New diffusive #AI by Snap that generates in-the-wild human images with hyper-realism. Swipe the gallery, NUTS!๐
๐Gallery https://t.ly/cG74X
๐Paper arxiv.org/pdf/2310.08579.pdf
๐Project snap-research.github.io/HyperHuman
๐Code github.com/snap-research/HyperHuman
๐New diffusive #AI by Snap that generates in-the-wild human images with hyper-realism. Swipe the gallery, NUTS!๐
๐Gallery https://t.ly/cG74X
๐Paper arxiv.org/pdf/2310.08579.pdf
๐Project snap-research.github.io/HyperHuman
๐Code github.com/snap-research/HyperHuman
๐4๐ฅ1๐คฏ1๐ฑ1๐คฉ1๐คฃ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐AG3D clothed avatar from 2D๐
๐The novel SOTA in adversarial generative of realistic 3D people
๐Review https://t.ly/vnJO7
๐Project https://zj-dong.github.io/AG3D
๐Code https://github.com/zj-dong/AG3D
๐Paper zj-dong.github.io/AG3D/assets/paper.pdf
๐The novel SOTA in adversarial generative of realistic 3D people
๐Review https://t.ly/vnJO7
๐Project https://zj-dong.github.io/AG3D
๐Code https://github.com/zj-dong/AG3D
๐Paper zj-dong.github.io/AG3D/assets/paper.pdf
โค7๐4๐ฅ2๐ฅฐ2
This media is not supported in your browser
VIEW IN TELEGRAM
๐ฑPose-Format: All-in-One Pose๐ฑ
๐ Pose-format: a comprehensive toolkit designed for human pose: unified, flexible, and easy-to-use
๐Review https://t.ly/rFrhq
๐Paper arxiv.org/pdf/2310.09066.pdf
๐Code github.com/sign-language-processing/pose
๐ Pose-format: a comprehensive toolkit designed for human pose: unified, flexible, and easy-to-use
๐Review https://t.ly/rFrhq
๐Paper arxiv.org/pdf/2310.09066.pdf
๐Code github.com/sign-language-processing/pose
๐ฅ9๐คฏ4๐3๐ฑ2โก1๐ฉ1