๐ป CatFLW: Cat Neural Landmarks ๐ป
๐Landmark convolution neural network-based model for cat faces
๐Review https://t.ly/Y3mQ8
๐Paper arxiv.org/pdf/2305.04232.pdf
๐Dataset www.tech4animals.org/catflw
๐Landmark convolution neural network-based model for cat faces
๐Review https://t.ly/Y3mQ8
๐Paper arxiv.org/pdf/2305.04232.pdf
๐Dataset www.tech4animals.org/catflw
๐ฅฐ17โค4๐3๐ฑ1๐คฉ1๐1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ก4K4D: Real-Time 4D at 4K๐ก
๐THE new SOTA in view synthesis of dynamic 3D scenes at 4K. 30x faster, up to 400 FPS. Nuts!
๐Review https://t.ly/6ddQh
๐Paper arxiv.org/pdf/2310.11448.pdf
๐Project zju3dv.github.io/4k4d/
๐Code github.com/zju3dv/4K4D
๐THE new SOTA in view synthesis of dynamic 3D scenes at 4K. 30x faster, up to 400 FPS. Nuts!
๐Review https://t.ly/6ddQh
๐Paper arxiv.org/pdf/2310.11448.pdf
๐Project zju3dv.github.io/4k4d/
๐Code github.com/zju3dv/4K4D
๐ฅ8๐5๐คฏ5โค1๐ฑ1๐คฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ฃ๏ธ Holistic Parking Detection (YOLO) ๐ฃ๏ธ
๐ One-step Holistic Parking Slot Network: a tailor-made adaptation of YOLOv4 algorithm for all-shaped parking slot detection
๐Review https://t.ly/2l4ZG
๐Paper arxiv.org/pdf/2310.11629.pdf
๐ One-step Holistic Parking Slot Network: a tailor-made adaptation of YOLOv4 algorithm for all-shaped parking slot detection
๐Review https://t.ly/2l4ZG
๐Paper arxiv.org/pdf/2310.11629.pdf
๐ฅ8๐คฏ6โค4๐คฉ3๐1๐พ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ Cutie: VOS with heavy occlusions๐
๐Cutie: novel VOS for challenging scenarios with heavy occlusions & distractors
๐Review https://t.ly/W3FR-
๐Paper arxiv.org/pdf/2310.12982.pdf
๐Project https://hkchengrex.com/Cutie
๐Code https://github.com/hkchengrex/Cutie
๐Cutie: novel VOS for challenging scenarios with heavy occlusions & distractors
๐Review https://t.ly/W3FR-
๐Paper arxiv.org/pdf/2310.12982.pdf
๐Project https://hkchengrex.com/Cutie
๐Code https://github.com/hkchengrex/Cutie
๐13๐คฃ3โค1๐คฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐งก Rotoscoping Prince Of Persia (1985) ๐งก
๐ A rare footage for the animation of Prince of Persia (1989). Damn Romantic.
๐ More https://t.ly/xJife
๐ A rare footage for the animation of Prince of Persia (1989). Damn Romantic.
๐ More https://t.ly/xJife
โค17๐2๐2๐ฅฐ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ชPACE: new SOTA Motion๐ช
๐#Nvidia unveils the novel SOTA to estimate the human motion in a global scene from moving cams. Stunning results.
๐Review https://t.ly/20you
๐Project https://nvlabs.github.io/PACE
๐Paper https://arxiv.org/pdf/2310.13768.pdf
๐#Nvidia unveils the novel SOTA to estimate the human motion in a global scene from moving cams. Stunning results.
๐Review https://t.ly/20you
๐Project https://nvlabs.github.io/PACE
๐Paper https://arxiv.org/pdf/2310.13768.pdf
๐คฃ5โค4๐ฅ1๐คฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ฅคNanoSAM: SAM on low-cost boards๐ฅค
๐NanoSAM is a Segment Anything variant capable of running in real-time on #NVIDIA Jetson Orin with TensorRT
๐Review https://t.ly/UErq_
๐Tutorial https://github.com/NVIDIA-AI-IOT/nanosam
๐NanoSAM is a Segment Anything variant capable of running in real-time on #NVIDIA Jetson Orin with TensorRT
๐Review https://t.ly/UErq_
๐Tutorial https://github.com/NVIDIA-AI-IOT/nanosam
๐ฅ11๐1๐1๐คฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ง SOTA RGB-D Video Salient Object ๐ง
๐ DCTNet+ (model) and RDVS(dataset) for a new SOTA in Video Saliency Object Detection
๐Review https://t.ly/DapLV
๐Code github.com/kerenfu/RDVS
๐Paper arxiv.org/pdf/2310.15482.pdf
๐ DCTNet+ (model) and RDVS(dataset) for a new SOTA in Video Saliency Object Detection
๐Review https://t.ly/DapLV
๐Code github.com/kerenfu/RDVS
๐Paper arxiv.org/pdf/2310.15482.pdf
๐ฅ4๐1๐คฏ1
This media is not supported in your browser
VIEW IN TELEGRAM
โ๏ธ Relighted 3D Hands ๐ค
๐#META unveils Re:InterHand: a large dataset of relighted 3D interacting hands
๐Review https://t.ly/I1dQk
๐Paper arxiv.org/pdf/2310.17768.pdf
๐Project mks0601.github.io/ReInterHand
๐Data github.com/mks0601/ReInterHand
๐#META unveils Re:InterHand: a large dataset of relighted 3D interacting hands
๐Review https://t.ly/I1dQk
๐Paper arxiv.org/pdf/2310.17768.pdf
๐Project mks0601.github.io/ReInterHand
๐Data github.com/mks0601/ReInterHand
๐คฏ8โค1๐ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ Video Understanding with GPT-4V(ision) ๐
๐ #Microsoft unveils MM-Vid, the most advanced video understanding framework (w/ #chatgpt4). Impressive results on long-form videos & intricate tasks such as audio description & multimodal high-level comprehension
๐Review https://t.ly/RISMm
๐Paper arxiv.org/pdf/2310.19773.pdf
๐Project https://multimodal-vid.github.io
๐ #Microsoft unveils MM-Vid, the most advanced video understanding framework (w/ #chatgpt4). Impressive results on long-form videos & intricate tasks such as audio description & multimodal high-level comprehension
๐Review https://t.ly/RISMm
๐Paper arxiv.org/pdf/2310.19773.pdf
๐Project https://multimodal-vid.github.io
๐คฏ22๐9๐ฅ2๐1๐ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ฃ Foot via Synthetic Data ๐ฃ
๐ 50,000 synthetic/photorealistic foot images + a novel SOTA library for foot
๐Review https://t.ly/TVanP
๐Paper https://arxiv.org/pdf/2310.18279.pdf
๐Project https://ollieboyne.github.io/FOUND
๐Code https://github.com/OllieBoyne/FOUND
๐ 50,000 synthetic/photorealistic foot images + a novel SOTA library for foot
๐Review https://t.ly/TVanP
๐Paper https://arxiv.org/pdf/2310.18279.pdf
๐Project https://ollieboyne.github.io/FOUND
๐Code https://github.com/OllieBoyne/FOUND
๐คฃ8๐4โค2๐ฅฐ2๐คฉ2
This media is not supported in your browser
VIEW IN TELEGRAM
๐ OYSTER: unsupervised detection w/ LIDAR ๐
๐Waabi unveils OYSTER: a novel unsupervised object detection from LiDAR point clouds.
๐Review https://t.ly/EMi58
๐Project https://waabi.ai/oyster/
๐Paper arxiv.org/pdf/2311.02007.pdf
๐Waabi unveils OYSTER: a novel unsupervised object detection from LiDAR point clouds.
๐Review https://t.ly/EMi58
๐Project https://waabi.ai/oyster/
๐Paper arxiv.org/pdf/2311.02007.pdf
โค15๐3๐ฅ2๐1
๐ฅGPT-4 Pass the Turing Test?๐ฅ
๐No. I mean...not yet. Read this Paper from UC San Diego๐
๐Review https://t.ly/o8HgM
๐Paper https://arxiv.org/pdf/2310.20216.pdf
๐No. I mean...not yet. Read this Paper from UC San Diego๐
๐Review https://t.ly/o8HgM
๐Paper https://arxiv.org/pdf/2310.20216.pdf
โค4๐ฅ3๐1๐คฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ฅปSF: Towards Virtual Cloth๐ฅป
๐SEA AI Lab unveils a novel #AI to recovery the garment sewing patterns from daily photos for #AR / #VR worlds
๐Review https://t.ly/MwpAV
๐Project https://sewformer.github.io/
๐Paper https://arxiv.org/pdf/2311.04218.pdf
๐Code https://github.com/sail-sg/sewformer
๐SEA AI Lab unveils a novel #AI to recovery the garment sewing patterns from daily photos for #AR / #VR worlds
๐Review https://t.ly/MwpAV
๐Project https://sewformer.github.io/
๐Paper https://arxiv.org/pdf/2311.04218.pdf
๐Code https://github.com/sail-sg/sewformer
๐4๐ฅ2๐ฅฐ2๐2๐คฏ1๐คฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐๏ธ 3DiffTection: new SOTA 3D detection ๐๏ธ
๐#Nvidia unveils 3DiffTection, the new SOTA for 3D object detection from single images. A powerful 3D detector powered by diffusion model
๐Review https://t.ly/PciXY
๐Paper https://arxiv.org/pdf/2311.04391.pdf
๐Code https://github.com/nv-tlabs/3DiffTection
๐Project research.nvidia.com/labs/toronto-ai/3difftection
๐#Nvidia unveils 3DiffTection, the new SOTA for 3D object detection from single images. A powerful 3D detector powered by diffusion model
๐Review https://t.ly/PciXY
๐Paper https://arxiv.org/pdf/2311.04391.pdf
๐Code https://github.com/nv-tlabs/3DiffTection
๐Project research.nvidia.com/labs/toronto-ai/3difftection
๐ฅ8โค6๐3๐ฑ3๐1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ช 30x Faster Neural Scenes ๐ช
๐ NeuRas: realistic real-time novel-view synthesis of VERY large scenes (>10000 m2 ). 30ร faster rendering than previous SOTA w/ comparable or better realism
๐Review https://t.ly/ELJSE
๐Paper https://arxiv.org/pdf/2311.05607.pdf
๐Project https://waabi.ai/NeuRas/
๐ NeuRas: realistic real-time novel-view synthesis of VERY large scenes (>10000 m2 ). 30ร faster rendering than previous SOTA w/ comparable or better realism
๐Review https://t.ly/ELJSE
๐Paper https://arxiv.org/pdf/2311.05607.pdf
๐Project https://waabi.ai/NeuRas/
๐ฅ9โค1๐1๐คฏ1๐คฉ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ฅ Hu.ma.ne #AI Pin is out! ๐ฅ
๐Hu.ma.ne just launched #AI Pin: the new standalone AI-powered screenless device. Running on the GPT-4 LLMs, suitable for real-time translation. #AI-powered camera and laser projector
๐ More https://t.ly/IvoN7
๐Hu.ma.ne just launched #AI Pin: the new standalone AI-powered screenless device. Running on the GPT-4 LLMs, suitable for real-time translation. #AI-powered camera and laser projector
๐ More https://t.ly/IvoN7
โค6๐ฅ4๐ฉ2๐1๐ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ซ Segmentation of Human ๐ซ
๐TotalSegmentator_v2: segmenting 104 anatomical structures (27 organs, 59 bones, 10 muscles, 8 vessels) in CT. Now suitable in 3D Slicer, open source platform for image visualization.
๐Review https://t.ly/yHMm1
๐Code https://lnkd.in/dvgrbsCE
๐Paper https://lnkd.in/dkwHuuzU
๐TotalSegmentator_v2: segmenting 104 anatomical structures (27 organs, 59 bones, 10 muscles, 8 vessels) in CT. Now suitable in 3D Slicer, open source platform for image visualization.
๐Review https://t.ly/yHMm1
๐Code https://lnkd.in/dvgrbsCE
๐Paper https://lnkd.in/dkwHuuzU
๐ฅ14๐7๐คฏ6๐ฑ2โค1๐คฉ1
๐ช Spacecraft Pose Estimation ๐ช
๐SnT (Luxembourg) unveils the most advanced event-based dataset for Spacecrafts: Unreal Engine + data from ICNS simulator + Real images + Real event data acquired in lab
๐Review https://t.ly/m8JPB
๐Paper https://lnkd.in/d_edvc3n
๐Project https://lnkd.in/dPp375aY
๐SnT (Luxembourg) unveils the most advanced event-based dataset for Spacecrafts: Unreal Engine + data from ICNS simulator + Real images + Real event data acquired in lab
๐Review https://t.ly/m8JPB
๐Paper https://lnkd.in/d_edvc3n
๐Project https://lnkd.in/dPp375aY
โค7๐คฏ2๐1๐ฑ1
This media is not supported in your browser
VIEW IN TELEGRAM
๐ฅFlorence-2: unified Computer Vision๐ฅ
๐#Microsoft announces Florence-2: novel foundation model with unified, prompt-based, representation for a large variety of #computervision & vision-language task. One backbone -> multiple tasks!
๐Review https://t.ly/pOins
๐Paper arxiv.org/pdf/2311.06242.pdf
๐Project www.microsoft.com/en-us/research/project/projectflorence/
๐#Microsoft announces Florence-2: novel foundation model with unified, prompt-based, representation for a large variety of #computervision & vision-language task. One backbone -> multiple tasks!
๐Review https://t.ly/pOins
๐Paper arxiv.org/pdf/2311.06242.pdf
๐Project www.microsoft.com/en-us/research/project/projectflorence/
๐ฑ9โค5๐ฅ3๐1๐1๐พ1