✂️ VideoCutLER: Simple UVIS ✂️
👉VideoCutLER is a simple unsupervised video instance segmentation (UVIS) method that does not rely on optical flow
😎Review https://t.ly/PBBjG
😎Paper arxiv.org/pdf/2308.14710.pdf
😎Project people.eecs.berkeley.edu/~xdwang/projects/CutLER
😎Code github.com/facebookresearch/CutLER/tree/main/videocutler
🔥8👍3❤2🤯1
🐦 3D Pigeons Pose & Tracking 🐦
👉 3D-MuPPET: estimates and tracks 3D poses of pigeons from multiple views
😎Review https://t.ly/jfAJJ
😎Paper arxiv.org/pdf/2308.15316.pdf
😎Code github.com/alexhang212/3D-MuPPET/
🤣17🤯14👍4🥰2❤1🤩1
🎍RoboTAP: Dense Tracking for Few-Shot Imitation🎍
👉RoboTAP: a novel dense-tracking representation for robotic arms
😎Review https://t.ly/MCO_V
😎Paper arxiv.org/pdf/2308.15975.pdf
😎Project https://robotap.github.io/
😎Code github.com/deepmind/tapnet
🔥8👍2🤯2🤩1
⛺FACET: Fairness in Computer Vision⛺
👉#META AI releases a large, publicly available dataset for classification, detection & segmentation, aimed at probing potential performance disparities & challenges across sensitive demographic attributes
😎Review https://t.ly/mKn-t
😎Paper arxiv.org/pdf/2309.00035.pdf
😎Dataset https://facet.metademolab.com/
🔥10❤6👍4👏1
♊️ Doppelgangers in Structures ♊️
👉A novel learning-based approach for visual disambiguation: distinguishing illusory matches to produce correct, disambiguated #3D reconstructions
😎Review https://t.ly/9yLot
😎Paper arxiv.org/pdf/2309.02420.pdf
😎Code github.com/RuojinCai/Doppelgangers
😎Project doppelgangers-3d.github.io/
🔥8👍3🤯2👏1
🍃 Tracking Anything with Decoupled VOS 🍃
👉A novel VOS approach that extends SAM for open-world video segmentation with no user input required
😎Review https://t.ly/xeobR
😎Paper arxiv.org/pdf/2309.03903.pdf
😎Project hkchengrex.com/Tracking-Anything-with-DEVA
😎Code github.com/hkchengrex/Tracking-Anything-with-DEVA
😎Colab https://colab.research.google.com/drive/1OsyNVoV_7ETD1zIE8UWxL3NXxu12m_YZ
🔥13👍6🤯4❤2😢1🤩1
🪷 Diffusive Consistent Video Editing 🪷
👉 Weizmann Institute of Science unveils TokenFlow, a novel framework that leverages a text-to-image diffusion model for text-driven video editing
😎Review https://t.ly/ru8km
😎Paper arxiv.org/pdf/2307.10373.pdf
😎Project diffusion-tokenflow.github.io
😎Code github.com/omerbt/TokenFlow
❤9👍6🔥2🤯1😱1😢1
🔥🔥 #META's DINOv2 is now commercial! 🔥🔥
👉Universal features for image classification, instance retrieval, video understanding, depth estimation & semantic segmentation. Now suitable for commercial use.
😎Review https://t.ly/LNrGy
😎Paper arxiv.org/pdf/2304.07193.pdf
😎Code github.com/facebookresearch/dinov2
😎Demo dinov2.metademolab.com/
🔥15👍3❤1🤯1😱1
🧄FreeMan: towards #3D Humans 🧄
👉FreeMan: the first large-scale, real-world, multi-view dataset for #3D human pose estimation. 11M frames!
😎Review https://t.ly/ICxpA
😎Paper arxiv.org/pdf/2309.05073.pdf
😎Project wangjiongw.github.io/freeman
👏6🤯4🥰1
🦊 MagiCapture: HD Multi-Concept Portrait 🦊
👉KAIST unveils MagiCapture: integrating subject and style concepts to generate high-resolution portrait images using just a few subject and style references
😎Review https://t.ly/c9rOo
😎Paper https://arxiv.org/pdf/2309.06895.pdf
❤5🥰1
⚽ Dynamic NeRFs for Soccer ⚽
👉SoccerNeRF: a first attempt at applying "cheap" NeRFs to football, reconstructing soccer replays in space and time.
😎Review https://t.ly/Ywcvk
😎Paper arxiv.org/pdf/2309.06802.pdf
😎Project https://soccernerfs.isach.be/
😎Code github.com/iSach/SoccerNeRFs
🔥8❤4👍3🤩2🥰1
☢️ GlueStick: Graph Neural Matching ☢️
👉GlueStick is a joint deep matcher for points and lines that leverages the connectivity information between nodes to better glue them together
😎Review https://t.ly/Atxqo
😎Paper arxiv.org/pdf/2304.02008.pdf
😎Code https://github.com/cvg/GlueStick
🔥11👍4❤1🤯1🤩1
🫀CPR-Coach: Neural Cardiopulmonary Resuscitation🫀
👉CPR-Coach: fine-grained action recognition in cardiopulmonary resuscitation
😎Review https://t.ly/Qbg4K
😎Paper arxiv.org/pdf/2309.11718.pdf
😎Code github.com/Shunli-Wang/CPR-Coach
😎Project shunli-wang.github.io/CPR-Coach
❤7🔥3👏1
🧪 NeuralLabeling with NeRF 🧪
👉Annotating a scene by generating segmentation masks, affordance maps, 2D bounding boxes, 3D bounding boxes, 6DOF poses, depth maps & meshes.
😎Review https://t.ly/1GPsj
😎Paper arxiv.org/pdf/2309.11966.pdf
😎Code github.com/FlorisE/neural-labeling
😎Project florise.github.io/neural_labeling_web
👍5🤯3🔥2❤1🥰1
🍟 DE-ViT: detecting everything via DINOv2 🍟
👉DE-ViT: an open-set object detector based on a DINOv2 backbone. It's the new SOTA on the COCO & LVIS datasets
😎Review https://t.ly/_DAmt
😎Paper arxiv.org/pdf/2309.12969.pdf
😎Code https://github.com/mlzxy/devit
🔥8👍4❤1🤯1😱1
🛵CoTracker: fast transformer-tracker🛵
👉META's CoTracker is a fast transformer-based model that can track any point in a video
😎Review https://t.ly/M36A_
😎Paper arxiv.org/pdf/2307.07635.pdf
😎Project https://co-tracker.github.io/
😎Code github.com/facebookresearch/co-tracker
❤7👍4🤯2🔥1😱1
🌬️ Neural Blowing in Still Photos 🌬️
👉 A novel approach to animating human hair (and clothes) in still portraits
😎Review https://t.ly/HKG0t
😎Paper arxiv.org/pdf/2309.14207.pdf
😎Project nevergiveu.github.io/AutomaticHairBlowing
👍6🤯3🔥1👏1😍1🤣1
🌮 OW Indoor Segmentation 🌮
👉3D-OWIS is a novel open-world 3D indoor instance segmentation method (with an auto-labeling scheme) that separates known from unknown category labels
😎Review https://t.ly/-7ALf
😎Paper arxiv.org/pdf/2309.14338.pdf
😎Code github.com/aminebdj/3D-OWIS
👍6🔥1🥰1
🧱 Generating Scenes from Touch 🧱
👉#AI for synthesizing images from tactile signals (and vice versa), applied to a number of visuo-tactile synthesis tasks
😎Review https://t.ly/Gxr0L
😎Paper https://arxiv.org/pdf/2309.15117.pdf
😎Project https://fredfyyang.github.io/vision-from-touch
😎Code https://github.com/fredfyyang/vision-from-touch
🤯9👍6❤1🔥1👏1😱1
☕Decaf: 3D Face-Hand Interactions☕
👉The first learning-based MoCap method to track human hands interacting with human faces in #3D from single monocular RGB videos
😎Review https://t.ly/070Tj
😎Paper arxiv.org/pdf/2309.16670.pdf
😎Project vcai.mpi-inf.mpg.de/projects/Decaf
👍8🤯8🔥3❤1👏1