wjf5203/VNext
Next-generation Video instance recognition framework on top of Detectron2 which supports SeqFormer(ECCV Oral) and IDOL(ECCV Oral))
Language: Python
#instance_segmentation #object_detection #tracking #transformer #video_instance_segmentation
Stars: 109 Issues: 0 Forks: 4
https://github.com/wjf5203/VNext
Next-generation Video instance recognition framework on top of Detectron2 which supports SeqFormer(ECCV Oral) and IDOL(ECCV Oral))
Language: Python
#instance_segmentation #object_detection #tracking #transformer #video_instance_segmentation
Stars: 109 Issues: 0 Forks: 4
https://github.com/wjf5203/VNext
GitHub
GitHub - wjf5203/VNext: Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR…
Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer(ECCV Oral), and IDOL(ECCV Oral)) - wjf5203/VNext
open-mmlab/mmyolo
OpenMMLab YOLO series toolbox and benchmark
Language: Python
#object_detection #pytorch #yolo #yolov5 #yolov6 #yolox
Stars: 285 Issues: 7 Forks: 11
https://github.com/open-mmlab/mmyolo
OpenMMLab YOLO series toolbox and benchmark
Language: Python
#object_detection #pytorch #yolo #yolov5 #yolov6 #yolox
Stars: 285 Issues: 7 Forks: 11
https://github.com/open-mmlab/mmyolo
GitHub
GitHub - open-mmlab/mmyolo: OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated,YOLOv5, YOLOv6, YOLOv7…
OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated,YOLOv5, YOLOv6, YOLOv7, YOLOv8,YOLOX, PPYOLOE, etc. - open-mmlab/mmyolo
roboflow-ai/notebooks
Set of Jupyter Notebooks linked to Roboflow Blogpost and used in our YouTube videos.
Language: Jupyter Notebook
#computer_vision #deep_learning #deep_neural_networks #image_classification #image_segmentation #object_detection #pytorch #tutorial #yolov5 #yolov6 #yolov7
Stars: 126 Issues: 1 Forks: 14
https://github.com/roboflow-ai/notebooks
Set of Jupyter Notebooks linked to Roboflow Blogpost and used in our YouTube videos.
Language: Jupyter Notebook
#computer_vision #deep_learning #deep_neural_networks #image_classification #image_segmentation #object_detection #pytorch #tutorial #yolov5 #yolov6 #yolov7
Stars: 126 Issues: 1 Forks: 14
https://github.com/roboflow-ai/notebooks
GitHub
GitHub - roboflow/notebooks: This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision…
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-e...
tinyvision/DAMO-YOLO
DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.
Language: Python
#deep_learning #nas #object_detection #onnx #pytorch #tensorrt #yolo #yolov5
Stars: 163 Issues: 8 Forks: 15
https://github.com/tinyvision/DAMO-YOLO
DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.
Language: Python
#deep_learning #nas #object_detection #onnx #pytorch #tensorrt #yolo #yolov5
Stars: 163 Issues: 8 Forks: 15
https://github.com/tinyvision/DAMO-YOLO
GitHub
GitHub - tinyvision/DAMO-YOLO: DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones…
DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement. - tinyvision/DAMO-YOLO
kadirnar/segment-anything-video
MetaSeg: Packaged version of the Segment Anything repository
Language: Python
#object_detection #object_segmentation #segment_anything #segmentation
Stars: 337 Issues: 4 Forks: 22
https://github.com/kadirnar/segment-anything-video
MetaSeg: Packaged version of the Segment Anything repository
Language: Python
#object_detection #object_segmentation #segment_anything #segmentation
Stars: 337 Issues: 4 Forks: 22
https://github.com/kadirnar/segment-anything-video
GitHub
GitHub - kadirnar/segment-anything-video: MetaSeg: Packaged version of the Segment Anything repository
MetaSeg: Packaged version of the Segment Anything repository - kadirnar/segment-anything-video
OpenGVLab/VisionLLM
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
#generalist_model #large_language_models #object_detection
Stars: 205 Issues: 1 Forks: 2
https://github.com/OpenGVLab/VisionLLM
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
#generalist_model #large_language_models #object_detection
Stars: 205 Issues: 1 Forks: 2
https://github.com/OpenGVLab/VisionLLM
GitHub
GitHub - OpenGVLab/VisionLLM: VisionLLM Series
VisionLLM Series. Contribute to OpenGVLab/VisionLLM development by creating an account on GitHub.
roboflow/multimodal-maestro
Effective prompting for Large Multimodal Models like GPT-4 Vision or LLaVA. 🔥
Language: Python
#cross_modal #gpt_4 #gpt_4_vision #instance_segmentation #llava #lmm #multimodality #object_detection #prompt_engineering #segment_anything #vision_language_model #visual_prompting
Stars: 367 Issues: 1 Forks: 23
https://github.com/roboflow/multimodal-maestro
Effective prompting for Large Multimodal Models like GPT-4 Vision or LLaVA. 🔥
Language: Python
#cross_modal #gpt_4 #gpt_4_vision #instance_segmentation #llava #lmm #multimodality #object_detection #prompt_engineering #segment_anything #vision_language_model #visual_prompting
Stars: 367 Issues: 1 Forks: 23
https://github.com/roboflow/multimodal-maestro
GitHub
GitHub - roboflow/maestro: streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL - roboflow/maestro
FoundationVision/GLEE
GLEE: General Object Foundation Model for Images and Videos at Scale
Language: Python
#foundation_model #object_detection #open_world #tracking
Stars: 153 Issues: 3 Forks: 9
https://github.com/FoundationVision/GLEE
GLEE: General Object Foundation Model for Images and Videos at Scale
Language: Python
#foundation_model #object_detection #open_world #tracking
Stars: 153 Issues: 3 Forks: 9
https://github.com/FoundationVision/GLEE
GitHub
GitHub - FoundationVision/GLEE: [CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale
[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale - FoundationVision/GLEE
IDEA-Research/Grounding-DINO-1.5-API
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
Language: Python
#grounding_dino #object_detection #open_set
Stars: 228 Issues: 7 Forks: 7
https://github.com/IDEA-Research/Grounding-DINO-1.5-API
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
Language: Python
#grounding_dino #object_detection #open_set
Stars: 228 Issues: 7 Forks: 7
https://github.com/IDEA-Research/Grounding-DINO-1.5-API
GitHub
GitHub - IDEA-Research/Grounding-DINO-1.5-API: Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model…
Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series - IDEA-Research/Grounding-DINO-1.5-API
roboflow/rf-detr
RF-DETR is a real-time object detection model architecture developed by Roboflow, released under the Apache 2.0 license.
Language: Python
#computer_vision #detr #machine_learning #object_detection #rf_detr
Stars: 292 Issues: 3 Forks: 19
https://github.com/roboflow/rf-detr
RF-DETR is a real-time object detection model architecture developed by Roboflow, released under the Apache 2.0 license.
Language: Python
#computer_vision #detr #machine_learning #object_detection #rf_detr
Stars: 292 Issues: 3 Forks: 19
https://github.com/roboflow/rf-detr
GitHub
GitHub - roboflow/rf-detr: RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed…
RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning. - roboflow/rf-detr