GitHub repos

wjf5203/VNext
Next-generation video instance recognition framework on top of Detectron2 which supports SeqFormer (ECCV Oral) and IDOL (ECCV Oral)
Language: Python
#instance_segmentation #object_detection #tracking #transformer #video_instance_segmentation
Stars: 109 Issues: 0 Forks: 4
https://github.com/wjf5203/VNext

GitHub

GitHub - wjf5203/VNext: Next-generation video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer (ECCV Oral), and IDOL (ECCV Oral)


open-mmlab/mmyolo
OpenMMLab YOLO series toolbox and benchmark
Language: Python
#object_detection #pytorch #yolo #yolov5 #yolov6 #yolox
Stars: 285 Issues: 7 Forks: 11
https://github.com/open-mmlab/mmyolo

GitHub

GitHub - open-mmlab/mmyolo: OpenMMLab YOLO series toolbox and benchmark. Implemented RTMDet, RTMDet-Rotated, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOX, PPYOLOE, etc.


roboflow-ai/notebooks
Set of Jupyter notebooks linked to Roboflow blog posts and used in our YouTube videos.
Language: Jupyter Notebook
#computer_vision #deep_learning #deep_neural_networks #image_classification #image_segmentation #object_detection #pytorch #tutorial #yolov5 #yolov6 #yolov7
Stars: 126 Issues: 1 Forks: 14
https://github.com/roboflow-ai/notebooks

GitHub

GitHub - roboflow/notebooks: A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM ...


tinyvision/DAMO-YOLO
DAMO-YOLO: a fast and accurate object detection method with several new techniques, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.
Language: Python
#deep_learning #nas #object_detection #onnx #pytorch #tensorrt #yolo #yolov5
Stars: 163 Issues: 8 Forks: 15
https://github.com/tinyvision/DAMO-YOLO



kadirnar/segment-anything-video
MetaSeg: Packaged version of the Segment Anything repository
Language: Python
#object_detection #object_segmentation #segment_anything #segmentation
Stars: 337 Issues: 4 Forks: 22
https://github.com/kadirnar/segment-anything-video



OpenGVLab/VisionLLM
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks
#generalist_model #large_language_models #object_detection
Stars: 205 Issues: 1 Forks: 2
https://github.com/OpenGVLab/VisionLLM

GitHub

GitHub - OpenGVLab/VisionLLM: VisionLLM Series


roboflow/multimodal-maestro
Effective prompting for Large Multimodal Models like GPT-4 Vision or LLaVA. 🔥
Language: Python
#cross_modal #gpt_4 #gpt_4_vision #instance_segmentation #llava #lmm #multimodality #object_detection #prompt_engineering #segment_anything #vision_language_model #visual_prompting
Stars: 367 Issues: 1 Forks: 23
https://github.com/roboflow/multimodal-maestro

GitHub

GitHub - roboflow/maestro: streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL


FoundationVision/GLEE
GLEE: General Object Foundation Model for Images and Videos at Scale
Language: Python
#foundation_model #object_detection #open_world #tracking
Stars: 153 Issues: 3 Forks: 9
https://github.com/FoundationVision/GLEE

GitHub

GitHub - FoundationVision/GLEE: [CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale


IDEA-Research/Grounding-DINO-1.5-API
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
Language: Python
#grounding_dino #object_detection #open_set
Stars: 228 Issues: 7 Forks: 7
https://github.com/IDEA-Research/Grounding-DINO-1.5-API


roboflow/rf-detr
RF-DETR is a real-time object detection model architecture developed by Roboflow, released under the Apache 2.0 license.
Language: Python
#computer_vision #detr #machine_learning #object_detection #rf_detr
Stars: 292 Issues: 3 Forks: 19
https://github.com/roboflow/rf-detr

GitHub

GitHub - roboflow/rf-detr: RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO and designed for fine-tuning.


iMoonLab/yolov13
Implementation of "YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception".
Language: Python
#correlation_modelling #hypergraph_learning #object_detection #real_time_object_detection #visual_recognition #yolo #yolov13
Stars: 230 Issues: 14 Forks: 22
https://github.com/iMoonLab/yolov13

