GitHub repos

MasterBin-IIAU/Unicorn
[ECCV'22 Oral] Towards Grand Unification of Object Tracking
Language: Python
#multi_object_tracking_segmentation #multiple_object_tracking #object_tracking #single_object_tracking #video_object_segmentation
Stars: 132 Issues: 1 Forks: 5
https://github.com/MasterBin-IIAU/Unicorn

GitHub

GitHub - MasterBin-IIAU/Unicorn: [ECCV'22 Oral] Towards Grand Unification of Object Tracking

[ECCV'22 Oral] Towards Grand Unification of Object Tracking - MasterBin-IIAU/Unicorn

2.1K views16:18

GitHub repos

cvat-ai/cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Language: Open Policy Agent
#annotation #annotation_tool #annotations #bounding_box #computer_vision #computer_vision_annotation #dataset #deep_learning #image_annotation #image_classification #image_labeling #image_labeling_tool #imagenet #labeling #labeling_tool #semantic_segmentation #video_annotation #yolo
Stars: 99 Issues: 14 Forks: 4
https://github.com/cvat-ai/cvat

GitHub

GitHub - cvat-ai/cvat: Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams…

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale. - cvat-ai/cvat

2.3K views22:19

GitHub repos

wjf5203/VNext
Next-generation Video instance recognition framework on top of Detectron2 which supports SeqFormer(ECCV Oral) and IDOL(ECCV Oral))
Language: Python
#instance_segmentation #object_detection #tracking #transformer #video_instance_segmentation
Stars: 109 Issues: 0 Forks: 4
https://github.com/wjf5203/VNext

GitHub

GitHub - wjf5203/VNext: Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR…

Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer(ECCV Oral), and IDOL(ECCV Oral)) - wjf5203/VNext

2.1K views22:19

GitHub repos

YuzukiHD/YuzukiLOHCC-PRO
Low cost USB3.2Gen1 HDMI-USB Video Acquisition With Loop Out (Loop Out HDMI Capture Card) base on MS2130 & MS9332
#hdmi #hdmi2usb #video_acquisition
Stars: 134 Issues: 0 Forks: 9
https://github.com/YuzukiHD/YuzukiLOHCC-PRO

GitHub

GitHub - YuzukiHD/YuzukiLOHCC-PRO: Low cost USB3.2Gen1 HDMI-USB Video Acquisition With Loop Out (Loop Out HDMI Capture Card) base…

Low cost USB3.2Gen1 HDMI-USB Video Acquisition With Loop Out (Loop Out HDMI Capture Card) base on MS2130 & MS9332 - YuzukiHD/YuzukiLOHCC-PRO

2.2K views10:23

GitHub repos

jaketae/storyteller
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
Language: Python
#gpt #image_generation #pytorch #stable_diffusion #text_to_image #text_to_speech #text_to_video #video_generation
Stars: 119 Issues: 1 Forks: 6
https://github.com/jaketae/storyteller

GitHub

GitHub - jaketae/storyteller: Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech

Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech - jaketae/storyteller

2.4K views23:03

GitHub repos

DvorakDwarf/Infinite-Storage-Glitch
ISG lets you use YouTube as cloud storage for ANY files, not just video
Language: Rust
#rust #storage #terminal #tui #video #youtube
Stars: 1079 Issues: 5 Forks: 36
https://github.com/DvorakDwarf/Infinite-Storage-Glitch

GitHub

GitHub - DvorakDwarf/Infinite-Storage-Glitch: ISG lets you use YouTube as cloud storage for ANY files, not just video

ISG lets you use YouTube as cloud storage for ANY files, not just video - DvorakDwarf/Infinite-Storage-Glitch

5.4K views05:06

GitHub repos

z-x-yang/Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
Language: Jupyter Notebook
#interactive_segmentation #segment_anything #segment_anything_model #video_object_segmentation #visual_object_tracking
Stars: 474 Issues: 8 Forks: 43
https://github.com/z-x-yang/Segment-and-Track-Anything

GitHub

GitHub - z-x-yang/Segment-and-Track-Anything: An open-source project dedicated to tracking and segmenting any objects in videos…

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo...

2.2K views16:09

GitHub repos

OpenGVLab/Ask-Anything
a simple yet interesting tool for chatting about video with chatGPT, miniGPT4 and StableLM
Language: Python
#captioning_videos #chat #chatgpt #gradio #langchain #moss #stablelm #video #video_question_answering #video_understanding
Stars: 294 Issues: 2 Forks: 15
https://github.com/OpenGVLab/Ask-Anything

GitHub

GitHub - OpenGVLab/Ask-Anything: [CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs…

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS. - OpenGVLab/Ask-Anything

2.1K views10:09

GitHub repos

showlab/VLog
Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.
Language: Python
#chatgpt #langchain #large_language_model #video_language #whisper
Stars: 249 Issues: 1 Forks: 10
https://github.com/showlab/VLog

GitHub

GitHub - showlab/VLog: Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.

Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain. - showlab/VLog

2.4K views04:09

GitHub repos

OpenGVLab/InternChat
InternChat allows you to interact with ChatGPT by clicking, dragging and drawing using a pointing device.
Language: Python
#chatgpt #click #foundation_model #gpt #gpt_4 #gradio #husky #image_captioning #internimage #langchain #llama #llm #multimodal #ocr #sam #segment_anything #vicuna #video #video_generation #vqa
Stars: 231 Issues: 1 Forks: 10
https://github.com/OpenGVLab/InternChat

GitHub

GitHub - OpenGVLab/InternGPT: InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now…

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editin...

2.3K views22:10

GitHub repos

omerbt/TokenFlow
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow"
#stable_diffusion #text_to_image #text_to_video #tokenflow #video_editing
Stars: 310 Issues: 4 Forks: 13
https://github.com/omerbt/TokenFlow

GitHub

GitHub - omerbt/TokenFlow: Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing"…

Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024) - omerbt/TokenFlow

3.0K views16:15

GitHub repos

hkchengrex/Cutie
[arXiv 2023] Putting the Object Back Into Video Object Segmentation
Language: Python
#computer_vision #deep_learning #pytorch #segmentation #video_editing #video_object_segmentation #video_segmentation
Stars: 123 Issues: 1 Forks: 12
https://github.com/hkchengrex/Cutie

GitHub

GitHub - hkchengrex/Cutie: [CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation - hkchengrex/Cutie

1.9K views16:20

GitHub repos

arthur-qiu/LongerCrafter
Code for FreeNoise
Language: Python
#aigc #diffusion #generative_model #video_diffusion_model
Stars: 116 Issues: 2 Forks: 6
https://github.com/arthur-qiu/LongerCrafter

GitHub

GitHub - AILab-CVC/FreeNoise: [ICLR 2024] Code for FreeNoise based on VideoCrafter

[ICLR 2024] Code for FreeNoise based on VideoCrafter - AILab-CVC/FreeNoise

1.8K views04:20

GitHub repos

TianxingWu/FreeInit
FreeInit: Bridging Initialization Gap in Video Diffusion Models
Language: Python
#aigc #text_to_video #video_diffusion_model #video_generation
Stars: 162 Issues: 4 Forks: 7
https://github.com/TianxingWu/FreeInit

GitHub

GitHub - TianxingWu/FreeInit: [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models

[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models - TianxingWu/FreeInit

1.9K views05:23

GitHub repos

ali-vilab/dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Language: Python
#audio_visual_learning #face_animation #talking_head #video_generation
Stars: 217 Issues: 7 Forks: 20
https://github.com/ali-vilab/dreamtalk

GitHub

GitHub - ali-vilab/dreamtalk: Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion…

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models - ali-vilab/dreamtalk

2.2K views17:24

GitHub repos

mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
#image_animation #image_to_video_generation #video_generation
Stars: 445 Issues: 0 Forks: 10
https://github.com/mayuelala/FollowYourClick

GitHub

GitHub - mayuelala/FollowYourClick: [arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click:…

[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts" - GitHub - mayuelala/Fol...

3.1K views16:28

GitHub repos

PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Language: Python
#diffusion_models #long_video_generation #metamorphic_video_generation #open_sora_plan #text_to_video #time_lapse #time_lapse_dataset #video_generation
Stars: 281 Issues: 4 Forks: 16
https://github.com/PKU-YuanGroup/MagicTime

GitHub

GitHub - PKU-YuanGroup/MagicTime: MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators - PKU-YuanGroup/MagicTime

2.6K views16:30

GitHub repos

BradyFU/Video-MME
✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
Language: Python
#large_language_models #large_vision_language_models #mme #multimodal_large_language_models #video #video_mme
Stars: 182 Issues: 1 Forks: 6
https://github.com/BradyFU/Video-MME

GitHub

GitHub - BradyFU/Video-MME: ✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis - BradyFU/Video-MME

2.0K views04:00

GitHub repos

fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language: Python
#face_animation #image_animation #video_animation
Stars: 653 Issues: 5 Forks: 102
https://github.com/fudan-generative-vision/hallo

GitHub

GitHub - fudan-generative-vision/hallo: Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation - fudan-generative-vision/hallo

3.0K views10:00

GitHub repos

SuperViz/superviz
SuperViz provides programmable low-code Collaboration and Communication components for web applications.
Language: TypeScript
#autodesk #autodesk_forge #collaboration #comments #crdt #matterport #multiplayer #presence #react #reactflow #real_time #superviz #three #video_conferencing #webrtc #websockets #yjs #yjs_provider
Stars: 198 Issues: 5 Forks: 0
https://github.com/SuperViz/superviz

GitHub

GitHub - SuperViz/superviz: SuperViz provides powerful SDKs and APIs that enable developers to easily integrate real-time features…

SuperViz provides powerful SDKs and APIs that enable developers to easily integrate real-time features into web applications. Our platform accelerates development across various industries with rob...

1.7K views22:00

About

Blog

Apps

Platform