MasterBin-IIAU/Unicorn
[ECCV'22 Oral] Towards Grand Unification of Object Tracking
Language: Python
#multi_object_tracking_segmentation #multiple_object_tracking #object_tracking #single_object_tracking #video_object_segmentation
Stars: 132 Issues: 1 Forks: 5
https://github.com/MasterBin-IIAU/Unicorn
cvat-ai/cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Language: Open Policy Agent
#annotation #annotation_tool #annotations #bounding_box #computer_vision #computer_vision_annotation #dataset #deep_learning #image_annotation #image_classification #image_labeling #image_labeling_tool #imagenet #labeling #labeling_tool #semantic_segmentation #video_annotation #yolo
Stars: 99 Issues: 14 Forks: 4
https://github.com/cvat-ai/cvat
wjf5203/VNext
Next-generation video instance recognition framework on top of Detectron2, which supports SeqFormer (ECCV Oral) and IDOL (ECCV Oral)
Language: Python
#instance_segmentation #object_detection #tracking #transformer #video_instance_segmentation
Stars: 109 Issues: 0 Forks: 4
https://github.com/wjf5203/VNext
GitHub
GitHub - wjf5203/VNext: Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR…
Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer(ECCV Oral), and IDOL(ECCV Oral)) - wjf5203/VNext
YuzukiHD/YuzukiLOHCC-PRO
Low-cost USB3.2Gen1 HDMI-USB Video Acquisition With Loop Out (Loop Out HDMI Capture Card) based on MS2130 & MS9332
#hdmi #hdmi2usb #video_acquisition
Stars: 134 Issues: 0 Forks: 9
https://github.com/YuzukiHD/YuzukiLOHCC-PRO
jaketae/storyteller
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
Language: Python
#gpt #image_generation #pytorch #stable_diffusion #text_to_image #text_to_speech #text_to_video #video_generation
Stars: 119 Issues: 1 Forks: 6
https://github.com/jaketae/storyteller
DvorakDwarf/Infinite-Storage-Glitch
ISG lets you use YouTube as cloud storage for ANY files, not just video
Language: Rust
#rust #storage #terminal #tui #video #youtube
Stars: 1079 Issues: 5 Forks: 36
https://github.com/DvorakDwarf/Infinite-Storage-Glitch
z-x-yang/Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
Language: Jupyter Notebook
#interactive_segmentation #segment_anything #segment_anything_model #video_object_segmentation #visual_object_tracking
Stars: 474 Issues: 8 Forks: 43
https://github.com/z-x-yang/Segment-and-Track-Anything
OpenGVLab/Ask-Anything
A simple yet interesting tool for chatting about video with ChatGPT, miniGPT4 and StableLM
Language: Python
#captioning_videos #chat #chatgpt #gradio #langchain #moss #stablelm #video #video_question_answering #video_understanding
Stars: 294 Issues: 2 Forks: 15
https://github.com/OpenGVLab/Ask-Anything
GitHub
GitHub - OpenGVLab/Ask-Anything: [CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs…
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS. - OpenGVLab/Ask-Anything
showlab/VLog
Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.
Language: Python
#chatgpt #langchain #large_language_model #video_language #whisper
Stars: 249 Issues: 1 Forks: 10
https://github.com/showlab/VLog
OpenGVLab/InternChat
InternChat allows you to interact with ChatGPT by clicking, dragging and drawing using a pointing device.
Language: Python
#chatgpt #click #foundation_model #gpt #gpt_4 #gradio #husky #image_captioning #internimage #langchain #llama #llm #multimodal #ocr #sam #segment_anything #vicuna #video #video_generation #vqa
Stars: 231 Issues: 1 Forks: 10
https://github.com/OpenGVLab/InternChat
GitHub
GitHub - OpenGVLab/InternGPT: InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now…
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editin...
omerbt/TokenFlow
Official PyTorch implementation of "TokenFlow: Consistent Diffusion Features for Consistent Video Editing"
#stable_diffusion #text_to_image #text_to_video #tokenflow #video_editing
Stars: 310 Issues: 4 Forks: 13
https://github.com/omerbt/TokenFlow
GitHub
GitHub - omerbt/TokenFlow: Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing"…
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024) - omerbt/TokenFlow
hkchengrex/Cutie
[arXiv 2023] Putting the Object Back Into Video Object Segmentation
Language: Python
#computer_vision #deep_learning #pytorch #segmentation #video_editing #video_object_segmentation #video_segmentation
Stars: 123 Issues: 1 Forks: 12
https://github.com/hkchengrex/Cutie
GitHub
GitHub - hkchengrex/Cutie: [CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation
[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation - hkchengrex/Cutie
arthur-qiu/LongerCrafter
Code for FreeNoise
Language: Python
#aigc #diffusion #generative_model #video_diffusion_model
Stars: 116 Issues: 2 Forks: 6
https://github.com/arthur-qiu/LongerCrafter
GitHub
GitHub - AILab-CVC/FreeNoise: [ICLR 2024] Code for FreeNoise based on VideoCrafter
[ICLR 2024] Code for FreeNoise based on VideoCrafter - AILab-CVC/FreeNoise
TianxingWu/FreeInit
FreeInit: Bridging Initialization Gap in Video Diffusion Models
Language: Python
#aigc #text_to_video #video_diffusion_model #video_generation
Stars: 162 Issues: 4 Forks: 7
https://github.com/TianxingWu/FreeInit
GitHub
GitHub - TianxingWu/FreeInit: [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models - TianxingWu/FreeInit
ali-vilab/dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Language: Python
#audio_visual_learning #face_animation #talking_head #video_generation
Stars: 217 Issues: 7 Forks: 20
https://github.com/ali-vilab/dreamtalk
mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
#image_animation #image_to_video_generation #video_generation
Stars: 445 Issues: 0 Forks: 10
https://github.com/mayuelala/FollowYourClick
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Language: Python
#diffusion_models #long_video_generation #metamorphic_video_generation #open_sora_plan #text_to_video #time_lapse #time_lapse_dataset #video_generation
Stars: 281 Issues: 4 Forks: 16
https://github.com/PKU-YuanGroup/MagicTime
BradyFU/Video-MME
✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
Language: Python
#large_language_models #large_vision_language_models #mme #multimodal_large_language_models #video #video_mme
Stars: 182 Issues: 1 Forks: 6
https://github.com/BradyFU/Video-MME
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Language: Python
#face_animation #image_animation #video_animation
Stars: 653 Issues: 5 Forks: 102
https://github.com/fudan-generative-vision/hallo
SuperViz/superviz
SuperViz provides programmable low-code Collaboration and Communication components for web applications.
Language: TypeScript
#autodesk #autodesk_forge #collaboration #comments #crdt #matterport #multiplayer #presence #react #reactflow #real_time #superviz #three #video_conferencing #webrtc #websockets #yjs #yjs_provider
Stars: 198 Issues: 5 Forks: 0
https://github.com/SuperViz/superviz
GitHub
GitHub - SuperViz/superviz: SuperViz provides powerful SDKs and APIs that enable developers to easily integrate real-time features…
SuperViz provides powerful SDKs and APIs that enable developers to easily integrate real-time features into web applications. Our platform accelerates development across various industries with rob...