GitHub repos

z-x-yang/Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
Language: Jupyter Notebook
#interactive_segmentation #segment_anything #segment_anything_model #video_object_segmentation #visual_object_tracking
Stars: 474 Issues: 8 Forks: 43
https://github.com/z-x-yang/Segment-and-Track-Anything

GitHub

GitHub - z-x-yang/Segment-and-Track-Anything: An open-source project dedicated to tracking and segmenting any objects in videos…

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo...

2.2K views16:09

GitHub repos

OpenGVLab/Ask-Anything
a simple yet interesting tool for chatting about video with chatGPT, miniGPT4 and StableLM
Language: Python
#captioning_videos #chat #chatgpt #gradio #langchain #moss #stablelm #video #video_question_answering #video_understanding
Stars: 294 Issues: 2 Forks: 15
https://github.com/OpenGVLab/Ask-Anything

GitHub

GitHub - OpenGVLab/Ask-Anything: [CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs…

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS. - OpenGVLab/Ask-Anything

2.2K views10:09

GitHub repos

showlab/VLog
Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.
Language: Python
#chatgpt #langchain #large_language_model #video_language #whisper
Stars: 249 Issues: 1 Forks: 10
https://github.com/showlab/VLog

GitHub

GitHub - showlab/VLog: Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.

Transform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain. - showlab/VLog

2.4K views04:09

GitHub repos

OpenGVLab/InternChat
InternChat allows you to interact with ChatGPT by clicking, dragging and drawing using a pointing device.
Language: Python
#chatgpt #click #foundation_model #gpt #gpt_4 #gradio #husky #image_captioning #internimage #langchain #llama #llm #multimodal #ocr #sam #segment_anything #vicuna #video #video_generation #vqa
Stars: 231 Issues: 1 Forks: 10
https://github.com/OpenGVLab/InternChat

GitHub

GitHub - OpenGVLab/InternGPT: InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now…

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editin...

2.3K views22:10

GitHub repos

omerbt/TokenFlow
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow"
#stable_diffusion #text_to_image #text_to_video #tokenflow #video_editing
Stars: 310 Issues: 4 Forks: 13
https://github.com/omerbt/TokenFlow

GitHub

GitHub - omerbt/TokenFlow: Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing"…

Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024) - omerbt/TokenFlow

3.0K views16:15

GitHub repos

hkchengrex/Cutie
[arXiv 2023] Putting the Object Back Into Video Object Segmentation
Language: Python
#computer_vision #deep_learning #pytorch #segmentation #video_editing #video_object_segmentation #video_segmentation
Stars: 123 Issues: 1 Forks: 12
https://github.com/hkchengrex/Cutie

GitHub

GitHub - hkchengrex/Cutie: [CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation - hkchengrex/Cutie

1.9K views16:20

GitHub repos

arthur-qiu/LongerCrafter
Code for FreeNoise
Language: Python
#aigc #diffusion #generative_model #video_diffusion_model
Stars: 116 Issues: 2 Forks: 6
https://github.com/arthur-qiu/LongerCrafter

GitHub

GitHub - AILab-CVC/FreeNoise: [ICLR 2024] Code for FreeNoise based on VideoCrafter

[ICLR 2024] Code for FreeNoise based on VideoCrafter - AILab-CVC/FreeNoise

1.9K views04:20

GitHub repos

TianxingWu/FreeInit
FreeInit: Bridging Initialization Gap in Video Diffusion Models
Language: Python
#aigc #text_to_video #video_diffusion_model #video_generation
Stars: 162 Issues: 4 Forks: 7
https://github.com/TianxingWu/FreeInit

GitHub

GitHub - TianxingWu/FreeInit: [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models

[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models - TianxingWu/FreeInit

2.0K views05:23

GitHub repos

ali-vilab/dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Language: Python
#audio_visual_learning #face_animation #talking_head #video_generation
Stars: 217 Issues: 7 Forks: 20
https://github.com/ali-vilab/dreamtalk

GitHub

GitHub - ali-vilab/dreamtalk: Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion…

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models - ali-vilab/dreamtalk

2.2K views17:24

GitHub repos

mayuelala/FollowYourClick
[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"
#image_animation #image_to_video_generation #video_generation
Stars: 445 Issues: 0 Forks: 10
https://github.com/mayuelala/FollowYourClick

GitHub

GitHub - mayuelala/FollowYourClick: [AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click:…

[AAAI 2025] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts" - GitHub - mayuelala/Foll...

3.2K views16:28

About

Blog

Apps

Platform