umlx5h/LLPlayer
The media player for language learning, with dual subtitles, AI-generated subtitles, realtime-OCR, translation, word lookup, and more!
Language: C#
#asr #csharp #flyleaf #language_learning #media_player #ocr #player #tesseract #video #video_player #whisper #wpf #yt_dlp
Stars: 253 Issues: 5 Forks: 4
https://github.com/umlx5h/LLPlayer
The media player for language learning, with dual subtitles, AI-generated subtitles, realtime-OCR, translation, word lookup, and more!
Language: C#
#asr #csharp #flyleaf #language_learning #media_player #ocr #player #tesseract #video #video_player #whisper #wpf #yt_dlp
Stars: 253 Issues: 5 Forks: 4
https://github.com/umlx5h/LLPlayer
GitHub
GitHub - umlx5h/LLPlayer: The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation…
The media player for language learning, with dual subtitles, AI-generated subtitles, real-time translation, and more! - umlx5h/LLPlayer
❤1👍1
FoundationVision/FlashVideo
FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
Language: Python
#efficient_generative_model #text_to_video #video_generation
Stars: 195 Issues: 5 Forks: 3
https://github.com/FoundationVision/FlashVideo
FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
Language: Python
#efficient_generative_model #text_to_video #video_generation
Stars: 195 Issues: 5 Forks: 3
https://github.com/FoundationVision/FlashVideo
GitHub
GitHub - FoundationVision/FlashVideo: FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation - FoundationVision/FlashVideo
SkyworkAI/SkyReels-V1
SkyReels V1: the first and most advanced open-source human-centric video foundation model
Language: Python
#i2v #t2v #video_diffusion_transformers
Stars: 348 Issues: 5 Forks: 20
https://github.com/SkyworkAI/SkyReels-V1
SkyReels V1: the first and most advanced open-source human-centric video foundation model
Language: Python
#i2v #t2v #video_diffusion_transformers
Stars: 348 Issues: 5 Forks: 20
https://github.com/SkyworkAI/SkyReels-V1
GitHub
GitHub - SkyworkAI/SkyReels-V1: SkyReels V1: The first and most advanced open-source human-centric video foundation model
SkyReels V1: The first and most advanced open-source human-centric video foundation model - SkyworkAI/SkyReels-V1
liuff19/Video-T1
Official Implementation of Video-T1: Test-Time Scaling for Video Generation
Language: Python
#aigc #chain_of_thought #test_time_scaling #video #video_generation
Stars: 187 Issues: 2 Forks: 12
https://github.com/liuff19/Video-T1
Official Implementation of Video-T1: Test-Time Scaling for Video Generation
Language: Python
#aigc #chain_of_thought #test_time_scaling #video #video_generation
Stars: 187 Issues: 2 Forks: 12
https://github.com/liuff19/Video-T1
GitHub
GitHub - liuff19/Video-T1: [ICCV 2025] Video-T1: Test-Time Scaling for Video Generation
[ICCV 2025] Video-T1: Test-Time Scaling for Video Generation - liuff19/Video-T1
👍1
TencentARC/GeometryCrafter
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Language: Python
#depth_estimation #video_to_4d
Stars: 173 Issues: 0 Forks: 3
https://github.com/TencentARC/GeometryCrafter
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Language: Python
#depth_estimation #video_to_4d
Stars: 173 Issues: 0 Forks: 3
https://github.com/TencentARC/GeometryCrafter
GitHub
GitHub - TencentARC/GeometryCrafter: [ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion…
[ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors - TencentARC/GeometryCrafter
hanyang-21/VideoScene
[CVPR 2025] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step
Language: Python
#3d_reconstruction #video #video_generation
Stars: 154 Issues: 4 Forks: 3
https://github.com/hanyang-21/VideoScene
[CVPR 2025] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step
Language: Python
#3d_reconstruction #video #video_generation
Stars: 154 Issues: 4 Forks: 3
https://github.com/hanyang-21/VideoScene
GitHub
GitHub - hanyang-21/VideoScene: [CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One…
[CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step - hanyang-21/VideoScene
❤2
ali-vilab/UniAnimate-DiT
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
Language: Python
#human_image_animation #video_diffusion_transformers #video_generation
Stars: 225 Issues: 5 Forks: 17
https://github.com/ali-vilab/UniAnimate-DiT
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
Language: Python
#human_image_animation #video_diffusion_transformers #video_generation
Stars: 225 Issues: 5 Forks: 17
https://github.com/ali-vilab/UniAnimate-DiT
GitHub
GitHub - ali-vilab/UniAnimate-DiT: UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer - ali-vilab/UniAnimate-DiT
❤1
SandAI-org/MAGI-1
MAGI-1: Autoregressive Video Generation at Scale
Language: Python
#autoregressive #diffusion_models #video_generation
Stars: 911 Issues: 7 Forks: 32
https://github.com/SandAI-org/MAGI-1
MAGI-1: Autoregressive Video Generation at Scale
Language: Python
#autoregressive #diffusion_models #video_generation
Stars: 911 Issues: 7 Forks: 32
https://github.com/SandAI-org/MAGI-1
GitHub
GitHub - SandAI-org/MAGI-1: MAGI-1: Autoregressive Video Generation at Scale
MAGI-1: Autoregressive Video Generation at Scale. Contribute to SandAI-org/MAGI-1 development by creating an account on GitHub.
👍1
Tencent/HunyuanCustom
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
Language: Python
#audio_driven #diffusion_models #image_to_video #image_to_video_generation #video_editing #video_generation
Stars: 360 Issues: 4 Forks: 14
https://github.com/Tencent/HunyuanCustom
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
Language: Python
#audio_driven #diffusion_models #image_to_video #image_to_video_generation #video_editing #video_generation
Stars: 360 Issues: 4 Forks: 14
https://github.com/Tencent/HunyuanCustom
GitHub
GitHub - Tencent/HunyuanCustom: HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation - Tencent/HunyuanCustom
❤1
Olow304/memvid
Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.
Language: Python
#ai #context #embedded #faiss #knowledge_base #knowledge_graph #llm #machine_learning #memory #nlp #offline_first #opencv #python #rag #retrieval_augmented_generation #semantic_search #vector_database #video_processing
Stars: 252 Issues: 2 Forks: 25
https://github.com/Olow304/memvid
Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.
Language: Python
#ai #context #embedded #faiss #knowledge_base #knowledge_graph #llm #machine_learning #memory #nlp #offline_first #opencv #python #rag #retrieval_augmented_generation #semantic_search #vector_database #video_processing
Stars: 252 Issues: 2 Forks: 25
https://github.com/Olow304/memvid
GitHub
GitHub - Olow304/memvid: Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic…
Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed. - Olow304/memvid
THUDM/GLM-4.1V-Thinking
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning.
Language: Python
#image2text #reasoning #video_understanding #vlm
Stars: 449 Issues: 9 Forks: 8
https://github.com/THUDM/GLM-4.1V-Thinking
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning.
Language: Python
#image2text #reasoning #video_understanding #vlm
Stars: 449 Issues: 9 Forks: 8
https://github.com/THUDM/GLM-4.1V-Thinking
GitHub
GitHub - THUDM/GLM-4.1V-Thinking: GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning.
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning. - THUDM/GLM-4.1V-Thinking
❤1
liuff19/LangScene-X
[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
Language: Python
#3d_reconstruction #diffusion #unified_model #video_generation
Stars: 197 Issues: 1 Forks: 12
https://github.com/liuff19/LangScene-X
[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
Language: Python
#3d_reconstruction #diffusion #unified_model #video_generation
Stars: 197 Issues: 1 Forks: 12
https://github.com/liuff19/LangScene-X
GitHub
GitHub - liuff19/LangScene-X: [ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video…
[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion - liuff19/LangScene-X
Wan-Video/Wan2.2
Wan: Open and Advanced Large-Scale Video Generative Models
Language: Python
#aigc #video_generation
Stars: 1285 Issues: 21 Forks: 26
https://github.com/Wan-Video/Wan2.2
Wan: Open and Advanced Large-Scale Video Generative Models
Language: Python
#aigc #video_generation
Stars: 1285 Issues: 21 Forks: 26
https://github.com/Wan-Video/Wan2.2
GitHub
GitHub - Wan-Video/Wan2.2: Wan: Open and Advanced Large-Scale Video Generative Models
Wan: Open and Advanced Large-Scale Video Generative Models - Wan-Video/Wan2.2
SkyworkAI/Matrix-3D
Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt.
Language: Python
#3d_generation #3d_reconstruction #3d_scene_generation #aigc #aigc3d #genie #genie3 #graphics #image_to_3d #image_to_video #panorama_synthesis #scene_generation #text_to_3d #text_to_video #video_generation #world_models
Stars: 284 Issues: 7 Forks: 14
https://github.com/SkyworkAI/Matrix-3D
Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt.
Language: Python
#3d_generation #3d_reconstruction #3d_scene_generation #aigc #aigc3d #genie #genie3 #graphics #image_to_3d #image_to_video #panorama_synthesis #scene_generation #text_to_3d #text_to_video #video_generation #world_models
Stars: 284 Issues: 7 Forks: 14
https://github.com/SkyworkAI/Matrix-3D
GitHub
GitHub - SkyworkAI/Matrix-3D: Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or…
Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt. - SkyworkAI/Matrix-3D