GitHub repos

umlx5h/LLPlayer
The media player for language learning, with dual subtitles, AI-generated subtitles, real-time OCR, translation, word lookup, and more!
Language: C#
#asr #csharp #flyleaf #language_learning #media_player #ocr #player #tesseract #video #video_player #whisper #wpf #yt_dlp
Stars: 253 Issues: 5 Forks: 4
https://github.com/umlx5h/LLPlayer

FoundationVision/FlashVideo
FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
Language: Python
#efficient_generative_model #text_to_video #video_generation
Stars: 195 Issues: 5 Forks: 3
https://github.com/FoundationVision/FlashVideo

SkyworkAI/SkyReels-V1
SkyReels V1: the first and most advanced open-source human-centric video foundation model
Language: Python
#i2v #t2v #video_diffusion_transformers
Stars: 348 Issues: 5 Forks: 20
https://github.com/SkyworkAI/SkyReels-V1

liuff19/Video-T1
[ICCV 2025] Official Implementation of Video-T1: Test-Time Scaling for Video Generation
Language: Python
#aigc #chain_of_thought #test_time_scaling #video #video_generation
Stars: 187 Issues: 2 Forks: 12
https://github.com/liuff19/Video-T1

TencentARC/GeometryCrafter
[ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Language: Python
#depth_estimation #video_to_4d
Stars: 173 Issues: 0 Forks: 3
https://github.com/TencentARC/GeometryCrafter

hanyang-21/VideoScene
[CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step
Language: Python
#3d_reconstruction #video #video_generation
Stars: 154 Issues: 4 Forks: 3
https://github.com/hanyang-21/VideoScene

ali-vilab/UniAnimate-DiT
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
Language: Python
#human_image_animation #video_diffusion_transformers #video_generation
Stars: 225 Issues: 5 Forks: 17
https://github.com/ali-vilab/UniAnimate-DiT

SandAI-org/MAGI-1
MAGI-1: Autoregressive Video Generation at Scale
Language: Python
#autoregressive #diffusion_models #video_generation
Stars: 911 Issues: 7 Forks: 32
https://github.com/SandAI-org/MAGI-1

Tencent/HunyuanCustom
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
Language: Python
#audio_driven #diffusion_models #image_to_video #image_to_video_generation #video_editing #video_generation
Stars: 360 Issues: 4 Forks: 14
https://github.com/Tencent/HunyuanCustom

Olow304/memvid
Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.
Language: Python
#ai #context #embedded #faiss #knowledge_base #knowledge_graph #llm #machine_learning #memory #nlp #offline_first #opencv #python #rag #retrieval_augmented_generation #semantic_search #vector_database #video_processing
Stars: 252 Issues: 2 Forks: 25
https://github.com/Olow304/memvid

THUDM/GLM-4.1V-Thinking
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning.
Language: Python
#image2text #reasoning #video_understanding #vlm
Stars: 449 Issues: 9 Forks: 8
https://github.com/THUDM/GLM-4.1V-Thinking

liuff19/LangScene-X
[ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion
Language: Python
#3d_reconstruction #diffusion #unified_model #video_generation
Stars: 197 Issues: 1 Forks: 12
https://github.com/liuff19/LangScene-X

Wan-Video/Wan2.2
Wan: Open and Advanced Large-Scale Video Generative Models
Language: Python
#aigc #video_generation
Stars: 1285 Issues: 21 Forks: 26
https://github.com/Wan-Video/Wan2.2

SkyworkAI/Matrix-3D
Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt.
Language: Python
#3d_generation #3d_reconstruction #3d_scene_generation #aigc #aigc3d #genie #genie3 #graphics #image_to_3d #image_to_video #panorama_synthesis #scene_generation #text_to_3d #text_to_video #video_generation #world_models
Stars: 284 Issues: 7 Forks: 14
https://github.com/SkyworkAI/Matrix-3D
