mihirp1998/AlignProp
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion
Language: Python
#alignment #diffusion_models #reinforcement_learning #stable_diffusion #text_to_image
Stars: 104 Issues: 4 Forks: 1
https://github.com/mihirp1998/AlignProp
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion
Language: Python
#alignment #diffusion_models #reinforcement_learning #stable_diffusion #text_to_image
Stars: 104 Issues: 4 Forks: 1
https://github.com/mihirp1998/AlignProp
GitHub
GitHub - mihirp1998/AlignProp: AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion…
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods...
hustvl/GaussianDreamer
GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors
Language: Python
#aigc #computer_vision #diffusion_models #dreamfusion #gaussian_splatting #nerf #radiance_field #text_to_3d
Stars: 134 Issues: 0 Forks: 1
https://github.com/hustvl/GaussianDreamer
GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors
Language: Python
#aigc #computer_vision #diffusion_models #dreamfusion #gaussian_splatting #nerf #radiance_field #text_to_3d
Stars: 134 Issues: 0 Forks: 1
https://github.com/hustvl/GaussianDreamer
GitHub
GitHub - hustvl/GaussianDreamer: [CVPR 2024] GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion…
[CVPR 2024] GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models - hustvl/GaussianDreamer
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python
#ai #deep_learning #emotion #emotivoice #multi_speaker #prompt #python #pytorch #speech #speech_synthesis #style #text_to_speech #tts
Stars: 432 Issues: 3 Forks: 38
https://github.com/netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python
#ai #deep_learning #emotion #emotivoice #multi_speaker #prompt #python #pytorch #speech #speech_synthesis #style #text_to_speech #tts
Stars: 432 Issues: 3 Forks: 38
https://github.com/netease-youdao/EmotiVoice
GitHub
GitHub - netease-youdao/EmotiVoice: EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine - netease-youdao/EmotiVoice
baaivision/GeoDream
GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation
Language: Python
#3d #3d_aigc #3d_generation #text_to_3d
Stars: 244 Issues: 1 Forks: 4
https://github.com/baaivision/GeoDream
GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation
Language: Python
#3d #3d_aigc #3d_generation #text_to_3d
Stars: 244 Issues: 1 Forks: 4
https://github.com/baaivision/GeoDream
GitHub
GitHub - baaivision/GeoDream: GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation
GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation - baaivision/GeoDream
TianxingWu/FreeInit
FreeInit: Bridging Initialization Gap in Video Diffusion Models
Language: Python
#aigc #text_to_video #video_diffusion_model #video_generation
Stars: 162 Issues: 4 Forks: 7
https://github.com/TianxingWu/FreeInit
FreeInit: Bridging Initialization Gap in Video Diffusion Models
Language: Python
#aigc #text_to_video #video_diffusion_model #video_generation
Stars: 162 Issues: 4 Forks: 7
https://github.com/TianxingWu/FreeInit
GitHub
GitHub - TianxingWu/FreeInit: [ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models - TianxingWu/FreeInit
YangLing0818/RPG-DiffusionMaster
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
Language: Python
#image_editing #large_language_models #multimodal_large_language_models #text_to_image_diffusion
Stars: 272 Issues: 5 Forks: 14
https://github.com/YangLing0818/RPG-DiffusionMaster
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
Language: Python
#image_editing #large_language_models #multimodal_large_language_models #text_to_image_diffusion
Stars: 272 Issues: 5 Forks: 14
https://github.com/YangLing0818/RPG-DiffusionMaster
GitHub
GitHub - YangLing0818/RPG-DiffusionMaster: [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating…
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) - YangLing0818/RPG-DiffusionMaster
reqable/re-editor
Re-Editor is a powerful lightweight text and code editor widget.
Language: Dart
#code_editor #flutter #syntax_highlighting #text_editor
Stars: 315 Issues: 0 Forks: 18
https://github.com/reqable/re-editor
Re-Editor is a powerful lightweight text and code editor widget.
Language: Dart
#code_editor #flutter #syntax_highlighting #text_editor
Stars: 315 Issues: 0 Forks: 18
https://github.com/reqable/re-editor
GitHub
GitHub - reqable/re-editor: Re-Editor is a powerful lightweight text and code editor widget.
Re-Editor is a powerful lightweight text and code editor widget. - reqable/re-editor
3DTopia/LGM
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
Language: Python
#gaussian_splatting #image_to_3d #text_to_3d
Stars: 308 Issues: 7 Forks: 15
https://github.com/3DTopia/LGM
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
Language: Python
#gaussian_splatting #image_to_3d #text_to_3d
Stars: 308 Issues: 7 Forks: 15
https://github.com/3DTopia/LGM
GitHub
GitHub - 3DTopia/LGM: [ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.
[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation. - 3DTopia/LGM
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Language: Python
#diffusion_models #long_video_generation #metamorphic_video_generation #open_sora_plan #text_to_video #time_lapse #time_lapse_dataset #video_generation
Stars: 281 Issues: 4 Forks: 16
https://github.com/PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
Language: Python
#diffusion_models #long_video_generation #metamorphic_video_generation #open_sora_plan #text_to_video #time_lapse #time_lapse_dataset #video_generation
Stars: 281 Issues: 4 Forks: 16
https://github.com/PKU-YuanGroup/MagicTime
GitHub
GitHub - PKU-YuanGroup/MagicTime: [TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators - PKU-YuanGroup/MagicTime
AdityaNG/kan-gpt
The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling
Language: Python
#gpt #kanformers #kolmogorov_arnold_networks #kolmogorov_arnold_representation #llm #text_generation #transformers
Stars: 217 Issues: 2 Forks: 11
https://github.com/AdityaNG/kan-gpt
The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling
Language: Python
#gpt #kanformers #kolmogorov_arnold_networks #kolmogorov_arnold_representation #llm #text_generation #transformers
Stars: 217 Issues: 2 Forks: 11
https://github.com/AdityaNG/kan-gpt
GitHub
GitHub - AdityaNG/kan-gpt: The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks…
The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold Networks (KANs) for language modeling - AdityaNG/kan-gpt
jishengpeng/WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Language: Python
#acoustic #audio_representation #codec #dac #encodec #gpt4o #music_representation_learning #semantic #soundstream #speech_language_model #speech_representation #text_to_speech
Stars: 332 Issues: 6 Forks: 20
https://github.com/jishengpeng/WavTokenizer
SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Language: Python
#acoustic #audio_representation #codec #dac #encodec #gpt4o #music_representation_learning #semantic #soundstream #speech_language_model #speech_representation #text_to_speech
Stars: 332 Issues: 6 Forks: 20
https://github.com/jishengpeng/WavTokenizer
GitHub
GitHub - jishengpeng/WavTokenizer: [ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language…
[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling - GitHub - jishengpeng/WavTokenizer: [ICLR 2025] SOTA discrete acoustic codec models with 4...
lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
Language: Python
#diffusion_transformer #flow_matching #mlx #text_to_speech #tts
Stars: 193 Issues: 2 Forks: 17
https://github.com/lucasnewman/f5-tts-mlx
Implementation of F5-TTS in MLX
Language: Python
#diffusion_transformer #flow_matching #mlx #text_to_speech #tts
Stars: 193 Issues: 2 Forks: 17
https://github.com/lucasnewman/f5-tts-mlx
GitHub
GitHub - lucasnewman/f5-tts-mlx: Implementation of F5-TTS in MLX
Implementation of F5-TTS in MLX. Contribute to lucasnewman/f5-tts-mlx development by creating an account on GitHub.
edwko/OuteTTS
Interface for OuteTTS models.
Language: Python
#gguf #llama #text_to_speech #transformers #tts
Stars: 278 Issues: 6 Forks: 13
https://github.com/edwko/OuteTTS
Interface for OuteTTS models.
Language: Python
#gguf #llama #text_to_speech #transformers #tts
Stars: 278 Issues: 6 Forks: 13
https://github.com/edwko/OuteTTS
GitHub
GitHub - edwko/OuteTTS: Interface for OuteTTS models.
Interface for OuteTTS models. Contribute to edwko/OuteTTS development by creating an account on GitHub.
Lightricks/LTX-Video
Official repository for LTX-Video
Language: Python
#diffusion_models #dit #image_to_video #image_to_video_generation #text_to_video #text_to_video_generation
Stars: 241 Issues: 8 Forks: 9
https://github.com/Lightricks/LTX-Video
Official repository for LTX-Video
Language: Python
#diffusion_models #dit #image_to_video #image_to_video_generation #text_to_video #text_to_video_generation
Stars: 241 Issues: 8 Forks: 9
https://github.com/Lightricks/LTX-Video
GitHub
GitHub - Lightricks/LTX-Video: Official repository for LTX-Video
Official repository for LTX-Video. Contribute to Lightricks/LTX-Video development by creating an account on GitHub.
Lightricks/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
Language: Python
#comfyui #diffusion_models #dit #image_to_video #image_to_video_generation #text_to_image #text_to_image_generation
Stars: 217 Issues: 18 Forks: 9
https://github.com/Lightricks/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
Language: Python
#comfyui #diffusion_models #dit #image_to_video #image_to_video_generation #text_to_image #text_to_image_generation
Stars: 217 Issues: 18 Forks: 9
https://github.com/Lightricks/ComfyUI-LTXVideo
GitHub
GitHub - Lightricks/ComfyUI-LTXVideo: LTX-Video Support for ComfyUI
LTX-Video Support for ComfyUI. Contribute to Lightricks/ComfyUI-LTXVideo development by creating an account on GitHub.
declare-lab/TangoFlux
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
Language: Jupyter Notebook
#flow_matching #generative_ai #text_to_audio #text_to_audio_ai #tta
Stars: 152 Issues: 2 Forks: 13
https://github.com/declare-lab/TangoFlux
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
Language: Jupyter Notebook
#flow_matching #generative_ai #text_to_audio #text_to_audio_ai #tta
Stars: 152 Issues: 2 Forks: 13
https://github.com/declare-lab/TangoFlux
GitHub
GitHub - declare-lab/TangoFlux: TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching - declare-lab/TangoFlux
FoundationVision/FlashVideo
FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
Language: Python
#efficient_generative_model #text_to_video #video_generation
Stars: 195 Issues: 5 Forks: 3
https://github.com/FoundationVision/FlashVideo
FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
Language: Python
#efficient_generative_model #text_to_video #video_generation
Stars: 195 Issues: 5 Forks: 3
https://github.com/FoundationVision/FlashVideo
GitHub
GitHub - FoundationVision/FlashVideo: FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation - FoundationVision/FlashVideo
isaiahbjork/orpheus-tts-local
Run Orpheus 3B Locally With LM Studio
Language: Python
#ai #python #text_to_speech #tts
Stars: 263 Issues: 15 Forks: 50
https://github.com/isaiahbjork/orpheus-tts-local
Run Orpheus 3B Locally With LM Studio
Language: Python
#ai #python #text_to_speech #tts
Stars: 263 Issues: 15 Forks: 50
https://github.com/isaiahbjork/orpheus-tts-local
GitHub
GitHub - isaiahbjork/orpheus-tts-local: Run Orpheus 3B Locally With LM Studio
Run Orpheus 3B Locally With LM Studio. Contribute to isaiahbjork/orpheus-tts-local development by creating an account on GitHub.
zobweyt/textcase
A feature-rich Python text case conversion library
Language: Python
#camel_case #case #constant_case #conversion #foss #just #kebab_case #lower_case #mypy #nix #pascal_case #pypi #pytest #python #ruff #sentence_case #snake_case #text #title_case #upper_case
Stars: 165 Issues: 2 Forks: 0
https://github.com/zobweyt/textcase
A feature-rich Python text case conversion library
Language: Python
#camel_case #case #constant_case #conversion #foss #just #kebab_case #lower_case #mypy #nix #pascal_case #pypi #pytest #python #ruff #sentence_case #snake_case #text #title_case #upper_case
Stars: 165 Issues: 2 Forks: 0
https://github.com/zobweyt/textcase
GitHub
GitHub - zobweyt/textcase: Python library for text case conversions
Python library for text case conversions. Contribute to zobweyt/textcase development by creating an account on GitHub.
mirth/chonky
Fully neural approach for text chunking
Language: Python
#ai #chunking #llms #ml #rag #semantic_chunking #text_splitter
Stars: 232 Issues: 0 Forks: 6
https://github.com/mirth/chonky
Fully neural approach for text chunking
Language: Python
#ai #chunking #llms #ml #rag #semantic_chunking #text_splitter
Stars: 232 Issues: 0 Forks: 6
https://github.com/mirth/chonky
GitHub
GitHub - mirth/chonky: Fully neural approach for text chunking
Fully neural approach for text chunking. Contribute to mirth/chonky development by creating an account on GitHub.