XavierXiao/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
Language: Jupyter Notebook
#pytorch #pytorch_lightning #stable_diffusion #text_to_image
Stars: 141 Issues: 4 Forks: 9
https://github.com/XavierXiao/Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
Language: Jupyter Notebook
#pytorch #pytorch_lightning #stable_diffusion #text_to_image
Stars: 141 Issues: 4 Forks: 9
https://github.com/XavierXiao/Dreambooth-Stable-Diffusion
arXiv.org
DreamBooth: Fine Tuning Text-to-Image Diffusion Models for...
Large text-to-image models achieved a remarkable leap in the evolution of AI, enabling high-quality and diverse synthesis of images from a given text prompt. However, these models lack the ability...
YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
#diffusion_models #generative_adversarial_network #generative_model #image_to_image_translation #stable_diffusion #survey #text_to_image #vae
Stars: 190 Issues: 0 Forks: 14
https://github.com/YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
#diffusion_models #generative_adversarial_network #generative_model #image_to_image_translation #stable_diffusion #survey #text_to_image #vae
Stars: 190 Issues: 0 Forks: 14
https://github.com/YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy
GitHub
GitHub - YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy: Diffusion model papers, survey, and taxonomy
Diffusion model papers, survey, and taxonomy. Contribute to YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy development by creating an account on GitHub.
jaketae/storyteller
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
Language: Python
#gpt #image_generation #pytorch #stable_diffusion #text_to_image #text_to_speech #text_to_video #video_generation
Stars: 119 Issues: 1 Forks: 6
https://github.com/jaketae/storyteller
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
Language: Python
#gpt #image_generation #pytorch #stable_diffusion #text_to_image #text_to_speech #text_to_video #video_generation
Stars: 119 Issues: 1 Forks: 6
https://github.com/jaketae/storyteller
GitHub
GitHub - jaketae/storyteller: Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech
Multimodal AI Story Teller, built with Stable Diffusion, GPT, and neural text-to-speech - jaketae/storyteller
lucidrains/muse-maskgit-pytorch
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
Language: Python
#artificial_intelligence #attention_mechanisms #deep_learning #text_to_image #transformers
Stars: 119 Issues: 1 Forks: 6
https://github.com/lucidrains/muse-maskgit-pytorch
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
Language: Python
#artificial_intelligence #attention_mechanisms #deep_learning #text_to_image #transformers
Stars: 119 Issues: 1 Forks: 6
https://github.com/lucidrains/muse-maskgit-pytorch
GitHub
GitHub - lucidrains/muse-maskgit-pytorch: Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers,…
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch - lucidrains/muse-maskgit-pytorch
lukasHoel/text2room
Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models.
Language: Python
#3d_generation #diffusion_models #mesh_generation #text_to_image
Stars: 426 Issues: 1 Forks: 22
https://github.com/lukasHoel/text2room
Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models.
Language: Python
#3d_generation #diffusion_models #mesh_generation #text_to_image
Stars: 426 Issues: 1 Forks: 22
https://github.com/lukasHoel/text2room
GitHub
GitHub - lukasHoel/text2room: Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).
Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023). - lukasHoel/text2room
omerbt/TokenFlow
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow"
#stable_diffusion #text_to_image #text_to_video #tokenflow #video_editing
Stars: 310 Issues: 4 Forks: 13
https://github.com/omerbt/TokenFlow
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow"
#stable_diffusion #text_to_image #text_to_video #tokenflow #video_editing
Stars: 310 Issues: 4 Forks: 13
https://github.com/omerbt/TokenFlow
GitHub
GitHub - omerbt/TokenFlow: Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing"…
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024) - omerbt/TokenFlow
google/break-a-scene
Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023]
Language: Python
#deep_learning #diffusion_models #generative_ai #multimodal #text_to_image
Stars: 164 Issues: 1 Forks: 4
https://github.com/google/break-a-scene
Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023]
Language: Python
#deep_learning #diffusion_models #generative_ai #multimodal #text_to_image
Stars: 164 Issues: 1 Forks: 4
https://github.com/google/break-a-scene
GitHub
GitHub - google/break-a-scene: Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH…
Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023] - google/break-a-scene
mihirp1998/AlignProp
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion
Language: Python
#alignment #diffusion_models #reinforcement_learning #stable_diffusion #text_to_image
Stars: 104 Issues: 4 Forks: 1
https://github.com/mihirp1998/AlignProp
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods (PPO) for finetuning Stable Diffusion
Language: Python
#alignment #diffusion_models #reinforcement_learning #stable_diffusion #text_to_image
Stars: 104 Issues: 4 Forks: 1
https://github.com/mihirp1998/AlignProp
GitHub
GitHub - mihirp1998/AlignProp: AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion…
AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods...
YangLing0818/RPG-DiffusionMaster
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
Language: Python
#image_editing #large_language_models #multimodal_large_language_models #text_to_image_diffusion
Stars: 272 Issues: 5 Forks: 14
https://github.com/YangLing0818/RPG-DiffusionMaster
Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
Language: Python
#image_editing #large_language_models #multimodal_large_language_models #text_to_image_diffusion
Stars: 272 Issues: 5 Forks: 14
https://github.com/YangLing0818/RPG-DiffusionMaster
GitHub
GitHub - YangLing0818/RPG-DiffusionMaster: [ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating…
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (RPG) - YangLing0818/RPG-DiffusionMaster
Lightricks/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
Language: Python
#comfyui #diffusion_models #dit #image_to_video #image_to_video_generation #text_to_image #text_to_image_generation
Stars: 217 Issues: 18 Forks: 9
https://github.com/Lightricks/ComfyUI-LTXVideo
LTX-Video Support for ComfyUI
Language: Python
#comfyui #diffusion_models #dit #image_to_video #image_to_video_generation #text_to_image #text_to_image_generation
Stars: 217 Issues: 18 Forks: 9
https://github.com/Lightricks/ComfyUI-LTXVideo
GitHub
GitHub - Lightricks/ComfyUI-LTXVideo: LTX-Video Support for ComfyUI
LTX-Video Support for ComfyUI. Contribute to Lightricks/ComfyUI-LTXVideo development by creating an account on GitHub.