Danbooru Dataset Filter: Fast local metadata-based search across 10M+ images for LoRA/Checkpoint training
https://redd.it/1sl8cqi
@rStableDiffusion
https://redd.it/1sl8cqi
@rStableDiffusion
"Necromancy" Short AI Animation (Wan 2.2 Text2video)
https://youtu.be/zsjtLSh0xVQ
https://redd.it/1slduvn
@rStableDiffusion
https://youtu.be/zsjtLSh0xVQ
https://redd.it/1slduvn
@rStableDiffusion
YouTube
183 | "Necromancy" | Local AI Animation (Wan 2.2 Text2video) [4K]
"Necromancy" Local AI Animation
Dive into a chaotic battle scene with this short AI Animation, depicting a desperate struggle for survival in a war-torn environment. Watch as characters, including skeletons, engage in intense combat with various weapons…
Dive into a chaotic battle scene with this short AI Animation, depicting a desperate struggle for survival in a war-torn environment. Watch as characters, including skeletons, engage in intense combat with various weapons…
We may have a new SOTA open-source model: ERNIE-Image Comparisons
https://redd.it/1slg4wh
@rStableDiffusion
https://redd.it/1slg4wh
@rStableDiffusion
Reddit
From the StableDiffusion community on Reddit: We may have a new SOTA open-source model: ERNIE-Image Comparisons
Explore this post and more from the StableDiffusion community
Ostris AI Toolkit has day zero support for training LoRAs on top of Baidu's ERNIE-Image
https://redd.it/1slivar
@rStableDiffusion
https://redd.it/1slivar
@rStableDiffusion
Media is too big
VIEW IN TELEGRAM
Tencent HY-World 2.0 appears to be dropping on April 15 — open-source multimodal 3D world generation from Tencent Hunyuan
https://redd.it/1sll638
@rStableDiffusion
https://redd.it/1sll638
@rStableDiffusion
New LTX model soon
https:\/\/x.com\/ltx\_model\/status\/2044110661488132371
link to their new paper too: https://doi.org/10.48550/arXiv.2604.11788
https://redd.it/1slh5rq
@rStableDiffusion
https:\/\/x.com\/ltx\_model\/status\/2044110661488132371
link to their new paper too: https://doi.org/10.48550/arXiv.2604.11788
https://redd.it/1slh5rq
@rStableDiffusion
Nucleus-Image Released
https://huggingface.co/NucleusAI/Nucleus-Image
Nucleus-Image is a text-to-image generation model built on a sparse mixture-of-experts (MoE) diffusion transformer architecture. It scales to 17B total parameters across 64 routed experts per layer while activating only \~2B parameters per forward pass, establishing a new Pareto frontier in quality-versus-efficiency. Nucleus-Image matches or exceeds leading models including Qwen-Image, GPT Image 1, Seedream 3.0, and Imagen4 on GenEval, DPG-Bench, and OneIG-Bench. This is a base model released without any post-training optimization (no DPO, no reinforcement learning, no human preference tuning). All reported results reflect pre-training performance only. We release the full model weights, training code, and dataset, making Nucleus-Image the first fully open-source MoE diffusion model at this quality tier.
https://redd.it/1slpfch
@rStableDiffusion
https://huggingface.co/NucleusAI/Nucleus-Image
Nucleus-Image is a text-to-image generation model built on a sparse mixture-of-experts (MoE) diffusion transformer architecture. It scales to 17B total parameters across 64 routed experts per layer while activating only \~2B parameters per forward pass, establishing a new Pareto frontier in quality-versus-efficiency. Nucleus-Image matches or exceeds leading models including Qwen-Image, GPT Image 1, Seedream 3.0, and Imagen4 on GenEval, DPG-Bench, and OneIG-Bench. This is a base model released without any post-training optimization (no DPO, no reinforcement learning, no human preference tuning). All reported results reflect pre-training performance only. We release the full model weights, training code, and dataset, making Nucleus-Image the first fully open-source MoE diffusion model at this quality tier.
https://redd.it/1slpfch
@rStableDiffusion
Reddit
From the StableDiffusion community on Reddit: Nucleus-Image Released
Explore this post and more from the StableDiffusion community