r/StableDiffusion

20 views 17:40

r/StableDiffusion

Control FLUX.2 with reference images instead of training a LoRA — demo
https://redd.it/1tjqssg
@rStableDiffusion

20 views 18:40

r/StableDiffusion

SAM3 added to Comfyui-Angelo (sampler/inpainter/refiner)

https://redd.it/1tjp4ir
@rStableDiffusion

8 views 19:40

r/StableDiffusion

decided to actually make stable diffusion

https://redd.it/1tjsv9s
@rStableDiffusion

8 views 20:40

r/StableDiffusion

What happened to Hunyuan?

Hello!

I really liked the Hunyuan model. Did they go closed source for further development?
Any news about that? I think LTX is okay, but the visual quality of Hunyuan sometimes even exceeded Wan 2.2, imo.

Best

https://redd.it/1tjvuvq
@rStableDiffusion

From the StableDiffusion community on Reddit

Explore this post and more from the StableDiffusion community

8 views 21:40

r/StableDiffusion

As someone who can already run most of the larger models (RTX 5090) I'm extremely glad I gave Anima Base a chance

I'll be honest. I didn't expect much from a 2B parameter model. I had initially written it off as being not worth the time simply because I had access to such powerful models with much higher parameter counts. I didn't see how it could possibly outdo what I already had. But wow, they really did one hell of a job on this, and I find that it produces better anime images (with easier prompting) than most of what's out there.

It doesn't suffer from a lot of the NLP problems where you get near identical outputs each time. It reminds me more of the SDXL / Pony era where you could give a general idea of what you wanted with tags (or yes NLP as well) and the model itself would find a way to make it interesting. This is one of those models where you don't even need an LLM to rewrite your prompts. Just give it a general direction and let it go.

The fact that it can understand NLP means it has a lot of the strengths of the older models without the weakness of getting shit confused when a prompt asks for, say, a blue hat, a red hat, and 2 orange hats.

https://redd.it/1tjymfl
@rStableDiffusion

8 views 22:40

r/StableDiffusion

LTX 2.3 growing frustration

I have been defending LTX and had moved away from Wan 2.2 since LTX 2.3 came out. Now that I am trying to create a short narrative film, I'm getting very frustrated with LTX's inability to follow prompt directions. For example, I have a shot of two men next to each other, and all I want is for the camera to zoom in on one of the men as he talks. LTX keeps giving me a pullout or zoom out instead of a zoom in. No matter how I prompt for it, it just won't do it. Should something as simple as that shot be so difficult to achieve? And I have used different workflows, for example the new LTX director that has the prompt relay embedded.

Does anyone else get frustrated with this model?

https://redd.it/1tjtdi5
@rStableDiffusion

8 views 23:40

Update from comfy-flow.com, I made a plugin for comfyui

https://redd.it/1tk4kod
@rStableDiffusion

8 views 02:40

r/StableDiffusion

Microsoft Lens seems to be back.
https://huggingface.co/microsoft/Lens-Turbo

https://redd.it/1tkajke
@rStableDiffusion
