This media is not supported in your browser
VIEW IN TELEGRAM
Control FLUX.2 with reference images instead of training a LoRA — demo
https://redd.it/1tjqssg
@rStableDiffusion
https://redd.it/1tjqssg
@rStableDiffusion
What happened to Hunyuan?
Hello!
I really liked the hunyuan model, did they go closed sources with further developments?
Any news about that? I think ltx is okay, but the visual quality of hunyuan sometimes even exceeded wan2.2, imo.
Best
https://redd.it/1tjvuvq
@rStableDiffusion
Hello!
I really liked the hunyuan model, did they go closed sources with further developments?
Any news about that? I think ltx is okay, but the visual quality of hunyuan sometimes even exceeded wan2.2, imo.
Best
https://redd.it/1tjvuvq
@rStableDiffusion
Reddit
From the StableDiffusion community on Reddit
Explore this post and more from the StableDiffusion community
As someone who can already run most of the larger models (RTX 5090) I'm extremely glad I gave Anima Base a chance
I'll be honest. I didn't expect much from a 2B parameter model. I had initially written it off as being not worth the time simply because I had access to such powerful models with much higher parameter counts. I didn't see how it could possibly outdo what I already had. But wow, they really did one hell of a job on this, and I find that it produces better anime images (with easier prompting) than most of what's out there.
It doesn't suffer from a lot of the NLP problems where you get near identical outputs each time. It reminds me more of the SDXL / Pony era where you could give a general idea of what you wanted with tags (or yes NLP as well) and the model itself would find a way to make it interesting. This is one of those models where you don't even need an LLM to rewrite your prompts. Just give it a general direction and let it go.
The fact that it can understand NLP means it has a lot of the strengths of the older models without the weakness of getting shit confused. Like a blue hat and a red hat and 2 orange hats.
https://redd.it/1tjymfl
@rStableDiffusion
I'll be honest. I didn't expect much from a 2B parameter model. I had initially written it off as being not worth the time simply because I had access to such powerful models with much higher parameter counts. I didn't see how it could possibly outdo what I already had. But wow, they really did one hell of a job on this, and I find that it produces better anime images (with easier prompting) than most of what's out there.
It doesn't suffer from a lot of the NLP problems where you get near identical outputs each time. It reminds me more of the SDXL / Pony era where you could give a general idea of what you wanted with tags (or yes NLP as well) and the model itself would find a way to make it interesting. This is one of those models where you don't even need an LLM to rewrite your prompts. Just give it a general direction and let it go.
The fact that it can understand NLP means it has a lot of the strengths of the older models without the weakness of getting shit confused. Like a blue hat and a red hat and 2 orange hats.
https://redd.it/1tjymfl
@rStableDiffusion
Reddit
From the StableDiffusion community on Reddit
Explore this post and more from the StableDiffusion community
LTX 2.3 growing frustration
I have been defending LTX and had moved away from Wan 2.2 since LTX 2.3 came out. Now that I am trying to create a short narrative film I'm getting very frustrated with ltx's inability to follow prompt directions. For example shot of two estimate next to each other and all I want is for the camera to zoom in on one of the men as he talks. LTX keeps giving me a pullout or zoom out instead of a zoom in. Mo matter how I prompt for it it just won't do it. Should something so simple like that shot be so difficult to achieve. And I have used different workflows for example the new LTX director that has the prompt relay embedded.
Anyone else gets frustrated with this model.
https://redd.it/1tjtdi5
@rStableDiffusion
I have been defending LTX and had moved away from Wan 2.2 since LTX 2.3 came out. Now that I am trying to create a short narrative film I'm getting very frustrated with ltx's inability to follow prompt directions. For example shot of two estimate next to each other and all I want is for the camera to zoom in on one of the men as he talks. LTX keeps giving me a pullout or zoom out instead of a zoom in. Mo matter how I prompt for it it just won't do it. Should something so simple like that shot be so difficult to achieve. And I have used different workflows for example the new LTX director that has the prompt relay embedded.
Anyone else gets frustrated with this model.
https://redd.it/1tjtdi5
@rStableDiffusion
Reddit
From the StableDiffusion community on Reddit
Explore this post and more from the StableDiffusion community