encoder-only version of T5-XL

Kinda old tech by now, but I figure it still deserves an announcement...

I just made an "encoder-only" slimmed down version of the T5-XL text encoder model.

Use it with:

from transformers import T5EncoderModel

encoder = T5EncoderModel.from_pretrained("opendiffusionai/t5-v1_1-xl-encoder-only")


I had previously found that a version of T5-XXL is available in encoder-only form. But surprisingly, not T5-XL.

This may be important to folks building their own models, because while T5-XXL outputs 4096-dim embeddings, T5-XL outputs 2048-dim embeddings.
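If you want to sanity-check the embedding width yourself, here is a minimal sketch (the google/t5-v1_1-xl tokenizer checkpoint and the example prompt are my own assumptions, not part of the release):

import torch
from transformers import AutoTokenizer, T5EncoderModel

# Tokenizer from the stock Google checkpoint (assumed compatible);
# encoder weights from the encoder-only repo above.
tokenizer = AutoTokenizer.from_pretrained("google/t5-v1_1-xl")
encoder = T5EncoderModel.from_pretrained("opendiffusionai/t5-v1_1-xl-encoder-only")

inputs = tokenizer("a photo of a cat", return_tensors="pt")
with torch.no_grad():
    out = encoder(**inputs)

# T5-XL's hidden size is 2048, so this prints torch.Size([1, <seq_len>, 2048]);
# T5-XXL would give 4096 in the last dimension.
print(out.last_hidden_state.shape)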

And unlike many other models... T5 has an Apache 2.0 license.

Fair warning: the T5-XL model itself is also smaller, roughly 3B params vs 11B for T5-XXL. But if you want it, it is now available as above.



https://redd.it/1lbquj7
@rStableDiffusion
Chroma V37 is out (+ detail calibrated)
https://redd.it/1lbvooi
@rStableDiffusion
Best Open Source Model for text to video generation?

Hey. When I looked it up, the last time this question was asked on the subreddit was 2 months ago. Since the space is fast-moving, I thought it was appropriate to ask again.

What is the best open-source text-to-video model currently? The opinion from the last post on this subject was that it's Wan 2.1. What do you think?

https://redd.it/1lbw9e2
@rStableDiffusion
Where is FLUX.1 Kontext dev?

Did I miss the "open" weights version, or did they forget to release it? I understand we are not entitled to anything, and they can simply not release it at all if they don't want to; that's fine by me. But when you announce it is coming "soon" and two weeks later there is no model, I feel the community is being used to hype closed models for free.

And no, being able to use an API through a node/app is not local. It is online generation with hidden/extra steps.

https://redd.it/1lbxjrr
@rStableDiffusion
Wan 2.1 LoRAs working with Self Forcing DMT would be something incredible

I have been absolutely losing sleep the last day playing with Self Forcing DMT. This thing is beyond amazing, and major respect to the creator. I quickly gave up trying to figure out how to use LoRAs with it. I am hoping (and praying) somebody here on Reddit is trying to figure out how to do this. I am not sure which Wan model Self Forcing is trained on (I'm guessing the 1.3B one). If anybody here has the scoop on this becoming possible soon, or if I just missed the boat and it's already possible, please spill the beans.

https://redd.it/1lc0pab
@rStableDiffusion
Any interest in a ComfyUI for dummies? (web/mobile app)

Hey everyone! I am tinkering with GiraffeDesigner. The TL;DR is "ComfyUI for dummies" that works pretty well on web and mobile.

Gemini is free to use; for OpenAI and fal.ai you can just insert your API key.

Curious whether the community finds this interesting. What features would you like to see? I plan to keep the core product free; any feedback appreciated :)

https://redd.it/1lc5qbf
@rStableDiffusion
SD 3.5 is apparently fast now, good for SFW images?

With the recent announcements about SD 3.5 getting a speed boost and lower memory requirements on new Nvidia cards, is it worth looking into for SFW gens? I know this community was down on it, but is there any upside now that the faster / bigger models are more accessible?

https://redd.it/1lcaj3w
@rStableDiffusion
Vace FusionX + background img + reference img + controlnet + 20 x (video extension with Vace FusionX + reference img). Just to see what would happen...

https://redd.it/1lccl41
@rStableDiffusion