r/StableDiffusion

7 views, 21:40

LTX-2.3 PolarQuant Q5: 88% size reduction, near lossless quality (Cosine Similarity: 0.9986).
https://redd.it/1t7mhaw
@rStableDiffusion

7 views, 22:40

r/StableDiffusion

Flux.2-Klein pipeline for real-time webcam stream processing at 30 FPS

https://redd.it/1t7nd7e
@rStableDiffusion

7 views, 23:40

r/StableDiffusion

3 years of training with AI tools finally put to use

I have learned so much from this community, and I want to thank everyone who has contributed endlessly to this subreddit. Two other AI users and I teamed up to make children's music videos. Here are some of the clips made with WAN22. Not everything on the YouTube channel is open source, so I won't post the link here unless it's requested. These were all made with a standard WAN22 FFLF workflow, which I have tweaked over the years.

The one thing I realized along the way is that WAN can do some amazing things; it's all in the prompt. Block transitions, crash zooms, pans, dollies, tilts, rotations: it can pretty much do it all.

Here is the workflow for the first video.

https://reddit.com/link/1t7nqgz/video/8dsi4qysuzzg1/player

https://reddit.com/link/1t7nqgz/video/01c16z8tuzzg1/player

https://reddit.com/link/1t7nqgz/video/0tz5363vuzzg1/player

https://reddit.com/link/1t7nqgz/video/n1guckfxuzzg1/player

https://reddit.com/link/1t7nqgz/video/plda65pxuzzg1/player

https://redd.it/1t7nqgz
@rStableDiffusion

Pastebin

{ "id": "f522eabf-4924-41b4-a7ce-ed9bcdcb53f4", "revision": 0, "last_no - Pastebin.com

8 views, 00:40

r/StableDiffusion

Another video from LTX-2.3 Distilled

https://redd.it/1t7kyn0
@rStableDiffusion

7 views, 01:40

r/StableDiffusion

LTX 2.3 Sulphur vs 10Eros

For those who have tried these models: which one do you prefer, and why? What strengths and weaknesses have you found with each?

https://redd.it/1t7os5i
@rStableDiffusion

7 views, 03:40

r/StableDiffusion

Wan SCAIL Pose Control Workflow
https://redd.it/1t7p7pz
@rStableDiffusion

7 views, 04:40

r/StableDiffusion

HiDream-O1-Image - A pixel-space model, no need for VAE, 8B parameters.

https://redd.it/1t7v9fy
@rStableDiffusion

6 views, 05:40

r/StableDiffusion

IMG Dataset Refiner v4.0 Pro - The Ultimate Dataset Engineering Suite for LoRAs (Flux, SDXL, etc...)

https://redd.it/1t7ttp0
@rStableDiffusion

8 views, 06:40

r/StableDiffusion

Flux.2Klein Best open source image edit - work in progress

https://redd.it/1t7xue6
@rStableDiffusion

7 views, 07:40

r/StableDiffusion

Why did we move away from booru tags?

I’m obviously wrong for this opinion, but I believe booru tags are a far better descriptor of a visual medium than natural language. Simply listing the contents of an image is far clearer than “the light dramatically plays against blah blah”, which I think is just subjective and abstruse.

Most new models now use massive text encoders, which are excellent for understanding, but there are too many ways to describe an image in natural language.

The same goes for video: we could have timestamped tags describing scenes in a comma-separated, booru-style format. That removes ambiguity.
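As a rough sketch of what timestamped tags could look like, here is a tiny Python parser. The `START-END: tag, tag` line format, the `TaggedSegment` class, and the `parse_timed_tags` function are all made up for illustration; this is not an existing standard or any model's actual caption format.

```python
from dataclasses import dataclass


@dataclass
class TaggedSegment:
    start: float      # segment start, in seconds
    end: float        # segment end, in seconds
    tags: list[str]   # booru-style tags for this span of the video


def parse_timed_tags(text: str) -> list[TaggedSegment]:
    """Parse lines of the hypothetical form 'START-END: tag, tag, tag'."""
    segments = []
    for line in text.strip().splitlines():
        line = line.strip()
        if not line:
            continue
        # Split the time span from the tag list at the first colon.
        span, _, tag_part = line.partition(":")
        start, _, end = span.partition("-")
        tags = [t.strip() for t in tag_part.split(",") if t.strip()]
        segments.append(TaggedSegment(float(start), float(end), tags))
    return segments


# Hypothetical caption for a 4-second clip.
caption = """
0.0-2.5: beach, sunset, wide shot
2.5-4.0: 1girl, walking, crash zoom
"""

segments = parse_timed_tags(caption)
for seg in segments:
    print(f"{seg.start}s to {seg.end}s -> {seg.tags}")
```

Each scene becomes a fixed time span plus a flat tag list, so two people captioning the same clip would end up with nearly the same text.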

Can anyone tell me why the open source community chose natural language over booru style?

https://redd.it/1t8150y
@rStableDiffusion

6 views, 11:40
