r/StableDiffusion

Are there any simple paths to local image generation on Linux?

I've had no luck so far. To note, I have some general familiarity with the command line.

That said, I've tried ComfyUI, Foooocus, SwarmUI...I've had no luck getting any of those to even successfully install. Missing dependency that, can't find that, can't install that. All these wgets and git clones and 'throw it in python's seem to end badly for me.

I have managed to download and launch Invoke AI successfully. But I haven't had any luck generating an actual image: I got word of ROCm issues from the error messages, and it seems Fedora messes with that. Trying to fix that up still got me nowhere.

\--------

Is there anything a bit simpler to use, just to get started? I run LM Studio on this computer just fine, and as it stands I'm hoping they'll one day branch out into image / video gen. I don't care if it can barely do a smiley face, I just want it to be local, and FOSS.

Bonus Info:
GPU | Radeon 7600
CPU | Ryzen 5 7600
RAM | 16GB DDR5
OS | Fedora 43, Plasma 6.6

If you have ideas, let me know. Thank you for your time.

https://redd.it/1shagk4
@rStableDiffusion

From the StableDiffusion community on Reddit

Explore this post and more from the StableDiffusion community

6 views03:40

r/StableDiffusion

Bad news on Happy Horse from twitter
https://redd.it/1shcgox
@rStableDiffusion

7 views04:40

r/StableDiffusion

Which video model learns face likeness best when training LoRA?

Hey, I’m trying to train LoRAs for real human likeness and was wondering which video model currently does the best job at learning and preserving identity.

I’ve tried a bit with LTX and Wan, but still not sure which one is actually better for likeness. Would love to hear what people are getting the best results with right now

https://redd.it/1shbfra
@rStableDiffusion

From the StableDiffusion community on Reddit

Explore this post and more from the StableDiffusion community

6 views06:40

r/StableDiffusion

ACE-Step 1.5 XL Base — BF16 version (converted from FP32)

I converted the ACE-Step 1.5 XL Base model from FP32 to BF16. The original weights were \~18.8 GB in FP32, this version is \~7.5 GB — same quality, lower VRAM usage.

The Base model is the go-to starting point for fine-tuning (LoRA, etc.) — if you want to train your own style, this is the one to use. A great tool for that is Side Step.

🤗 https://huggingface.co/marcorez8/acestep-v15-xl-base-bf16

I also converted the XL Turbo variant yesterday: Reddit post | Model

https://redd.it/1shfihr
@rStableDiffusion

GitHub

GitHub - koda-dernet/Side-Step: The most powerful training scripts for ACE-Step 1.5 including a Command Line Interface, a Terminal…

The most powerful training scripts for ACE-Step 1.5 including a Command Line Interface, a Terminal Wizard and a Graphical User Interface. - koda-dernet/Side-Step

5 views07:40

r/StableDiffusion

HappyHorse is from Alibaba ATH, not Grok / Veo 3.2 / Wan 2.7 / Seedance 2

I finally found what looks like the official clarification.

According to the verified HappyHorse twitter account, HappyHorse is a product currently in internal testing under Alibaba's ATH innovation division. It also says the product is not officially launched yet, and that the so-called "official websites" circulating online are fake.

https://preview.redd.it/s0yc372pjbug1.png?width=760&format=png&auto=webp&s=77cb530ff67fbb68537c0a7417fa782b88c3981a

https://preview.redd.it/zlpry4m0jbug1.png?width=1337&format=png&auto=webp&s=4756801907a9adcbcad4dc8c3c859615fcc6a208

https://redd.it/1shfzip
@rStableDiffusion

4 views08:40

r/StableDiffusion

VoxCPM TTS model + LoRa training abilities right in Comfy
https://redd.it/1shfwcg
@rStableDiffusion

6 views09:40

r/StableDiffusion

Happy Horse deceiving practices

Kinda lame that Happy Horse was pushed as open weights early on, got people interested, and now it’s apparently becoming closed-source API only, they knew what they were doing.

Way less people are interested in closed video models but make a promise it’s open weights and you get way more traction… then have it closed.

A paid, censored, all you data stolen, closed video model is way less useful for a lot of us. The whole appeal was being able to run it ourselves, experiment freely, fine-tune, make loras, and build on top of it without being stuck behind someone else’s rules and pricing.

Feels like they used the open-weights angle to build hype and traction, then pulled the ladder up and i relly believe that. Also saying that the sources stating it’s open weights are fake also seem super fishy.

Like at this point alibaba just uses the name they built by releasing super good local models to promote closed models (that imo are not even close to other closed models)

https://redd.it/1shi6ca
@rStableDiffusion

From the StableDiffusion community on Reddit

Explore this post and more from the StableDiffusion community

7 views10:40

I can finally run LTX Desktop after the last update.

https://redd.it/1shizzu
@rStableDiffusion

5 views11:40

r/StableDiffusion

0:00

This media is not supported in your browser

VIEW IN TELEGRAM

LTX 2.3 Lip Sync Music Clip -- Drake - Toosie Slide

https://redd.it/1shjibe
@rStableDiffusion

5 views12:40

r/StableDiffusion

Anyone interested in this .. or did someone else make it already? LTX 2.3 Desktop - Lora injector + my own prompt tool..

https://redd.it/1shjyg8
@rStableDiffusion

From the StableDiffusion community on Reddit: Anyone interested in this .. or did someone else make it already? LTX 2.3 Desktop…

Explore this post and more from the StableDiffusion community

5 views13:40

r/StableDiffusion

5 views13:40

About

Blog

Apps

Platform