This media is not supported in your browser
VIEW IN TELEGRAM
ComfyUI-ConnectTheDots - Connect compatible nodes without scrolling across your graph

https://redd.it/1sgzyv3
@rStableDiffusion
Are there any simple paths to local image generation on Linux?

I've had no luck so far. To note, I have some general familiarity with the command line.

That said, I've tried ComfyUI, Foooocus, SwarmUI...I've had no luck getting any of those to even successfully install. Missing dependency that, can't find that, can't install that. All these wgets and git clones and 'throw it in python's seem to end badly for me.

I have managed to download and launch Invoke AI successfully. But I haven't had any luck generating an actual image: I got word of ROCm issues from the error messages, and it seems Fedora messes with that. Trying to fix that up still got me nowhere.

\--------

Is there anything a bit simpler to use, just to get started? I run LM Studio on this computer just fine, and as it stands I'm hoping they'll one day branch out into image / video gen. I don't care if it can barely do a smiley face, I just want it to be local, and FOSS.


Bonus Info:
GPU | Radeon 7600
CPU | Ryzen 5 7600
RAM | 16GB DDR5
OS | Fedora 43, Plasma 6.6


If you have ideas, let me know. Thank you for your time.

https://redd.it/1shagk4
@rStableDiffusion
Bad news on Happy Horse from twitter
https://redd.it/1shcgox
@rStableDiffusion
Which video model learns face likeness best when training LoRA?

Hey, I’m trying to train LoRAs for real human likeness and was wondering which video model currently does the best job at learning and preserving identity.

I’ve tried a bit with LTX and Wan, but still not sure which one is actually better for likeness. Would love to hear what people are getting the best results with right now

https://redd.it/1shbfra
@rStableDiffusion
ACE-Step 1.5 XL Base — BF16 version (converted from FP32)

I converted the ACE-Step 1.5 XL Base model from FP32 to BF16. The original weights were \~18.8 GB in FP32, this version is \~7.5 GB — same quality, lower VRAM usage.

The Base model is the go-to starting point for fine-tuning (LoRA, etc.) — if you want to train your own style, this is the one to use. A great tool for that is Side Step.

🤗 https://huggingface.co/marcorez8/acestep-v15-xl-base-bf16

I also converted the XL Turbo variant yesterday: Reddit post | Model

https://redd.it/1shfihr
@rStableDiffusion
HappyHorse is from Alibaba ATH, not Grok / Veo 3.2 / Wan 2.7 / Seedance 2

I finally found what looks like the official clarification.

According to the verified HappyHorse twitter account, HappyHorse is a product currently in internal testing under Alibaba's ATH innovation division. It also says the product is not officially launched yet, and that the so-called "official websites" circulating online are fake.

https://preview.redd.it/s0yc372pjbug1.png?width=760&format=png&auto=webp&s=77cb530ff67fbb68537c0a7417fa782b88c3981a

https://preview.redd.it/zlpry4m0jbug1.png?width=1337&format=png&auto=webp&s=4756801907a9adcbcad4dc8c3c859615fcc6a208

https://redd.it/1shfzip
@rStableDiffusion
VoxCPM TTS model + LoRa training abilities right in Comfy
https://redd.it/1shfwcg
@rStableDiffusion
Happy Horse deceiving practices

Kinda lame that Happy Horse was pushed as open weights early on, got people interested, and now it’s apparently becoming closed-source API only, they knew what they were doing.

Way less people are interested in closed video models but make a promise it’s open weights and you get way more traction… then have it closed.

A paid, censored, all you data stolen, closed video model is way less useful for a lot of us. The whole appeal was being able to run it ourselves, experiment freely, fine-tune, make loras, and build on top of it without being stuck behind someone else’s rules and pricing.

Feels like they used the open-weights angle to build hype and traction, then pulled the ladder up and i relly believe that. Also saying that the sources stating it’s open weights are fake also seem super fishy.

Like at this point alibaba just uses the name they built by releasing super good local models to promote closed models (that imo are not even close to other closed models)

https://redd.it/1shi6ca
@rStableDiffusion