Local I2V finally feels less like image wiggle and more like shot direction with LTX Director

https://redd.it/1thuq4k
@rStableDiffusion
Kijai just uploaded LTX2.3 OmniNFT RL-LoRA for better video and audio!

Reposting this from Twitter (wildminder):

"LTX2.3 OmniNFT RL-LoRA generates high-quality video/audio + visuals and sound are perfectly synchronized, no laggy or mismatched audio.

- realistic Lip-Sync

- action-matched sound

- reduces synchronization errors by 52%

really nice output"

https://reddit.com/link/1thxd1p/video/cygvtd81a52h1/player

Reddit keeps blocking my posts (removed by filters), so I'm editing the links to see if this post will work (just remove the spaces, sorry):

Project page: zghhui . github . io/OmniNFT/

Kijai HF repo: huggingface . co/Kijai/LTX2.3_comfy/tree/main

https://redd.it/1thxd1p
@rStableDiffusion
building a shared hair library for SD prompts - who's down to help

hair is probably the most inconsistent thing I generate and I reckon a lot of you feel the same. prompts like "wolf cut" or "space buns" work sometimes and totally miss other times depending on the checkpoint, lighting, face angle, even sampler settings and CFG. there's no universal hairstyle taxonomy baked into SD prompts the way there is for art styles or character archetypes - there are some community prompt packs floating around but nothing really structured or tested across models.

so I want to build something actually useful: a shared hair library. basically a structured list of hairstyle prompt terms, what models they tend to work on, what breaks them, and practical notes on ControlNet, IP-Adapter, or reference image approaches for the trickier ones. not just a name dump - actual tested prompts with context on what conditions they need to land properly. things like aspect ratio, whether you need a LoRA to reinforce the shape, whether regional prompting helps when you're fighting bleed from the rest of the composition.

worth noting: for anything beyond simple styles, prompts alone usually aren't enough. most reliable workflows I've seen lean on LoRAs for specific cuts, ControlNet for structure, or IP-Adapter/reference-only modes for style transfer. would be good to document what combination actually works per style rather than pretending a single tag is going to do the job.

anyone already doing something like this or have a system that works for you? and when a hairstyle prompt just isn't cooperating, what's your fallback - reference images, inpainting, hair-specific LoRA, something else?
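
for the structured list idea above, here's a rough sketch of what one library entry could look like, assuming plain python dataclasses. the field names (`works_on`, `breaks_when`, `aids`), the helper functions, and the model names are all made up for illustration, and the sample values are placeholders, not tested results.

```python
from dataclasses import dataclass, field


@dataclass
class HairEntry:
    """One hairstyle term plus the context it needs to land (hypothetical schema)."""
    term: str                                             # prompt term, e.g. "wolf cut"
    works_on: list[str] = field(default_factory=list)     # checkpoints it tends to work on
    breaks_when: list[str] = field(default_factory=list)  # conditions that break it
    aids: list[str] = field(default_factory=list)         # LoRA / ControlNet / IP-Adapter, if needed
    notes: str = ""                                       # aspect ratio, regional prompting, etc.


def entries_for_model(library: list[HairEntry], model: str) -> list[HairEntry]:
    """Return the entries reported to work on the given checkpoint."""
    return [e for e in library if model in e.works_on]


def needs_more_than_prompt(entry: HairEntry) -> bool:
    """True when the entry lists any aid beyond the bare prompt term."""
    return bool(entry.aids)


# placeholder entries: "model-a" and "model-b" are stand-ins, not real test results
library = [
    HairEntry(
        term="wolf cut",
        works_on=["model-a"],
        breaks_when=["extreme face angle"],
        aids=["hair-specific LoRA"],
        notes="placeholder note",
    ),
    HairEntry(
        term="space buns",
        works_on=["model-a", "model-b"],
        breaks_when=["placeholder condition"],
    ),
]

for entry in entries_for_model(library, "model-b"):
    print(entry.term, "| needs aids:", needs_more_than_prompt(entry))
```

keeping it as flat records like this means it could be dumped to JSON or a spreadsheet later without much rework, which matters if a bunch of people are contributing.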

https://redd.it/1thz0ma
@rStableDiffusion
RL LoRA for LTX2.3. It greatly increases coherence and quality while reducing artifacts.

https://redd.it/1ti3jar
@rStableDiffusion
Is it possible to add audio to a WAN video with LTX?

I prefer WAN over LTX. It would be nice to add audio to WAN.

https://redd.it/1ti5pvo
@rStableDiffusion