Qwen Edit vs Nano Banana vs Flux Kontext Pro & Flux Kontext Dev
Prompt: Turn the motorcycle pink and put it against the backdrop of a big city at night, glowing with huge neon signs.
Nano Banana really delivers!
Virtual fitting room on VideoX-Fun / Wan2.1-I2V-14B
Qwen2.5-VL-7B-Instruct is used for clothing description.
And under the hood, there's also OpenPose, DensePose, and more.
If anyone wanted to fine-tune WAN 2.1 for virtual try-on, here it is.
https://vivocameraresearch.github.io/magictryon/
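As a rough illustration of the captioning step, here is a minimal sketch of asking Qwen2.5-VL-7B-Instruct to describe a garment via the standard `transformers` chat interface. The prompt wording is my own assumption, not taken from the MagicTryOn project, and the whole thing requires the ~7B checkpoint from the Hugging Face Hub.

```python
# Sketch: garment description with Qwen2.5-VL-7B-Instruct (transformers >= 4.49).
# The instruction text below is an assumption, not MagicTryOn's actual prompt.

def build_clothing_messages(image_path: str) -> list:
    """Chat-format messages asking the VLM to describe only the garment."""
    return [{
        "role": "user",
        "content": [
            {"type": "image", "image": image_path},
            {"type": "text",
             "text": "Describe only the clothing item: fabric, color, "
                     "pattern, sleeve length, and fit. One sentence."},
        ],
    }]

def describe_clothing(image_path: str) -> str:
    # Heavy dependencies imported lazily; needs `transformers`, `torch`, `pillow`.
    from PIL import Image
    from transformers import AutoProcessor, Qwen2_5_VLForConditionalGeneration

    model_id = "Qwen/Qwen2.5-VL-7B-Instruct"
    model = Qwen2_5_VLForConditionalGeneration.from_pretrained(
        model_id, torch_dtype="auto", device_map="auto")
    processor = AutoProcessor.from_pretrained(model_id)

    messages = build_clothing_messages(image_path)
    text = processor.apply_chat_template(
        messages, tokenize=False, add_generation_prompt=True)
    inputs = processor(text=[text], images=[Image.open(image_path)],
                       return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=64)
    # Decode only the newly generated tokens, not the prompt.
    return processor.batch_decode(
        out[:, inputs.input_ids.shape[1]:], skip_special_tokens=True)[0]
```

Call `describe_clothing("shirt.jpg")` to get a one-sentence garment caption you could feed into a try-on pipeline.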
Runway Game Worlds
The name is a bit misleading.
It's more like Runway Comics Worlds, or even Runway's Board Games.
Because it goes back to the roots: text-based control. It's basically text adventures: you write a prompt, the game reacts, but also generates an image of what's happening.
Text games without the need for your imagination.
https://play.runwayml.com/
*"Game Worlds uses new AI technologies for nonlinear storytelling. This means that each game session you play is generated in real time with personalized stories, characters, and multimodal media.
In the beta version, you can play both pre-made text adventures and create your own."*
https://play.runwayml.com/
Feel the difference between Nano Banana and other AI generators.
One of the prompts in the picture was: "Make only the plate and the soup itself in the style of 2D anime, and don't touch anything else at all."
VibeVoice: a new text-to-speech (TTS) model for long-form conversations with multiple voices from Microsoft.
• 1.5B parameters
• MIT licensed
• Up to 1.5 hours of generation
• Strong emotional expressiveness
More details: VibeVoice is a new framework designed for creating expressive and extended audio recordings of conversations with multiple speakers (such as podcasts) from text. It addresses key issues of traditional text-to-speech (TTS) systems, particularly those related to scalability, speaker consistency, and natural turn-taking.
The model can synthesize up to 90 minutes of speech with up to 4 distinct speakers, exceeding the typical limitations of many previous models restricted to one or two speakers.
Project page: https://microsoft.github.io/VibeVoice/ (lots of examples).
Youโll find the weights, code, and even a Gradio demo here: https://86636c494bbddc69c7.gradio.live/
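For the input side, long-form multi-speaker scripts are typically fed in as speaker-tagged transcripts. Here is a tiny helper that builds one; note that the exact "Speaker N:" tag format and the 1–4 speaker numbering are my assumptions about the demo's interface, so check the project repo before relying on them.

```python
# Sketch: building a speaker-tagged transcript for a multi-speaker TTS run.
# The "Speaker N:" tag format is an assumed convention, not confirmed from
# the VibeVoice repo; the 4-speaker cap matches the announced model limit.

def format_script(turns: list[tuple[int, str]]) -> str:
    """Turn (speaker_id, utterance) pairs into a speaker-tagged transcript."""
    lines = []
    for speaker, text in turns:
        if not 1 <= speaker <= 4:  # VibeVoice handles up to 4 distinct speakers
            raise ValueError("speaker id must be between 1 and 4")
        lines.append(f"Speaker {speaker}: {text.strip()}")
    return "\n".join(lines)

script = format_script([
    (1, "Welcome back to the podcast."),
    (2, "Thanks! Today we're talking about long-form TTS."),
])
print(script)
```

The resulting text could then be pasted into the Gradio demo or passed to the repo's inference scripts.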
Examples of applications that can be built on top of Nano Banana, or, as it is now officially called, gemini-2.5-flash-image-preview.
This is done in Google AI Studio, and you can check out examples here: https://aistudio.google.com/apps
What really impressed me was "Gemini Co-Drawing", which demonstrates the multimodal model's ability to read hand-drawn diagrams, perform calculations, and follow complex editing instructions.
All of this is available at the link above.
And you can read more about development and pricing here: https://developers.googleblog.com/en/introducing-gemini-2-5-flash-image/
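If you'd rather call the model from code than from AI Studio, here is a minimal sketch using the `google-genai` Python SDK's `generate_content` call, which accepts an image plus a text instruction and can return an edited image as an inline part. The file names and prompt are illustrative; it needs `pip install google-genai pillow` and a `GEMINI_API_KEY` in the environment.

```python
# Sketch: image editing with gemini-2.5-flash-image-preview via google-genai.
# File names and the prompt are illustrative, not from the original post.

def extract_image_bytes(response):
    """Return the first inline image payload from a generate_content response."""
    for part in response.candidates[0].content.parts:
        data = getattr(part, "inline_data", None)
        if data is not None:
            return data.data  # raw image bytes
    return None

def edit_image(image_path: str, prompt: str, out_path: str = "edited.png"):
    # Heavy dependencies imported lazily; requires a valid GEMINI_API_KEY.
    from google import genai
    from PIL import Image

    client = genai.Client()  # reads GEMINI_API_KEY from the environment
    response = client.models.generate_content(
        model="gemini-2.5-flash-image-preview",
        contents=[Image.open(image_path), prompt],
    )
    data = extract_image_bytes(response)
    if data is None:
        raise RuntimeError("model returned no image part")
    with open(out_path, "wb") as f:
        f.write(data)
```

For the soup example above, something like `edit_image("soup.jpg", "Make only the plate and the soup itself 2D anime style; don't touch anything else.")` would be the equivalent API call.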