Vortex Next Gen Trends
64.9K subscribers
196 photos
210 videos
223 links
Vortex Channel is a part of BlockChainWorld 5,000.000 MASSIVE AI & Crypto COMMUNITY ๐Ÿ’Ž ALL ABOUT CRYPTO, TOKENS, AI, TAP GAMES, MEME COINS, PLAY AND EARN, DEFI, P2E, NFT, AI TOOLS, WEB3 & BITCOIN FORECASTS!

Order promo ๐Ÿ‘‰ PR@blockchainworld.ai
Download Telegram
Qwen Edit vs Nano Banana vs Flux Kontext Pro & Flux Kontext Dev
Prompt: Turn the motorcycle pink and put it against the backdrop of a big city at night, glowing with huge neon signs.
Banano really delivers! ๐Ÿš€
๐Ÿ‘632๐ŸŽ‰594โค593๐Ÿ”ฅ530
Virtual fitting room on VideoX-Fun / Wan2.1-I2V-14B
Qwen2.5-VL-7B-Instruct is used for clothing description.
And under the hood, thereโ€™s also OpenPose, DensePose, and more.

If anyone wanted to fine-tune WAN 2.1 for virtual try-on โ€” here it is.

https://vivocameraresearch.github.io/magictryon/
โค873๐ŸŽ‰820๐Ÿ”ฅ794๐Ÿ‘773
This media is not supported in your browser
VIEW IN TELEGRAM
Runway Game Worlds
The name is a bit misleading.
Itโ€™s more like Runway Comics Worlds or even Runwayโ€™s Board Games.
Because it goes back to the roots โ€” text-based control. Itโ€™s basically text adventures: you write a prompt, the game reacts, but also generates an image of whatโ€™s happening.
Text games without the need for your imagination.
*โ€œGame Worlds uses new AI technologies for nonlinear storytelling. This means that each game session you play is generated in real time with personalized stories, characters, and multimodal media.
In the beta version, you can play both pre-made text adventures and create your own.โ€*

https://play.runwayml.com/
๐Ÿ‘58โค56๐ŸŽ‰55๐Ÿ”ฅ49
Feel the difference between Nanabanana and other AI generators.
One of the prompts on a picture was: 'make only the plate and the soup itself in the style of 2D anime, and donโ€™t touch anything else at all
โค969๐Ÿ”ฅ932๐Ÿ‘915๐ŸŽ‰891
This media is not supported in your browser
VIEW IN TELEGRAM
VibeVoice: a new text-to-speech (TTS) model for long-form conversations with multiple voices from Microsoft.
โ€ข 1.5B parameters
โ€ข MIT licensed
โ€ข Up to 1.5 hours of generation
โ€ข Strong emotional expressiveness
More details: VibeVoice is a new framework designed for creating expressive and extended audio recordings of conversations with multiple speakers (such as podcasts) from text. It addresses key issues of traditional text-to-speech (TTS) systems, particularly those related to scalability, speaker consistency, and natural turn-taking.
The model can synthesize up to 90 minutes of speech with up to 4 distinct speakers, exceeding the typical limitations of many previous models restricted to 1โ€“2 speakers.
Project page: https://microsoft.github.io/VibeVoice/ โ€” lots of examples.
Youโ€™ll find the weights, code, and even a Gradio demo here: https://86636c494bbddc69c7.gradio.live/
๐Ÿ”ฅ146๐ŸŽ‰133๐Ÿ‘125โค116
Examples of applications that can be built on top of Nanabananaโ€”or, as it is now officially called: gemini-2.5-flash-image-preview.
This is done in Google AI Studio, and you can check out examples here: https://aistudio.google.com/apps
What really impressed me was โ€œGemini Co-Drawingโ€, which demonstrates the multimodal modelโ€™s ability to read hand-drawn diagrams, perform calculations, and follow complex editing instructions.
All of this is available at the link above.
And you can read more about development and pricing here: https://developers.googleblog.com/en/introducing-gemini-2-5-flash-image/
๐Ÿ‘573๐Ÿ”ฅ569โค552๐ŸŽ‰540