Midjourney has introduced HD Mode
Only available for Pro and Mega subscription plans.
Costs 3.2 times more credits.
Because... lots of pixels.
Minimax Speech 2.5
Minimax Text-to-Speech (TTS) Generator:
Compared to the Speech 02 version released in May, Speech 2.5 features three new improvements:
• Higher speech expressiveness in multiple languages,
• More realistic voice reproduction,
• Broad coverage of 40 languages.
https://www.minimax.io/audio
Grok 4 is now free to use!
Cranks out and edits images like a pro.
Update: Free users get 5 requests every 12 hours.
A cool product feature.
A long press on an image turns it into a video.
This is the iOS app for Grok. It's coming to Android soon.
Skywork has released Matrix-3D, a 3D world generator that works through a combination of video generation and 3D reconstruction. It's sort of a response to Hunyuan World 1 from Tencent, the Odyssey project, and Google's recently announced Genie 3.
You enter a prompt or upload an image, and you get either a video panorama or a 3D scene to explore. The catch is, it seems you'll have to move through it by setting a trajectory. World 1, judging by the demos, supported a gamepad.
Generation can be done at resolutions of 960 × 480 or 1440 × 720. On a single A800 with 40 GB VRAM, rendering 720p takes about an hour...
Suno Studio!
Now this is intriguing.
At least because of this:
Multi-track creation. Export to MIDI.
If this is real multitrack, where one track = one instrument, then it's an absolute bomb that will just wipe out all competitors.
Imagen 4 has been rolled out in Google AI Studio and via the API. And it's damn good, especially Ultra.
AI Studio has limits.
And via the API, the prices are: Fast ($0.02/image), Standard ($0.04/image), Ultra ($0.06/image).
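To put those per-image prices in perspective, here is a rough sketch of what a batch would cost at each tier. The prices are the ones quoted above. The `batch_cost` helper and the tier keys are made up for illustration and are not part of any Google API.

```python
# Per-image API prices in USD, as quoted in the post.
PRICE_PER_IMAGE = {"fast": 0.02, "standard": 0.04, "ultra": 0.06}


def batch_cost(tier: str, n_images: int) -> float:
    """Estimate the cost in USD of generating n_images at the given tier.

    Hypothetical helper for back-of-the-envelope math only.
    """
    if n_images < 0:
        raise ValueError("n_images must be non-negative")
    # Round to cents to avoid floating-point noise in the result.
    return round(PRICE_PER_IMAGE[tier.lower()] * n_images, 2)


if __name__ == "__main__":
    for tier in PRICE_PER_IMAGE:
        print(f"{tier}: ${batch_cost(tier, 1000):.2f} per 1,000 images")
```

So a thousand Ultra images comes out to about $60, versus $20 on Fast.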
Higgsfield keeps dropping viral features.
Product-to-Video is basically Flux Kontext, but for video.
Something similar was done by Pika and Runway, but Higgs' cherry-picks look really polished.
Qwen Edit vs Nano Banana vs Flux Kontext Pro & Flux Kontext Dev
Prompt: Turn the motorcycle pink and put it against the backdrop of a big city at night, glowing with huge neon signs.
Nano Banana really delivers!
Virtual fitting room on VideoX-Fun / Wan2.1-I2V-14B
Qwen2.5-VL-7B-Instruct is used for clothing description.
And under the hood, there's also OpenPose, DensePose, and more.
If anyone wanted to fine-tune WAN 2.1 for virtual try-on, here it is.
https://vivocameraresearch.github.io/magictryon/