π The main thing about GPT-5:
π GPT-5 is an intelligent system β depending on the complexity of the request, the model chooses the depth of reasoning. All current models (from GPT-4o mini to o3 Pro) will be replaced by the corresponding branch of GPT-5.
π Available to everyone today, including free users β if the limit is exceeded, they will be transferred to GPT-5 mini.
π The best model for coding on the market. The Pro version scores 100% in the AIME 2025 mathematical benchmark.
π Even the basic model hallucinates significantly less than GPT-4o and o3.
π The model is much less inclined to agree with the user and flatter him.
π GPT-5 was classified as a high-risk model for hazards in biology and chemistry β appropriate measures have been taken. At the same time, the model is 65% safer than its predecessors, and more than 9,000 hours were spent on testing.
More details are in the model's system map.
π GPT-5 is an intelligent system β depending on the complexity of the request, the model chooses the depth of reasoning. All current models (from GPT-4o mini to o3 Pro) will be replaced by the corresponding branch of GPT-5.
π Available to everyone today, including free users β if the limit is exceeded, they will be transferred to GPT-5 mini.
π The best model for coding on the market. The Pro version scores 100% in the AIME 2025 mathematical benchmark.
π Even the basic model hallucinates significantly less than GPT-4o and o3.
π The model is much less inclined to agree with the user and flatter him.
π GPT-5 was classified as a high-risk model for hazards in biology and chemistry β appropriate measures have been taken. At the same time, the model is 65% safer than its predecessors, and more than 9,000 hours were spent on testing.
More details are in the model's system map.
β€218π211π211π₯192
This media is not supported in your browser
VIEW IN TELEGRAM
Midjourney has introduced HD Mode
Only available for Pro and Mega subscription plans.
Costs 3.2 times more credits.
Because⦠lots of pixels.
Only available for Pro and Mega subscription plans.
Costs 3.2 times more credits.
Because⦠lots of pixels.
β€229π198π197π₯190
Media is too big
VIEW IN TELEGRAM
Minimax Speech 2.5
Minimax Text-to-Speech (TTS) Generator:
Compared to the Speech 02 version released in May, Speech 2.5 features three new improvements:
β’ Higher speech expressiveness in multiple languages,
β’ More realistic voice reproduction,
β’ Broad coverage of 40 languages.
https://www.minimax.io/audio
Minimax Text-to-Speech (TTS) Generator:
Compared to the Speech 02 version released in May, Speech 2.5 features three new improvements:
β’ Higher speech expressiveness in multiple languages,
β’ More realistic voice reproduction,
β’ Broad coverage of 40 languages.
https://www.minimax.io/audio
β€287π₯264π258π257
Grok 4 is now free to use! π
Cranks out and edits images like a pro.
Update: Free users get 5 requests every 12 hours.
Cranks out and edits images like a pro.
Update: Free users get 5 requests every 12 hours.
π277π277β€273π₯270
This media is not supported in your browser
VIEW IN TELEGRAM
A cool product feature.
A long press on an image turns it into a video.
This is the iOS app for Grok. Itβs coming to Android soon.
A long press on an image turns it into a video.
This is the iOS app for Grok. Itβs coming to Android soon.
β€253π244π234π₯229
This media is not supported in your browser
VIEW IN TELEGRAM
SkyWorks: theyβve released Matrix-3D, a 3D world generator that works through a combination of video generation and 3D reconstruction. Itβs sort of a response to Hunyuan World 1 from Tencent, the Odyssey project, and Googleβs recently announced Genie 3.
You enter a prompt or upload an image, and you get either a video panorama or a 3D scene to explore. The catch is, it seems youβll have to move through it by setting a trajectory. World 1, judging by the demos, supported a gamepad.
Generation can be done at resolutions of 960 Γ 480 or 1440 Γ 720. On a single A800 with 40 GB VRAM, rendering 720p takes about an hour...
You enter a prompt or upload an image, and you get either a video panorama or a 3D scene to explore. The catch is, it seems youβll have to move through it by setting a trajectory. World 1, judging by the demos, supported a gamepad.
Generation can be done at resolutions of 960 Γ 480 or 1440 Γ 720. On a single A800 with 40 GB VRAM, rendering 720p takes about an hour...
β€258π252π₯242π224
This media is not supported in your browser
VIEW IN TELEGRAM
Suno Studio!
Now this is intriguing.
At least because of this:
Multi-track creation. Export to MIDI.
If this is real multitrack β where one track = one instrument β then itβs an absolute bomb that will just wipe out all competitors.
Now this is intriguing.
At least because of this:
Multi-track creation. Export to MIDI.
If this is real multitrack β where one track = one instrument β then itβs an absolute bomb that will just wipe out all competitors.
π79β€69π61π₯56
Imagen 4 has been rolled out in Google AI Studio and via the API. And itβs damn good β especially Ultra.
AI Studio has limits.
And via API, the prices are: Fast ($0.02/image) Standard ($0.04/image) Ultra ($0.06/image)
AI Studio has limits.
And via API, the prices are: Fast ($0.02/image) Standard ($0.04/image) Ultra ($0.06/image)
π692π677β€659π₯637
Higgsfield keeps dropping viral features.
Product-to-Video is basically Flux Context, but for video.
Something similar was done by Pika and Runway, but Higgsβ cherry-picks look really polished.
Product-to-Video is basically Flux Context, but for video.
Something similar was done by Pika and Runway, but Higgsβ cherry-picks look really polished.
π409π₯408β€402π399
Qwen Edit vs Nano Banana vs Flux Kontext Pro & Flux Kontext Dev
Prompt: Turn the motorcycle pink and put it against the backdrop of a big city at night, glowing with huge neon signs.
Banano really delivers! π
Prompt: Turn the motorcycle pink and put it against the backdrop of a big city at night, glowing with huge neon signs.
Banano really delivers! π
π632π594β€593π₯530
Virtual fitting room on VideoX-Fun / Wan2.1-I2V-14B
Qwen2.5-VL-7B-Instruct is used for clothing description.
And under the hood, thereβs also OpenPose, DensePose, and more.
If anyone wanted to fine-tune WAN 2.1 for virtual try-on β here it is.
https://vivocameraresearch.github.io/magictryon/
Qwen2.5-VL-7B-Instruct is used for clothing description.
And under the hood, thereβs also OpenPose, DensePose, and more.
If anyone wanted to fine-tune WAN 2.1 for virtual try-on β here it is.
https://vivocameraresearch.github.io/magictryon/
β€873π820π₯794π773