4o Image Generation

shaky-carrousel · 2025-03-25T18:29:17 1742927357

Tried it, the "compise armporressed" and "Pros: made bord reqotons" didn't impress me in the slightest.

qoez · 2025-03-25T18:28:04 1742927284

Looks about what you'd get with FLUX and attaching some language model to enhance your prompt with eg more text

afro88 · 2025-03-25T18:29:24 1742927364

Flux doesn't do text that good

minimaxir · 2025-03-25T18:10:48 1742926248

OpenAI's livestream of GPT-4o Image Generation shows that it is slowwwwwwwwww (maybe 30 seconds per image, which Sam Altman had to spin "it's slow but the generated images are worth it"). Instead of using a diffusion approach, it appears to be generating the image tokens and decoding them akin to the original DALL-E (https://openai.com/index/dall-e/), which allows for streaming partial generations from top to bottom. In contrast, Google's Gemini can generate images and make edits in seconds.

No API yet, and given the slowness I imagine it will cost much more than the $0.03+/image of competitors.

occamschainsaw · 2025-03-25T18:18:35 1742926715

Did they time it with the Gemini 2.5 launch? https://news.ycombinator.com/item?id=43473489

Was it public information when Google was going to launch their new models? Interesting timing.

qoez · 2025-03-25T18:26:51 1742927211

"Interesting timing" It's like the 4th time by my counting they've done this

rvz · 2025-03-25T18:12:54 1742926374

> ChatGPT’s new image generation in GPT‑4o rolls out starting today to Plus, Pro, Team, and Free users as the default image generator in ChatGPT, with access coming soon to Enterprise and Edu. For those who hold a special place in their hearts for DALL·E, it can still be accessed through a dedicated DALL·E GPT.

> Developers will soon be able to generate images with GPT‑4o via the API, with access rolling out in the next few weeks.

That's it folks. Tens of thousands of so-called "AI" image generator startups have been obliterated and taking digital artists with them all reduced to near zero.

Now you have a widely accessible meme generator with the name "ChatGPT".

The last task is for an open weight model that competes against this and is faster and all for free.

afro88 · 2025-03-25T18:29:00 1742927340

Yep. The coherence and text quality is insanely good. Keen to play with it to find it's "mangled hands" style deficiencies, because of course they cherry picked the best examples.

（评论） (comments)

（评论）
(comments)