OpenAI claims a number of key enhancements: customers can refine pictures via dialog whereas sustaining visible consistency; the system can analyze uploaded pictures and incorporate their particulars into new generations; and it presents stronger photorealism—though what constitutes photorealism (for instance, imitations of HDR digicam options, element degree, and picture distinction) will be subjective.

A screenshot of OpenAI’s 4o Picture Technology mannequin in ChatGPT. We see an current AI-generated picture of a barbarian and a TV set, then a request to set the TV set on fireplace.
Credit score:
OpenAI / Benj Edwards
In its weblog publish, OpenAI offered examples of meant makes use of for the picture generator, together with creating diagrams, infographics, social media graphics utilizing particular coloration codes, logos, instruction posters, enterprise playing cards, customized inventory pictures with clear backgrounds, modifying consumer pictures, or visualizing ideas mentioned earlier in a chat dialog.
Notably absent: Any point out of the artists and graphic designers whose jobs could be affected by this expertise. As we lined all through 2022 and 2023, job impression remains to be a high concern amongst critics of AI-generated graphics.
Fluid media manipulation
Shortly after OpenAI launched 4o Picture Technology, the AI neighborhood on X put the characteristic via its paces, discovering that it’s fairly succesful at inserting somebody’s face into an current picture, creating pretend screenshots, and changing meme pictures into the type of Studio Ghibli, South Park, felt, Muppets, Rick and Morty, Household Man, and rather more.
It looks like we’re coming into a very fluid media “actuality” courtesy of a software that may effortlessly convert visible media between kinds. The kinds additionally doubtlessly encroach upon protected mental property. Given what Studio Ghibli co-founder Hayao Miyazaki has beforehand stated about AI-generated art work (“I strongly really feel that that is an insult to life itself”), it appears he’d be unlikely to understand the present AI-generated Ghibli fad on X in the intervening time.
To get a way of what 4o IG can do ourselves, we ran some casual checks, together with a few of the traditional CRT barbarians, queens of the universe, and beer-drinking cats, which you’ve got already seen above (and naturally, the plate of pickles).
The ChatGPT interface with the brand new 4o picture mannequin is conversational (like earlier than with DALL-E 3), however you’ll be able to recommend modifications over time. For instance, we took the creator’s EGA pixel bio (as we did with Google’s mannequin final week) and tried to offer it a full physique. Arguably, Google’s extra restricted picture mannequin did a much better job than 4o IG.