Ever since OpenAI introduced native image generation capabilities in ChatGPT, users like …. have been clamoring for it. While users initially experimented with different styles of image manipulation, the Studio Ghibli-style images have gained the most traction on social media. While some people are frustrated by this ongoing trend, others just can’t get enough of the Japanese anime world. Regardless of which camp you fall into, it is worth knowing what else can be done with this new technology.
What else can you do with ChatGPT’s new capabilities?
While most of the Twitter universe is filled with Ghibli portraits, Reddit users are now apparently moving on to other options.
Some users have started recreating images in other popular styles using ChatGPT, including LEGO, Medusa, and unearthed Roman sculptures. A few examples of these filters are included below. Reddit users applied all of these filters to the famous “distracted boyfriend” meme.
However, ChatGPT’s new capabilities have more to offer than just applying filters to images. In its release blog, OpenAI showed off some truly stunning images created with GPT-4o. In one such image, Karl Marx can be seen outside a modern store carrying shopping bags. In another, an astronaut appears to be painting a cosmic scene inside a spacecraft or space station.
The possibilities are potentially endless with the new ChatGPT feature, but OpenAI has recently put restrictions on the number of images even paid users can create, so it may be hard to try out all the alternatives.
What is native image generation?
This is not the first time that ChatGPT has gained the ability to generate images. In fact, the chatbot has been offering image generation even to free users for a while now. What is new is the native image generation capability that it has now acquired.
But what exactly is native image generation? Native image generation refers to the chatbot’s ability to directly generate and edit images using its multimodal capabilities, rather than relying on external models like DALL-E 3. While Gemini actually beat ChatGPT to the punch with native image generation support, OpenAI’s chatbot has definitely found better traction among the masses.
Why is native image generation a big deal?
OpenAI has built image generation capabilities directly into GPT-4o, which allows the chatbot to ‘refine images through natural conversation’. ChatGPT can also now handle between 10 and 20 different objects in an image, which allows for more control and consistency in the image.
Native image generation also establishes a link between ChatGPT’s text knowledge base and images, which leads to the chatbot generating more efficient and smarter-feeling responses.