The latest update to the Gemini chatbot has introduced exciting new features that allow users to edit both AI-generated images and those uploaded from personal devices. This enhancement, announced by Google, marks a significant step forward in the realm of digital creativity.
Starting today, the native image editing capabilities will be gradually available to users across more than 45 languages and various countries, expanding the accessibility of this innovative tool.
This rollout follows the successful pilot of an AI image-editing model in Google’s AI Studio earlier this year, which gained attention for its ability to manipulate images in ways that sparked discussions about ethical use. Similar to recent advancements in other AI platforms, Gemini’s image editor aims to provide superior results compared to traditional standalone image generators.
One of the standout features of Gemini is its “multi-step” editing process, which allows for a more nuanced and contextual interaction with the user. This means that users can seamlessly integrate text and images, making it easier to modify backgrounds, swap out objects, and add new elements directly within the Gemini interface.
Editing images with Gemini opens up a world of possibilities. For instance, users can upload their own photos and request Gemini to visualize them with different hairstyles or colors. Additionally, it can assist in crafting creative narratives, such as generating a draft of a bedtime story featuring dragons, complete with accompanying illustrations.
While the potential for misuse, such as creating deepfakes, is a valid concern, Google has taken steps to mitigate these risks. Images produced or altered through Gemini will feature an invisible watermark, and the company is exploring the implementation of visible watermarks on all images generated by the platform.