Gemini’s current image analysis capabilities are restricted to single uploads. While the platform provides features like image description and text extraction, users are required to upload images individually.
This contrasts with ChatGPT, which allows for multi-image uploads, making Gemini’s process less efficient. Rumors suggest that Google is working on a multi-image upload feature for Gemini.
Recent leaks indicate that users may soon be able to upload up to ten images in a single prompt for a more comprehensive analysis. This function is currently hidden within the Google app beta version.
Although it is functional, manual activation is necessary, and a broader rollout is expected with a future update. Currently, Gemini can analyze images through its web and mobile app versions, helping users gain insights into their content.
Features include image description generation, text extraction, and object identification. However, unlike ChatGPT, Gemini can address only one image at a time.
This limitation necessitates that users submit images sequentially, creating a cumbersome experience for those needing multi-image analysis. When attempting to add a second image, Gemini displays a prompt indicating that adding a new image will replace the current one.
While users can create a collage of multiple photos in one submission, this approach is more time-consuming than sending individual images. Screenshots demonstrate that Gemini can analyze and provide information for all images within a single prompt and offer detailed feedback on each.
Despite the polished interface and functionality observed in beta, the multi-image analysis feature remains unavailable to the general public. The functionality was located in version 16.11.32 of the Google app and requires manual enabling.
Once officially released, it’s anticipated that the feature will first be available on the web before reaching the mobile app.
Leave a Reply