Gemini Live Can Now Visually Recognize Your Perspective: How Intelligent Is This AI Video Functionality?

Understanding Gemini Live: A New AI Feature from Google
Google’s Gemini Live is an innovative feature that allows users to utilize their phone’s cameras or screen sharing to gain insights and answers about their surroundings in real-time. This exciting capability was initially made available in April for specific devices, such as the Pixel 9 or Galaxy S25, as well as for those subscribed to Gemini Advanced. Soon after, Google expanded its availability to all compatible Android devices, showcasing their confidence in this powerful tool.
Having extensively tested various AI products, I found Gemini Live to be an impressive smartphone assistant, reminiscent of the aspirations many had for AI since Apple’s Siri launched over a decade ago. However, while Gemini Live shows immense potential, it is not without its flaws. The service can sometimes exhibit inaccuracies, leading to some doubts about its reliability.
Key Features of Gemini Live
Let’s explore what Gemini’s camera-sharing feature excels at:
- Object Recognition: I test it by asking it to identify items in my bathroom, and while it performed well, it did not always get everything right.
- Task Assistance: When I inquired how to center my icons in Windows 11, Gemini guided me correctly, although it advised me to enable auto-hide, which I did not need.
- Book Summaries: At a bookstore, I pointed my camera at books and received brief synopses, along with review excerpts from major publications like The New York Times and The Guardian.
- Gaming Help: While playing games, Gemini provided specific guidance for a quest, giving me useful information from online game guides.
- Family History Insights: When showing old family photos, Gemini enriched the experience by sharing details about locations and cultural attire, which made my exploration of my family’s past even more meaningful.
Challenges with Gemini’s Video Capabilities
Despite its many strengths, Gemini Live has its shortcomings. During my testing, I frequently encountered issues where it would get stuck in loops, providing limited information in response to inquiries.
For example, while asking for detailed information about a book, it often provided vague responses, similar to repeating a thesaurus. This pattern of circular responses persisted across various inquiries, making it less reliable for detailed knowledge.
Although Gemini Live with Video is undoubtedly impressive, it is essential to approach it with caution for significant matters, such as legal or medical advice, where accuracy is critical. There are privacy concerns, too, especially regarding sensitive material that you might choose to show the camera.
While it may not be perfect for more serious tasks, its capabilities for everyday, low-stakes situations have made it a noteworthy enhancement for Android users.