Google to enhance Gemini Live with voice-enabled file interaction support.
Google is reportedly expanding the capabilities of its Gemini chatbot, introducing voice-based interaction with uploaded files in its Gemini Live feature.
Initially launched for Gemini Advanced subscribers at the Google I/O event, this AI-powered functionality allows two-way verbal conversations, offering a hands-free experience that could now extend to document and spreadsheet discussions.
The update was first noted during an APK teardown of the Google app beta version 15.45.33.ve. arm64 by Android Authority, uncovering code suggesting Gemini Live may soon enable users to interact verbally with uploaded files.
Gemini Live Support for Uploaded Files Uncovered in APK Teardown
Android Authority’s analysis of the latest Google app beta revealed several strings of code pointing toward this upgrade.
Phrases like “Open Live,” “Talk about attachment,” and “Open Live with attachment” were identified, hinting that Gemini Live’s new functionality will enable users to discuss uploaded files via voice.
Currently, file interaction in Gemini is text-based, but this potential update could transform how users engage with complex documents, making it easier to extract insights without being restricted to typed queries.
Voice-Enabled Document Conversations Target Paid Subscribers
While this development could simplify interactions with text-heavy files, it’s expected to remain limited to Gemini Advanced subscribers.
At present, only these subscribers can upload files and interact with the AI about them, making the upcoming Gemini Live support for files likely exclusive to paying Android users.
Accessible through the Google One AI Premium plan at a monthly cost of Rs. 1,950, Gemini Live was initially launched for paid subscribers in August, followed by a broader rollout to Android users in September.
Gemini Live’s Language Support Expands Accessibility
Gemini Live’s two-way conversational feature already supports Hindi and eight regional Indian languages, expanding its appeal to a diverse user base in India.
By adding file-based voice interaction, Google aims to enhance user convenience, potentially allowing faster responses to complex queries within documents.
The proposed feature could revolutionize productivity for users needing immediate insights from their files, further distinguishing Gemini as a versatile AI tool in Google’s ecosystem.
If launched, the update would make Gemini Live a more comprehensive tool, providing voice-driven, hands-free interaction with uploaded content- a substantial leap forward in AI convenience and accessibility.