Thursday, November 28

Google’s AI surprise: Gemini Live speaks like a human, handling ChatGPT Advanced Voice Mode

August 13, 2024 11:17 AM

Credit: VentureBeat made with ChatGPT

Join our day-to-day and weekly newsletters for the current updates and unique material on industry-leading AI protection. Discover more

Google often seems like it’s playing catchup in the generative AI race to competitors such as Meta, OpenAI, Anthropic and Mistral– however not any longer.

Today, the business leapfrogged most others by revealing Gemini Live, a brand-new voice mode for its AI design Gemini through the Gemini mobile app, which permits users to talk to the design in plain, conversational language and even disrupt it and have it react back with the AI’s own humanlike voice and cadence. Or as Google put it in a post on X: “You can now have a free-flowing discussion, and even disrupt or alter subjects similar to you may on a routine telephone call.”

We’re presenting Gemini Live, a more natural method to communicate with Gemini. You can now have a free-flowing discussion, and even disrupt or alter subjects much like you may on a routine call. Readily Available to Gemini Advanced customers. #MadeByGoogle pic.twitter.com/eNjlNKubsv

— Google (@Google) August 13, 2024

If that sounds familiar, it’s since OpenAI in May demoed its own “Advanced Voice Mode” for ChatGPT which it honestly compared to the talking AI os from the film Herjust to postpone the function and start to roll it out just selectively to alpha individuals late last month.

Gemini Live is now readily available in English on the Google Gemini app for Android gadgets through a Gemini Advanced membership ($19.99 USD monthly), with an iOS variation and assistance for more languages to follow in the coming weeks.

To put it simply: despite the fact that OpenAI displayed a comparable function initially, Google is set to make it more offered to a much larger possible audience (more than 3 billion active users on Android and 2.2 billion iOS gadgets) rather than ChatGPT’s Advanced Voice Mode.

Part of the factor OpenAI might have postponed ChatGPT Advanced Voice Mode was due to its own internal “red-teaming” or managed adversarial security screening that revealed the voice mode in specific in some cases engaged in odd, perplexing, and even possibly unsafe habits such as simulating the user’s own voice without permission– which might be utilized for scams or harmful functions.

How is Google resolving the possible damages triggered by this kind of tech? We do not actually understand yet, however VentureBeat connected to the business to ask and will upgrade when we hear back.

What is Gemini Live helpful for?

Google pitches Gemini Live as providing free-flowing, natural discussion that’s excellent for conceptualizing concepts, getting ready for crucial discussions, or just talking delicately about “different subjects.” Gemini Live is created to react and adjust in real-time.

Furthermore, this function can run hands-free, permitting users to continue their interactions even when their gadget is locked or running other apps in the background.

ยป …
Learn more