Google has added in-app playback for Audio Overviews to its Gemini mobile apps. The feature, first launched in NotebookLM, came to Gemini earlier this year. Until now, users could generate podcast-style audio conversations by uploading files, but they had to visit the AI chatbot’s website to listen to them because the app lacked a built-in playback option. The latest update removes this limitation by adding an inline media player directly within the app.
The new feature was first reported by 9to5Google and is now accessible to all Gemini users worldwide, including those using the free version of the service. When a user uploads a file to create an Audio Overview, an interactive media player will now appear on their screen, allowing them to easily listen to the generated audio.
Media player in the Gemini app
The new media player features a seeking bar, play/pause controls, buttons for rewinding and skipping ahead by ten seconds, and options to adjust the speech speed between 0.5x and 2x. Moreover, a download button lets users save the file for local playback through third-party applications.
Audio Overviews let Gemini create podcast-like discussions between two AI hosts, one male and one female. The virtual hosts converse, share reactions, and add supplemental information sourced from the internet. Generating an Audio Overview can take up to five minutes, depending on the file size and text length.
The feature gained traction last year when it first appeared in Google’s NotebookLM. Since then, the company has rolled it out to the mobile and web versions of Gemini, and there is speculation that it is testing Audio Overviews in Google Search.