OpenAI has unveiled a new feature for ChatGPT that lets the AI vocalize its responses using text-to-speech technology. The capability, known as “Read Aloud,” is now accessible in ChatGPT on the web and in the iOS and Android mobile apps. The feature works with both the GPT-4 and GPT-3.5 models and supports 37 languages, extending the tool’s usefulness to a diverse global user base.
Functionality and User Experience
ChatGPT’s Read Aloud function is designed with convenience in mind, offering users a choice of five voices for reading the AI-generated responses out loud. It is particularly useful for people engaged in tasks that preclude reading from a screen, giving them a hands-free way to follow the AI’s answers. Language detection is automatic, so the spoken response matches the language of the text. In the mobile apps, tapping and holding a message reveals the Read Aloud player, through which users can play, pause, or rewind the audio. Desktop users can activate the feature by clicking the speaker icon displayed below the message.
ChatGPT can now read responses to you.
On iOS or Android, tap and hold the message and then tap “Read Aloud”. We’ve also started rolling on web – click the "Read Aloud" button below the message. pic.twitter.com/KevIkgAFbG
— OpenAI (@OpenAI) March 4, 2024
Comparative Developments and Future Implications
The Read Aloud feature arrives as OpenAI competes directly with Anthropic, which recently expanded its own multimodal offering by introducing three Claude 3 AI models. The update also follows the September 2023 launch of a voice chat feature in ChatGPT, which lets users talk to the AI directly instead of typing. Together, these additions underscore OpenAI’s commitment to enhancing the multimodal capabilities of its AI systems, which allow interaction through multiple mediums.
As the technology evolves, the ability of AI to interpret and respond in various languages and formats is becoming crucial for accessibility and user convenience. OpenAI’s continuing improvements to ChatGPT reflect a broader trend toward making AI communication more natural and intuitive, closer to human interaction. This series of updates enriches the user experience and paves the way for more sophisticated applications of AI in daily life.
Last Updated on November 7, 2024 9:56 pm CET