OpenAI announced Monday that ChatGPT has now acquired the capability to engage in conversational interactions with users. This enhanced interface, featuring voice and image functionalities, promises a more intuitive and versatile user experience, the artificial intelligence powerhouse said.
OpenAI emphasized that these conversations with ChatGPT will be dynamic and interactive, allowing users to engage in back-and-forth exchanges. Users will be able to converse with the AI on-the-go, request bedtime stories for their families, or even resolve dinner table debates.
Voice conversations with ChatGPT are set to roll out to Plus and Enterprise users within the next two weeks. What sets this apart is OpenAI’s collaboration with voice actors to craft a text-to-speech model that generates remarkably human-like audio.
Peter Deng, Vice President of Consumer Products at OpenAI, underscored the significance of this move, telling the Washington Post: “One of the hardest jobs is taking that amazing technology and translating it into the simplicity that the next 300-400 million people are looking for.”
Yet, OpenAI acknowledges potential risks, such as the threat of malicious actors impersonating public figures or engaging in fraudulent activities. However, the company is quick to emphasize that the technology has been designed with the same principles as voice assistants like Amazon’s Alexa and Apple’s Siri.
This transformative technology has already caught the attention of industry players. Spotify, for instance, is harnessing ChatGPT’s power for its Voice Translation feature, aiming to expand the reach of podcasters by translating content into multiple languages using their own voices.
But that’s not all—OpenAI is taking things a step further. They’re introducing image capabilities across all platforms. Users can now upload multiple photos and even use a drawing tool to highlight specific areas within an image, providing context and enhancing the AI’s understanding.
For instance, users can capture a picture of a landmark while traveling and engage in a live conversation about its interesting features. Similarly, they can photograph their fridge and pantry at home to determine dinner options, making the AI an integral part of their daily lives.