OpenAI Set to Launch Advanced Voice Mode on ChatGPT Soon 

6 months ago 41
  • Last updated September 23, 2024
  • In AI News

OpenAI released GPT-4o at its latest Spring Update event earlier this year, winning hearts with its ‘omni’ capabilities across text, vision, and audio.

OpenAI is set to launch ‘Advanced Voice Mode’ on ChatGPT this Tuesday, September 24, 2024, according to a screenshot posted by a user on X.

“As of now, access to Advanced Voice mode is being rolled out in a limited alpha to a select group of users. While being a long-time Plus user and having been selected for SearchGPT are both indicators of your active engagement with our platform, access to the Advanced Voice mode alpha on September 24, 2024, will depend on a variety of factors including but not limited to participation invitations and the specific criteria set for the alpha testing phase,” read the blog post attached in the screenshot.

OpenAI released GPT-4o at its latest Spring Update event earlier this year, which won hearts with its ‘omni’ capabilities across text, vision, and audio. OpenAI’s demos, which included a real-time translator, a coding assistant, an AI tutor, a friendly companion, a poet, and a singer, soon became the talk of the town. However, its Advanced Voice Mode wasn’t released. 

When OpenAI recently released o1, one of them queried if they would be launching voice features soon. “How about a couple of weeks of gratitude for magic intelligence in the sky, and then you can have more toys soon?” replied Sam Altman, with a tinge of sarcasm. 

However, a couple of weeks later, Kyutai, a French non-profit AI research laboratory, launched Moshi, a real-time native multimodal foundational AI model capable of conversing with humans in real time, much like what OpenAI’s advanced model was intended to do. 

Hume AI  recently  introduced EVI 2, a new foundational voice-to-voice AI model that promises to enhance human-like interactions. Available in beta, EVI 2 can engage in rapid, fluent conversations with users, interpreting tone and adapting its responses accordingly. The model supports a variety of personalities, accents, and speaking styles and includes multilingual capabilities. 

Meanwhile, Amazon Alexa is partnering with Anthropic to improve its conversational abilities, making interactions more natural and human-like. Earlier this year, Google launched Astra, an ‘universal AI agent’ built on the Gemini family of AI models. Astra features multimodal processing, enabling it to understand and respond to text, audio, video, and visual inputs simultaneously.

Picture of Tanisha Bhattacharjee

Tanisha Bhattacharjee

Journalist with a passion for art, technological development and travel. Discovering the dynamic world of AI, one article at a time.

Association of Data Scientists

Tailored Generative AI Training for Your Team

Upcoming Large format Conference

Sep 25-27, 2024 | 📍 Bangalore, India

Download the easiest way to
stay informed

Transformers Can Solve Any Problem

Sagar Sharma

With techniques like CoT, we are moving towards explainable AI systems and slowly moving away from models that were prone to blackbox.

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

Rising 2024 | DE&I in Tech Summit

April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore

Data Engineering Summit 2024

May 30 and 31, 2024 | 📍 Bangalore, India

26 July 2024 | 583 Park Avenue, New York

MachineCon GCC Summit 2024

June 28 2024 | 📍Bangalore, India

Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA

September 25-27, 2024 | 📍Bangalore, India

discord icon

Our Discord Community for AI Ecosystem, In collaboration with NVIDIA. 

World's Biggest Media & Analyst firm specializing in AI

AIM publishes every day, and we believe in quality over quantity, honesty over spin. We offer a wide variety of branding and targeting options to make it easy for you to propagate your brand.

AIM Brand Solutions, a marketing division within AIM, specializes in creating diverse content such as documentaries, public artworks, podcasts, videos, articles, and more to effectively tell compelling stories.

ADaSci Corporate training program on Generative AI provides a unique opportunity to empower, retain and advance your talent

With MachineHack you can not only find qualified developers with hiring challenges but can also engage the developer community and your internal workforce by hosting hackathons.

Conduct Customized Online Assessments on our Powerful Cloud-based Platform, Secured with Best-in-class Proctoring

AIM Research produces a series of annual reports on AI & Data Science covering every aspect of the industry. Request Customised Reports & AIM Surveys for a study on topics of your interest.

Immerse yourself in AI and business conferences tailored to your role, designed to elevate your performance and empower you to accomplish your organization’s vital objectives.

© Analytics India Magazine Pvt Ltd & AIM Media House LLC 2024

Read Entire Article