- Last updated September 23, 2024
- In AI News
OpenAI released GPT-4o at its latest Spring Update event earlier this year, winning hearts with its ‘omni’ capabilities across text, vision, and audio.

OpenAI is set to launch ‘Advanced Voice Mode’ on ChatGPT this Tuesday, September 24, 2024, according to a screenshot posted by a user on X.
“As of now, access to Advanced Voice mode is being rolled out in a limited alpha to a select group of users. While being a long-time Plus user and having been selected for SearchGPT are both indicators of your active engagement with our platform, access to the Advanced Voice mode alpha on September 24, 2024, will depend on a variety of factors including but not limited to participation invitations and the specific criteria set for the alpha testing phase,” read the blog post attached in the screenshot.
OpenAI released GPT-4o at its latest Spring Update event earlier this year, which won hearts with its ‘omni’ capabilities across text, vision, and audio. OpenAI’s demos, which included a real-time translator, a coding assistant, an AI tutor, a friendly companion, a poet, and a singer, soon became the talk of the town. However, its Advanced Voice Mode wasn’t released.
When OpenAI recently released o1, one of them queried if they would be launching voice features soon. “How about a couple of weeks of gratitude for magic intelligence in the sky, and then you can have more toys soon?” replied Sam Altman, with a tinge of sarcasm.
However, a couple of weeks later, Kyutai, a French non-profit AI research laboratory, launched Moshi, a real-time native multimodal foundational AI model capable of conversing with humans in real time, much like what OpenAI’s advanced model was intended to do.
Hume AI recently introduced EVI 2, a new foundational voice-to-voice AI model that promises to enhance human-like interactions. Available in beta, EVI 2 can engage in rapid, fluent conversations with users, interpreting tone and adapting its responses accordingly. The model supports a variety of personalities, accents, and speaking styles and includes multilingual capabilities.
Meanwhile, Amazon Alexa is partnering with Anthropic to improve its conversational abilities, making interactions more natural and human-like. Earlier this year, Google launched Astra, an ‘universal AI agent’ built on the Gemini family of AI models. Astra features multimodal processing, enabling it to understand and respond to text, audio, video, and visual inputs simultaneously.
Tanisha Bhattacharjee
Journalist with a passion for art, technological development and travel. Discovering the dynamic world of AI, one article at a time.

Transformers Can Solve Any Problem
Sagar Sharma
With techniques like CoT, we are moving towards explainable AI systems and slowly moving away from models that were prone to blackbox.
Subscribe to The Belamy: Our Weekly Newsletter
Biggest AI stories, delivered to your inbox every week.
Rising 2024 | DE&I in Tech Summit
April 4 and 5, 2024 | 📍 Hilton Convention Center, Manyata Tech Park, Bangalore
Data Engineering Summit 2024
May 30 and 31, 2024 | 📍 Bangalore, India
26 July 2024 | 583 Park Avenue, New York
MachineCon GCC Summit 2024
June 28 2024 | 📍Bangalore, India
Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA
September 25-27, 2024 | 📍Bangalore, India
Our Discord Community for AI Ecosystem, In collaboration with NVIDIA.
World's Biggest Media & Analyst firm specializing in AI
AIM publishes every day, and we believe in quality over quantity, honesty over spin. We offer a wide variety of branding and targeting options to make it easy for you to propagate your brand.
AIM Brand Solutions, a marketing division within AIM, specializes in creating diverse content such as documentaries, public artworks, podcasts, videos, articles, and more to effectively tell compelling stories.
ADaSci Corporate training program on Generative AI provides a unique opportunity to empower, retain and advance your talent
With MachineHack you can not only find qualified developers with hiring challenges but can also engage the developer community and your internal workforce by hosting hackathons.
Conduct Customized Online Assessments on our Powerful Cloud-based Platform, Secured with Best-in-class Proctoring
AIM Research produces a series of annual reports on AI & Data Science covering every aspect of the industry. Request Customised Reports & AIM Surveys for a study on topics of your interest.
© Analytics India Magazine Pvt Ltd & AIM Media House LLC 2024