Soket AI Labs Launches Realtime Speech API, Offers Multilingual Capabilities

3 months ago 28
  • Published on January 15, 2025
  • In AI News

The Realtime Speech API boasts ultra-low latency of under 500 milliseconds.

Soket AI Labs Launches Realtime Speech API, Offers Multilingual Capabilities

Soket AI Labs, the Gurugram-based AI startup, has unveiled its Realtime Speech API, aiming to transform AI interactions with voice intelligence and seamless integration. 

Soket AI Labs claims that the Realtime Speech API boasts ultra-low latency of under 500 milliseconds, ensuring near-instantaneous responses for real-time interactions. It supports multilingual capabilities to overcome language barriers and includes advanced functionalities such as tool calling, Retrieval-Augmented Generation (RAG) support, custom voice creation and cloning, and the ability to handle dynamic voice interruptions for natural conversations.

Developers can integrate the API effortlessly within 1-4 weeks using SDKs available for Python and JavaScript. The service is priced competitively at $0.012 per minute, providing an affordable alternative to industry leaders like OpenAI.

Soket AI Labs emphasised the platform’s versatility, highlighting its applications across industries such as banking, financial services, insurance (BFSI), healthcare, and telecommunications. Additional features include fine-tunable models and customisable voice options to meet specific business needs.

The company is set to launch its “Voice Innovators Beta Program” soon, inviting users to explore and shape the future of voice technology.

In a separate post on LinkedIn, Abhishek Upperwal, founder and CEO of Soket AI Labs, highlighted the importance of making ‘General Voice Intelligence’. “Voice is one of the most important interfaces to AI today and language models are at the core of intelligence,” Upperwal said.

In May, the company also launched India’s first open source multilingual foundational model, Pragna-1B. Upperwal said that it took the company six months to train the model, which involved many experiments with different models and a total of 150 billion tokens.

Founded in 2019, Soket AI Labs’ focus was on building a decentralised data exchange for smart cities. However, things changed significantly after OpenAI CEO Sam Altman’s visit to India, which motivated the team to build the best AI models in the country. 

Apart from Soket AI Labs, startups like Sarvam AI and CoRover.ai have been focused heavily on building speech models. Speaking at Cypher 2024, Sarvam AI chief Vivek Raghavan demoed the speech capabilities of its AI models, leaving everyone at Cypher speechless.

Picture of Mohit Pandey

Mohit Pandey

Mohit writes about AI in simple, explainable, and sometimes funny words. He holds keen interest in discussing AI with people building it for India, and for Bharat, while also talking a little bit about AGI.

Association of Data Scientists

GenAI Corporate Training Programs

India's Biggest Developers Summit

February 5 – 7, 2025 | Nimhans Convention Center, Bangalore

Download the easiest way to
stay informed

Chris Miller Chip War

Chip War: The India Chapter

Vandana Nair

“India is arguably the world’s first or second country in terms of chip design talent, right next to the United States,” said Chris Miller, the author of Chip War.

Subscribe to The Belamy: Our Weekly Newsletter

Biggest AI stories, delivered to your inbox every week.

February 5 – 7, 2025 | Nimhans Convention Center, Bangalore

Rising 2025 | DE&I in Tech & AI

Mar 20 and 21, 2025 | 📍 J N Tata Auditorium, Bengaluru

Data Engineering Summit 2025

15-16 May, 2025 | 📍 Taj Yeshwantpur, Bengaluru, India

AI Startups Conference.
April 25 / Hotel Radisson Blu / Bangalore, India

17-19 September, 2025 | 📍KTPO, Whitefield, Bangalore, India

MachineCon GCC Summit 2025

19-20th June 2025 | Bangalore

discord icon

Our Discord Community for AI Ecosystem.

Read Entire Article