- Last updated September 16, 2024
The organisation has also revamped its website, making datasets, models, and tools more accessible.

AI4Bharat has launched a series of innovations aimed at enhancing Indian language technology, including speech recognition, data annotation, and expressive text-to-speech (TTS).
One of the key releases is IndicASR, India’s first speech recognition model covering all 22 official languages. Also called IndicConformers, it is a comprehensive set of ASR models designed to accurately convert speech to text in each of these languages. A web demo is available for users to test and provide feedback.
Another major release is Anudesh v0.1, an open-source platform designed to improve LLMs for Indian languages through data annotation. This first version facilitates conversational data collection through LLM interactions and supports model evaluation workflows.
Additionally, AI4Bharat has introduced Rasa, a dataset for expressive TTS that currently spans nine languages and features 14 speakers, with at least 20 hours of speech per speaker. The project aims to feature both male and female voices across all 22 official Indian languages. In its initial version, Rasa presents a practical approach to gathering high-quality data for low-resource languages: it focuses on easily accessible neutral speech data, complemented by smaller samples of expressive speech.
The organisation has also revamped its website, making datasets, models, and tools more accessible. Users can now find clear download instructions, usage guidelines, and supporting Colab notebooks for easier integration of AI4Bharat’s resources.
Last month, Sarvam AI also launched Shuka v1, an ASR model that comprises approximately 60 million parameters and was trained on less than 100 hours of audio data.
Mohit Pandey
Mohit dives deep into the AI world to bring out information in simple, explainable, and sometimes funny words.
© Analytics India Magazine Pvt Ltd & AIM Media House LLC 2024