- Last updated October 16, 2024
- In AI News
Benchmarks indicate that the Ministral models outperform competitors, including Gemma 2 2B and Llama 3.2 3B and Llama 3.1 8B
Mistral AI has announced the launch of two new models, Ministral 3B and Ministral 8B, on the first anniversary of its Mistral 7B model. These new models focus on on-device computing and edge applications, enhancing capabilities in areas such as knowledge reasoning and function-calling.
Ministral models can handle up to 128k context length and offer a unique sliding-window attention pattern for efficient inference, especially in resource-constrained environments. They aim to meet the demand for local and privacy-first inference in applications like on-device translation, smart assistants, local analytics, and robotics. The models serve as intermediaries for larger models, improving task routing and API calling across various contexts.
Benchmarks indicate that the Ministral models outperform competitors, including Gemma 2 2B and Llama 3.2 3B and Llama 3.1 8B
Both models are available for commercial use, with pricing set at $0.04 per million tokens for Ministral 3B and $0.1 for Ministral 8B. The model weights for the 8B Instruct model will be available for research use. Last month, Mistral AI launched Pixtral 12B, a model capable of processing both images and text. With approximately 12 billion parameters, it employs vision encoding to interpret images alongside text.
A day after Meta released Llama 3.1, Mistral AI also launched Mistral Large 2, the latest generation of its flagship model, offering substantial improvements in code generation, mathematics, and multilingual support. The model introduces advanced function-calling capabilities and is available on la Plateforme.
Siddharth Jindal
Siddharth is a media graduate who loves to explore tech through journalism and putting forward ideas worth pondering about in the era of artificial intelligence.
Subscribe to The Belamy: Our Weekly Newsletter
Biggest AI stories, delivered to your inbox every week.
Rising 2025 | DE&I in Tech & AI Summit
Mar 20 and 21, 2025 | 📍 J N Tata Auditorium, Bengaluru
Data Engineering Summit 2024
May 30 and 31, 2024 | 📍 Bangalore, India
February 5 – 7, 2025 | Nimhans Convention Center, Bangalore
MachineCon GCC Summit 2024
June 28 2024 | 📍Bangalore, India
September 25-27, 2024 | 📍Bangalore, India
Nov 21-22 2024 | 📍Santa Clara Convention Center, California, USA
Our Discord Community for AI Ecosystem, In collaboration with NVIDIA.
AIM publishes every day, and we believe in quality over quantity, honesty over spin. We offer a wide variety of branding and targeting options to make it easy for you to propagate your brand.
AIM Brand Solutions, a marketing division within AIM, specializes in creating diverse content such as documentaries, public artworks, podcasts, videos, articles, and more to effectively tell compelling stories.
ADaSci Corporate training program on Generative AI provides a unique opportunity to empower, retain and advance your talent
With MachineHack you can not only find qualified developers with hiring challenges but can also engage the developer community and your internal workforce by hosting hackathons.
Conduct Customized Online Assessments on our Powerful Cloud-based Platform, Secured with Best-in-class Proctoring
AIM Research produces a series of annual reports on AI & Data Science covering every aspect of the industry. Request Customised Reports & AIM Surveys for a study on topics of your interest.
© Analytics India Magazine Pvt Ltd & AIM Media House LLC 2024