- Last updated October 21, 2024
- In AI News
Developers can utilize the Granite 3.0 8B Instruct model for a range of natural language applications, including text generation, classification, summarization, entity extraction, and customer service chatbots.
IBM has launched Granite 3.0, the latest generation of its large language models (LLMs) for enterprise applications. The Granite 3.0 collection includes several models, highlighted by the Granite 3.0 8B Instruct, which has been trained on over 12 trillion tokens across multiple languages.
The Granite 3.0 8B Instruct model is intended for enterprise use, demonstrating competitive performance against similar models while excelling in specific business tasks. IBM claims that on academic benchmarks included in Hugging Face’s OpenLLM Leaderboard v2, Granite 3.0 8B Instruct rivals similarly sized models from Meta and Mistral AI.
Fine-tuning options through InstructLab allow organisations to customise models to their needs, potentially reducing costs. All Granite models are released under the Apache 2.0 license, with detailed disclosures of training datasets and methodologies included in the accompanying technical paper.
The Granite 3.0 release includes:
- General Purpose LLMs: Granite-3.0-8B-Instruct, Granite-3.0-8B-Base, Granite-3.0-2B-Instruct, and Granite-3.0-2B-Base.
- Guardrail Models: Granite-Guardian-3.0-8B and Granite-Guardian-3.0-2B for monitoring input and output risks.
- Mixture of Experts (MoE) Models: Granite-3.0-3B-A800M-Instruct and Granite-3.0-1B-A400M-Instruct for efficient inference.
- Speculative Decoder: Granite-3.0-8B-Instruct-Accelerator for faster token generation.
Developers can utilize the Granite 3.0 8B Instruct model for a range of natural language applications, including text generation, classification, summarization, entity extraction, and customer service chatbots. The model also supports programming tasks such as code generation, code explanation, and code editing, as well as agentic use cases that require tool calling.
Upcoming updates planned for 2024 will increase model context windows to 128K tokens and introduce multimodal capabilities. The Granite 3.0 models are available for commercial use on the IBM watsonx platform and through partners like Google Cloud, Hugging Face, and NVIDIA.
IBM emphasises safety and transparency in AI, with Granite 3.0 models incorporating robust safety features and extensive training data filtering to mitigate risks. The Granite Guardian models enhance input and output management across various dimensions, outperforming existing models in key safety benchmarks.
IBM’s new models leverage innovative training techniques, including the use of the Data Prep Kit for efficient data processing and a power scheduler for optimised learning rates. This enables faster convergence to optimal model weights while minimizing training costs.
Granite 3.0 language models were trained on Blue Vela, powered entirely by renewable energy, reinforcing IBM’s commitment to sustainability in AI development.
Siddharth Jindal
Siddharth is a media graduate who loves to explore tech through journalism and putting forward ideas worth pondering about in the era of artificial intelligence.
Subscribe to The Belamy: Our Weekly Newsletter
Biggest AI stories, delivered to your inbox every week.
Rising 2025 | DE&I in Tech & AI Summit
Mar 20 and 21, 2025 | 📍 J N Tata Auditorium, Bengaluru
Data Engineering Summit 2024
May 30 and 31, 2024 | 📍 Bangalore, India
February 5 – 7, 2025 | Nimhans Convention Center, Bangalore
MachineCon GCC Summit 2024
June 28 2024 | 📍Bangalore, India
September 25-27, 2024 | 📍Bangalore, India
25 July 2025 | 583 Park Avenue, New York
Our Discord Community for AI Ecosystem, In collaboration with NVIDIA.
AIM publishes every day, and we believe in quality over quantity, honesty over spin. We offer a wide variety of branding and targeting options to make it easy for you to propagate your brand.
AIM Brand Solutions, a marketing division within AIM, specializes in creating diverse content such as documentaries, public artworks, podcasts, videos, articles, and more to effectively tell compelling stories.
ADaSci Corporate training program on Generative AI provides a unique opportunity to empower, retain and advance your talent
With MachineHack you can not only find qualified developers with hiring challenges but can also engage the developer community and your internal workforce by hosting hackathons.
Conduct Customized Online Assessments on our Powerful Cloud-based Platform, Secured with Best-in-class Proctoring
AIM Research produces a series of annual reports on AI & Data Science covering every aspect of the industry. Request Customised Reports & AIM Surveys for a study on topics of your interest.
© Analytics India Magazine Pvt Ltd & AIM Media House LLC 2024