ETFOptimize | High-performance ETF-based Investment Strategies

Quantitative strategies, Wall Street-caliber research, and insightful market analysis since 1998.


ETFOptimize | HOME
Close Window

AI21's Jamba 1.6 Now Available as NVIDIA NIM Microservice for Enterprise-Ready Inference

New availability as part of NVIDIA AI Enterprise expands access to high-performance, secure LLMs for private deployment

TEL AVIV, ISRAEL / ACCESS Newswire / June 12, 2025 / AI21 today announced that its Jamba 1.6 model is now available as a downloadable NVIDIA NIM microservice, supported by the NVIDIA AI Enterprise software platform. The release unlocks faster, easier access for enterprise developers looking to deploy high-performing, secure LLMs (large language models) across public and private infrastructure.

Jamba is AI21's family of LLMs designed for enterprise use, optimized for long-context processing, low latency, and efficient hardware utilization. With this new NIM availability, enterprise teams can now deploy Jamba 1.6 in under five minutes via standardized APIs - whether in cloud environments such as AWS, Azure, or GCP, or on-premises with NVIDIA-accelerated infrastructure.

The release is made possible by NIM microservices, which streamline the integration of a broad range of LLMs into enterprise workflows. This is a major milestone in expanding AI21's footprint within the NVIDIA developer ecosystem and providing a faster path to AI deployment for global enterprises.

"Jamba 1.6 is a benchmark leader for secure, private deployment, especially for enterprises that can't compromise on performance, reliability, or data privacy," said Ori Goshen, AI21's CEO and Co-Founder. "We've seen strong demand for private deployments in financial services, legal, and healthcare, where our customers have struggled to find a model that meets both their security needs and performance expectations. Jamba is that model."

The Jamba 1.6 NIM provides:

  • Plug-and-play deployment through NVIDIA AI Enterprise l with NIM microservices

  • Private and hybrid-cloud flexibility, supporting on-prem, multi-cloud, and regulated environments

  • Best-in-class latency and context window, ideal for AI agents and RAG (retrieval-augmented generation) use cases

  • Enterprise-grade security, delivered through NVIDIA's continuously managed AI Enterprise stack

Jamba is already in use by leading enterprise customers powering high-stakes AI applications across legal document processing, internal knowledge agents, and real-time customer service tools. The new NIM support makes it easier than ever to adopt Jamba in production environments that require trust, transparency, and speed.

AI21's Jamba 1.6 NIM can be deployed using the universal LLM NIM microservice. Get started today at build.nvidia.com.

About AI21

AI21 is a pioneer in Foundation Models and AI Systems designed for enterprises. AI21's mission is to create trustworthy artificial intelligence that powers humanity towards superproductivity. Founded in 2017 by AI visionaries Prof. Amnon Shashua, Prof. Yoav Shoham, and Ori Goshen, AI21 has secured $336 million in funding from industry leaders, including NVIDIA, Google, and Intel, reinforcing its commitment to advancing AI innovation.

Contact Information

Mia Balaban
mia@tellny.com

.

SOURCE: AI21



View the original press release on ACCESS Newswire

Stock Quote API & Stock News API supplied by www.cloudquote.io
Quotes delayed at least 20 minutes.
By accessing this page, you agree to the following
Privacy Policy and Terms Of Service.


 

IntelligentValue Home
Close Window

DISCLAIMER

All content herein is issued solely for informational purposes and is not to be construed as an offer to sell or the solicitation of an offer to buy, nor should it be interpreted as a recommendation to buy, hold or sell (short or otherwise) any security.  All opinions, analyses, and information included herein are based on sources believed to be reliable, but no representation or warranty of any kind, expressed or implied, is made including but not limited to any representation or warranty concerning accuracy, completeness, correctness, timeliness or appropriateness. We undertake no obligation to update such opinions, analysis or information. You should independently verify all information contained on this website. Some information is based on analysis of past performance or hypothetical performance results, which have inherent limitations. We make no representation that any particular equity or strategy will or is likely to achieve profits or losses similar to those shown. Shareholders, employees, writers, contractors, and affiliates associated with ETFOptimize.com may have ownership positions in the securities that are mentioned. If you are not sure if ETFs, algorithmic investing, or a particular investment is right for you, you are urged to consult with a Registered Investment Advisor (RIA). Neither this website nor anyone associated with producing its content are Registered Investment Advisors, and no attempt is made herein to substitute for personalized, professional investment advice. Neither ETFOptimize.com, Global Alpha Investments, Inc., nor its employees, service providers, associates, or affiliates are responsible for any investment losses you may incur as a result of using the information provided herein. Remember that past investment returns may not be indicative of future returns.

Copyright © 1998-2017 ETFOptimize.com, a publication of Optimized Investments, Inc. All rights reserved.