ETFOptimize | High-performance ETF-based Investment Strategies

Quantitative strategies, Wall Street-caliber research, and insightful market analysis since 1998.


ETFOptimize | HOME
Close Window

Elastic Introduces Native Inference Service in Elastic Cloud

New service to provide GPU-accelerated embedding and retrieval models

Elastic (NYSE: ESTC), the Search AI Company, today announced the Elastic Inference Service (EIS), a GPU-accelerated inference-as-a-service for Elasticsearch semantic search, vector search, and generative AI workflows.

Every generative AI and vector search application relies on inference, and Elastic now delivers these capabilities natively as part of Elastic Cloud. As volumes grow, managing infrastructure, testing models, and handling integrations creates operational overhead that slows teams down. This has created a need for GPU-acceleration and an integrated workflow to provide speed, scalability, and cost efficiency.

“Inference at scale is incredibly important for vector search, semantic search and GenAI workflows,” said Steve Kearns, General Manager, Search at Elastic. “The Elastic Inference Service meets that challenge by providing our customers with an API-based inference service using NVIDIA GPUs with our best-in-class Elasticsearch vector database for low-latency, high-throughput inference.”

Elastic Learned Sparse EncodeR (ELSER) — Elastic’s built-in sparse vector model for state-of-the-art search relevance — is the first text-embedding model available on EIS in technical preview. Support for additional models for multilingual embeddings, reranking, and models from the recently announced Jina acquisition, will be available soon.

Some key benefits for developers who use EIS include:

  • Streamlined developer experience: No model downloads, manual configuration, or resource provisioning. EIS integrates directly with semantic_text and the Inference API for a seamless developer experience.
  • Improved end-to-end semantic search experience: EIS is compatible with sparse vectors, dense vectors, or semantic reranking.
  • Simplified generative AI workflows: AI features for ingest, investigation, detection, and analysis work out of the box, reducing the friction of contracts, API keys, and external services.
  • Backward compatibility: The Open Inference API gives users full flexibility to connect any third-party service, while existing Elasticsearch ML Nodes remain supported during adoption.
  • Enhanced performance: GPU-accelerated inference provides consistent latency and up to 10x higher throughput for ingest compared to CPU-based alternatives.
  • Easy to understand pricing: EIS provides consumption-based pricing similar to other inference services, charged per model per million tokens. It is also easy to get started and access support.
  • Peace of mind: Elastic also provides an intellectual property indemnity for all models provided on EIS.

For additional information on the Elastic Inference Service, read the Elastic blog.

Availability

The Elastic Inference Service is available to use on Serverless and Elastic Cloud Hosted deployments. All CSPs and regions can access the inference endpoints on EIS.

Additional models will be available soon to support a wider variety of search and inference needs.

About Elastic

Elastic (NYSE: ESTC), the Search AI Company, integrates its deep expertise in search technology with artificial intelligence to help everyone transform all of their data into answers, actions, and outcomes. Elastic's Search AI Platform — the foundation for its search, observability, and security solutions — is used by thousands of companies, including more than 50% of the Fortune 500. Learn more at elastic.co.

Elastic and associated marks are trademarks or registered trademarks of elasticsearch BV and its subsidiaries. All other company and product names may be trademarks of their respective owners. The release and timing of any features such as the additional models and region availability or functionality described in this post remain at Elastic's sole discretion. Any features or functionality not currently available may not be delivered on time or at all.

Contacts

Stock Quote API & Stock News API supplied by www.cloudquote.io
Quotes delayed at least 20 minutes.
By accessing this page, you agree to the following
Privacy Policy and Terms Of Service.


 

IntelligentValue Home
Close Window

DISCLAIMER

All content herein is issued solely for informational purposes and is not to be construed as an offer to sell or the solicitation of an offer to buy, nor should it be interpreted as a recommendation to buy, hold or sell (short or otherwise) any security.  All opinions, analyses, and information included herein are based on sources believed to be reliable, but no representation or warranty of any kind, expressed or implied, is made including but not limited to any representation or warranty concerning accuracy, completeness, correctness, timeliness or appropriateness. We undertake no obligation to update such opinions, analysis or information. You should independently verify all information contained on this website. Some information is based on analysis of past performance or hypothetical performance results, which have inherent limitations. We make no representation that any particular equity or strategy will or is likely to achieve profits or losses similar to those shown. Shareholders, employees, writers, contractors, and affiliates associated with ETFOptimize.com may have ownership positions in the securities that are mentioned. If you are not sure if ETFs, algorithmic investing, or a particular investment is right for you, you are urged to consult with a Registered Investment Advisor (RIA). Neither this website nor anyone associated with producing its content are Registered Investment Advisors, and no attempt is made herein to substitute for personalized, professional investment advice. Neither ETFOptimize.com, Global Alpha Investments, Inc., nor its employees, service providers, associates, or affiliates are responsible for any investment losses you may incur as a result of using the information provided herein. Remember that past investment returns may not be indicative of future returns.

Copyright © 1998-2017 ETFOptimize.com, a publication of Optimized Investments, Inc. All rights reserved.