Cerebras Systems and Cirrascale Cloud Services® Introduce Cerebras AI Model Studio to Train GPT-Class Models with 8x Faster Time to Accuracy, at Half the Price of Traditional Cloud Providers

With Predictable Fixed Pricing, Faster Time to Solution, and Unprecedented Flexibility and Ease of Use, Customers Can Train GPU-Impossible Sequence Lengths and Keep Trained Weights

Cerebras Systems, the pioneer in accelerating artificial intelligence (AI) compute, and Cirrascale Cloud Services®, a provider of deep learning infrastructure solutions for autonomous vehicle, NLP, and computer vision workflows, today announced the availability of the Cerebras AI Model Studio. Hosted on the Cerebras Cloud @ Cirrascale, this new offering enables customers to train generative pre-trained Transformer (GPT)-class models, including GPT-J, GPT-3 and GPT-NeoX, on industry-leading Cerebras Wafer-Scale Clusters, including the newly announced Andromeda AI supercomputer.

Traditional cloud providers struggle with large language models because they cannot guarantee latency between large numbers of GPUs. Variable latency makes distributing a large AI model across GPUs complex and time-consuming, and it causes large swings in time to train. The Cerebras AI Model Studio overcomes these challenges. Setup is quick and easy; clusters of dedicated CS-2s guarantee deterministic latency; and because the clusters rely solely on data parallelism, no distributed compute work is required.
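As a rough illustration of what relying solely on data parallelism means, the toy sketch below gives every worker the same weight and a different shard of the batch, then averages the per-worker gradients. It is a generic, single-process example and not Cerebras code; the function names (`grad_mse`, `data_parallel_grad`), the one-parameter model, and the data are all hypothetical.

```python
# Toy illustration of data parallelism (not Cerebras code).
# Every worker holds the same weight w and sees a different shard of the batch;
# averaging the per-worker gradients reproduces the full-batch gradient.

def grad_mse(w, xs, ys):
    """Gradient of mean squared error with respect to w for the model y = w * x."""
    return sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)

def data_parallel_grad(w, xs, ys, num_workers):
    """Average per-worker gradients over equal-sized shards.

    Assumes len(xs) is divisible by num_workers.
    """
    shard = len(xs) // num_workers
    grads = [
        grad_mse(w, xs[i * shard:(i + 1) * shard], ys[i * shard:(i + 1) * shard])
        for i in range(num_workers)
    ]
    return sum(grads) / num_workers

xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]
w = 0.5

full = grad_mse(w, xs, ys)                               # single-device gradient
sharded = data_parallel_grad(w, xs, ys, num_workers=2)   # two simulated workers
print(full, sharded)
```

With equal-sized shards the two values agree, which is why a purely data-parallel setup does not require splitting the model itself across devices.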

Training large language models (LLMs) is challenging and expensive: multi-billion-parameter models require months to train on clusters of GPUs, plus a team of engineers experienced in distributed programming and hybrid data-model parallelism. It is a multi-million-dollar investment that many organizations simply cannot afford.

| Model | Parameters (B) | Tokens to train to Chinchilla point (B) | Cerebras AI Model Studio days to train | Cerebras AI Model Studio price to train |
|---|---|---|---|---|
| GPT-3 XL | 1.3 | 26 | 0.4 | $2,500 |
| GPT-J | 6 | 120 | 8 | $45,000 |
| GPT-3 6.7B | 6.7 | 134 | 11 | $40,000 |
| T-5 11B | 11 | 34* | 9 | $60,000 |
| GPT-3 13B | 13 | 260 | 39 | $150,000 |
| GPT-NeoX | 20 | 400 | 47 | $525,000 |
| GPT 70B | 70 | 1,400 | 85 | $2,500,000 |
| GPT 175B | 175 | 3,500 | Call for quote | Call for quote |

* The T5 token count is taken from the original T5 paper; Chinchilla scaling laws are not applicable to this model.
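The token counts listed for the GPT-class models are consistent with a rule of thumb of roughly 20 training tokens per parameter. The release does not state that ratio; it is inferred here from the table's own numbers, and the helper name `chinchilla_tokens_b` is hypothetical. A minimal sketch that checks the rule against the table (T5 is omitted because its token count comes from a different source):

```python
import math

# Assumption: about 20 training tokens per parameter, inferred from the table,
# not a figure stated in the release.
TOKENS_PER_PARAM = 20

# (parameters in billions, tokens in billions) copied from the table.
TABLE = {
    "GPT-3 XL": (1.3, 26),
    "GPT-J": (6, 120),
    "GPT-3 6.7B": (6.7, 134),
    "GPT-3 13B": (13, 260),
    "GPT-NeoX": (20, 400),
    "GPT 70B": (70, 1400),
    "GPT 175B": (175, 3500),
}

def chinchilla_tokens_b(params_b: float) -> float:
    """Estimated training tokens (billions) for a model with params_b billion parameters."""
    return TOKENS_PER_PARAM * params_b

for name, (params_b, tokens_b) in TABLE.items():
    estimate = chinchilla_tokens_b(params_b)
    matches = math.isclose(estimate, tokens_b)
    print(f"{name}: estimate {estimate:g}B, table {tokens_b}B, match={matches}")
```

The same function gives a quick estimate for a model size that is not listed, under the same 20-tokens-per-parameter assumption.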

The Cerebras AI Model Studio offers users the ability to train GPT-class models at half the cost of traditional cloud providers and requires only a few lines of code to get going. Users can choose from state-of-the-art GPT-class models, ranging from 1.3 billion parameters up to 175 billion parameters, and complete training with 8x faster time to accuracy than on an A100.

“The new Cerebras AI Model Studio expands our partnership with Cirrascale and further democratizes AI by providing customers with access to multi-billion parameter NLP models on our powerful CS-2 clusters, with predictable, competitive model-as-a-service pricing,” said Andrew Feldman, CEO and co-founder of Cerebras Systems. “Our mission at Cerebras is to broaden access to deep learning and rapidly accelerate the performance of AI workloads. The Cerebras AI Model Studio makes this easy and dead simple – just load your dataset and run a script.”

The Cerebras AI Model Studio offers users cloud access to the Cerebras Wafer-Scale Cluster, which enables GPU-impossible work with first-of-its-kind near-perfect linear scale performance. Users can access up to a 16-node Cerebras Wafer-Scale Cluster and train models using longer sequence lengths of up to 50,000 tokens – a capability only available to Cerebras users – opening up new opportunities for exciting research.

“We are really excited to offer our enterprise, research and academic customers easy, affordable access to the leading CS-2 accelerator to train GPT-class models in less than one day,” said PJ Go, CEO, Cirrascale Cloud Services. “We’ve made the process extremely simple – eliminating the need for dev-ops and distributed programming – with push-button model scaling, from 1 to 20 billion parameters.”

With every component optimized for AI work, the Cerebras Cloud @ Cirrascale delivers more compute performance at less space and less power than any other solution. Depending on workload, from AI to HPC, it delivers hundreds or thousands of times more performance than legacy alternatives, but uses only a fraction of the space and power. Cerebras Cloud is designed to enable fast, flexible training and low-latency datacenter inference, thanks to greater compute density, faster memory, and higher bandwidth interconnect than any other datacenter AI solution.

The Cerebras AI Model Studio is available now. For a limited time, users can sign up for a free 2-day evaluation run. Customers can begin using the Cerebras AI Model Studio by visiting https://cirrascale.com/cerebras/. For more information, please visit https://www.cerebras.net/product-cloud/.

About Cirrascale Cloud Services

Cirrascale Cloud Services is a premier provider of public and private dedicated cloud solutions enabling deep learning workflows. The company offers cloud-based infrastructure solutions for large-scale deep learning operators and service providers, as well as HPC users. To learn more about Cirrascale Cloud Services and its unique cloud offerings, please visit www.cirrascale.com or call (888) 942-3800.

About Cerebras Systems

Cerebras Systems is a team of pioneering computer architects, computer scientists, deep learning researchers, and engineers of all types who have come together to build a new class of computer system. That system is designed for the singular purpose of accelerating AI and changing the future of AI work forever, enabling customers to accelerate their deep learning work by orders of magnitude.

Cirrascale Cloud Services, Cirrascale and the Cirrascale logo are trademarks or registered trademarks of Cirrascale Cloud Services LLC.
