Better AI Inference Stock to Own: Nvidia or Cerebras?

While large language model (LLM) training dominated the first phase of artificial intelligence (AI), inference is eventually expected to become the much larger market. While LLM training is compute-heavy and more technically challenging, inference tends to be memory-centric and needs to be more cost-efficient given that it’s an ongoing process. Traditionally, graphics processing units (GPUs)…


Better AI Inference Stock to Own: Nvidia or Cerebras?

While large language model (LLM) training dominated the first phase of artificial intelligence (AI), inference is eventually expected to become the much larger market.

While LLM training is compute-heavy and more technically challenging, inference tends to be memory-centric and needs to be more cost-efficient given that it’s an ongoing process. Traditionally, graphics processing units (GPUs) and other AI accelerators are packaged with high-bandwidth memory (HBM) to help optimize their performance in this area.

Will AI create the world’s first trillionaire?ย Our team just released a report on the one little-known company, called an “Indispensable Monopoly” providing the critical technology Nvidia and Intel both need.ย Continue ยป

However, Nvidia (NASDAQ: NVDA), through its recent “acquisition” of Groq, and Cerebras Systems (NASDAQ: CBRS) are now looking toward on-chip SRAM (static random-access memory) to speed up AI workloads for inference. This is a new approach, and both companies are using SRAM in a much different way. While using SRAM can dramatically increase inference speeds, it is physically bulky, which creates some trade-offs between chip size, memory capacity, and the data center infrastructure required to power and cool the chips.

Let’s look at the two approaches and see which semiconductor stock looks better positioned to become the inference market leader.

Cerebras and Nvidia logos.
Image source: The Motley Fool.

Cerebras: Is bigger better?

To deal with the physical bulkiness of SRAM, Cerebras creates massive wafer-sized chips that can fit both a large amount of computing power and SRAM onto a single chip. However, this comes with additional issues that need to be addressed.

The first is that the chip manufacturing process is complex, and defects are common. The reason Taiwan Semiconductor Manufacturing has become a virtual monopoly in advanced chip manufacturing is that it can produce advanced chips at high yields, but even its goal for its newest technology is a yield of around 80%. When you’re looking at very expensive, wafer-sized chips, though, that type of yield doesn’t cut it. To address this issue, Cerebras adds extra cores to help it work around any defects to its chips.

In addition, its chips need special cooling and power management, which is why it doesn’t sell them individually, instead only selling or renting them as part of its complete end-to-end server rack CS-3 system. While the company boasts that its systems can perform inference 15 times faster than a GPU, everything involved leads to a very expensive premium solution.

Source link