Home > News > The affordability of DeepSeek is a myth: The revolutionary AI actually cost $1.6 billion to develop

The affordability of DeepSeek is a myth: The revolutionary AI actually cost $1.6 billion to develop

DeepSeek's surprisingly inexpensive AI chatbot challenges industry norms. Initially touted as costing only $6 million to train, DeepSeek V3, a powerful AI model, has become a major competitor, even causing significant stock drops for NVIDIA. This seemingly low cost, however, belies a much larger i
By David
Mar 06,2025

DeepSeek's surprisingly inexpensive AI chatbot challenges industry norms. Initially touted as costing only $6 million to train, DeepSeek V3, a powerful AI model, has become a major competitor, even causing significant stock drops for NVIDIA. This seemingly low cost, however, belies a much larger investment.

DeepSeek TestImage: ensigame.com

DeepSeek V3's success stems from its innovative architecture, incorporating:

  • Multi-token Prediction (MTP): Predicting multiple words simultaneously for improved accuracy and efficiency.
  • Mixture of Experts (MoE): Utilizing 256 neural networks, activating eight for each token, to accelerate training and enhance performance.
  • Multi-head Latent Attention (MLA): Repeatedly extracting key details to minimize information loss and capture crucial nuances.

DeepSeek V3Image: ensigame.com

Contrary to initial claims, SemiAnalysis revealed DeepSeek's infrastructure comprises approximately 50,000 Nvidia GPUs (including H800, H100, and H20 units) across multiple data centers, representing a ~$1.6 billion investment and ~$944 million in operational expenses. This contrasts sharply with the publicized $6 million pre-training cost, which excludes research, refinement, data processing, and overall infrastructure.

DeepSeek, a subsidiary of High-Flyer, a Chinese hedge fund, owns its data centers, unlike cloud-reliant competitors, granting greater control and faster innovation. Its self-funded nature also boosts agility. High salaries (over $1.3 million annually for some researchers) attract top Chinese talent, excluding foreign specialists.

DeepSeekImage: ensigame.com

While DeepSeek's overall investment exceeds $500 million, its lean structure enables efficient innovation. The $6 million figure is misleading; the true cost is far higher. However, even with the corrected costs, DeepSeek's model training remains significantly cheaper than competitors like ChatGPT4o ($100 million vs. DeepSeek's estimated $5 million for R1).

DeepSeekImage: ensigame.com

DeepSeek's success highlights the competitive potential of well-funded, independent AI companies. While the "budget-friendly" narrative is exaggerated, its achievements are undeniable, demonstrating that significant investment, technological advancements, and a strong team are key to success in the AI market.

Top News

Copyright ruanh.com © 2024 — All rights reserved