Is Deepseek Training Lying About Chips Used for Training its AI ?
Altimeter Capital analyst and partner puts what Deepseek claims and results into numbers. $6M Training Costs = Plausible IMO Quick math: Training costs ∝ (active params * tokens). DeepSeek v3 (37B params; 14.8T tokens) vs. Llama3.1 (405B params; 15T tokens) = v3 theoretically should be 9% of Llama3.1’s cost. And the disclosed actual figures aligned ...
Keep reading with a 7-day free trial
Subscribe to next BIG future to keep reading this post and get 7 days of free access to the full post archives.