next BIG future

next BIG future

Share this post

next BIG future
next BIG future
Is Deepseek Training Lying About Chips Used for Training its AI ?

Is Deepseek Training Lying About Chips Used for Training its AI ?

NextBigFuture's avatar
NextBigFuture
Jan 28, 2025
∙ Paid

Share this post

next BIG future
next BIG future
Is Deepseek Training Lying About Chips Used for Training its AI ?
Share

Altimeter Capital analyst and partner puts what Deepseek claims and results into numbers. $6M Training Costs = Plausible IMO Quick math: Training costs ∝ (active params * tokens). DeepSeek v3 (37B params; 14.8T tokens) vs. Llama3.1 (405B params; 15T tokens) = v3 theoretically should be 9% of Llama3.1’s cost. And the disclosed actual figures aligned ...

R…

Keep reading with a 7-day free trial

Subscribe to next BIG future to keep reading this post and get 7 days of free access to the full post archives.

Already a paid subscriber? Sign in
© 2025 Nextbigfuture
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share