Meet DeepSeek: the Chinese start-up that is changing how AI models are trained

January 1, 2025

1 View

SaveSavedRemoved 0

Meet DeepSeek: the Chinese start-up that is changing how AI models are trained

Chinese language start-up DeepSeek has emerged as “the largest darkish horse” within the open-source giant language mannequin (LLM) enviornment in 2025, simply days after the agency made waves within the international artificial intelligence (AI) group with its newest launch.

That evaluation got here from Jim Fan, a senior analysis scientist at Nvidia and lead of its AI Brokers Initiative, in a New Yr’s Day submit on social-media platform X, following the Hangzhou-based start-up’s launch final week of its namesake LLM, DeepSeek V3.

“[The new AI model] exhibits that useful resource constraints pressure you to reinvent your self in spectacular methods,” Fan wrote, referring to how DeepSeek developed the product at a fraction of the capital outlay that different tech corporations spend money on constructing LLMs.

DeepSeek V3 comes with 671 billion parameters and was educated in round two months at a value of US$5.58 million, utilizing considerably fewer computing sources than fashions developed by larger tech corporations comparable to Facebook mum or dad Meta Platforms and ChatGPT creator OpenAI.

LLM refers back to the know-how underpinning generative AI companies comparable to ChatGPT. In AI, a excessive variety of parameters is pivotal in enabling an LLM to adapt to extra complicated knowledge patterns and make exact predictions. Open supply provides public entry to a software program program’s supply code, permitting third-party builders to switch or share its design, repair damaged hyperlinks or scale up its capabilities.

Jim Fan, a senior research scientist at semiconductor design giant Nvidia, says he has been closely following developments at artificial intelligence start-up DeepSeek. Photo: SCMP — Jim Fan, a senior analysis scientist at semiconductor design large Nvidia, says he has been carefully following developments at synthetic intelligence start-up DeepSeek. Photograph: SCMP

DeepSeek’s growth of a robust LLM at much less value than what larger corporations spend exhibits how far Chinese language AI corporations have progressed, regardless of US sanctions which have largely blocked their entry to superior semiconductors used for coaching fashions.

Meet DeepSeek: the Chinese start-up that is changing how AI models are trained

Our Favorite Fitness Apps for 2025, Tested and Reviewed

What’s new in January 2025: 7 upcoming games worth checking out

Google Pixel 10 Pro Concept Teases a Vertical Camera Bump, Larger Displays, and a Tensor G5 Chip

Peter Voss and the quest for Artificial General Intelligence

I Just Spent The Past 15 Minutes With My Jaw On The Floor After Looking At All These Fascinating Pictures And Now I Need You To See Them

Top 6 innovations shaping tomorrow’s tech