DeepSeek announced the release and Watch Take Turns Tasting With College Alumni Onlineopen-source launch of its latest AI model, DeepSeek-V3, via a WeChat post on Tuesday. Users can now interact with the V3 model on DeepSeek’s official website. According to the post, DeepSeek-V3 boasts 671 billion parameters, with 37 billion activated, and was pre-trained on 14.8 trillion tokens. Compared to the V2.5 version, the new model’s generation speed has tripled, with a throughput of 60 tokens per second. Although it currently lacks multi-modal input and output support, DeepSeek-V3 excels in multilingual processing, particularly in algorithmic code and mathematics. In multiple benchmark tests, DeepSeek-V3 outperformed open-source models such as Qwen2.5-72B and Llama-3.1-405B, matching the performance of top proprietary models such as GPT-4o and Claude-3.5-Sonnet. [DeepSeek official WeChat account, in Chinese]
Related Articles
2025-06-26 13:30
2735 views
NYT Connections hints and answers for May 2: Tips to solve 'Connections' #691.
Connectionsis the one of the most popular New York Times word games that's captured the public's att
Read More
2025-06-26 13:00
248 views
California congresswoman tries to dab during U.S. Senate debate
Rep. Loretta Sanchez really wants that young vote. In an probable attempt to court millennial voters
Read More
2025-06-26 12:52
2181 views
Donald Trump issues video statement on that obscene tape
UPDATE: Oct. 8, 2016, 1:09 p.m. EDT Donald Trump told the Wall Street Journal on Saturday that there
Read More