Navigation
Recherche
|
Tencent’s New DeepSeek Competitor Looks Promising Based on Key AI Benchmarks
samedi 1 mars 2025, 18:49 , par eWeek
Headquartered in Shenzhen, China, the team with Tencent recently unveiled their new AI platform called Hunyuan Turbo S. Designed specifically as a competitor to DeepSeek, which was also created by a Chinese AI company, Tencent hopes its generative AI platform will help it gain recognition amongst the top AI companies in the world.
Hunyuan Turbo S is capable of replying to user inputs and queries within one second, which is even faster than DeepSeek-R1, according to the company and as reported by Reuters. We haven’t found any speed benchmarks that confirm Tencent’s claim. How Hunyuan Turbo S compares to competitors in the benchmarks According to benchmarks provided by Tencent as reported by WinBuzzer, Hunyuan Turbo S leads many competitors in a variety of areas. The following benchmarks are commonly used to evaluate the functionality, efficiency, and accuracy of large language models (LLMs). Chinese: Hunyuan Turbo S ranks the highest in Chinese language benchmarks performed by CMMLU, but DeepSeek-R1-Zero leads in C-Eval’s benchmarks. Alignment: Although Hunyuan Turbo S outperforms GPT-4o, Claude 3.5, Llama 3.1, and DeepSeek-V3 in benchmarks from LiveBench, it lags slightly behind Claude 3.5 in benchmarks from IF-Eval. Some of Hunyuan Turbo S’s weaknesses include: Math: Hunyuan Turbo S outperforms GPT-4o, Claude 3.5, Llama 3.1, and DeepSeek-V3 in some benchmarks, but DeepSeek-R1-Zero leads them all as scored by AIME 2024 and MATH. Knowledge: Hunyuan Turbo S ranks fairly high on most knowledge benchmarks, but it doesn’t quite match up to DeepSeek-R1-Zero in the benchmarks from MMLU, MMLU-Pro, and SimpleQA. Reasoning: Hunyuan Turbo S only ranks third-highest, behind GPT-4o and Claude 3.5, on BBH’s reasoning benchmarks. Code: While HumanEval has Hunyuan Turbo S sitting right behind Claude for coding capabilities, it’s a bit further behind DeepSeek-V3, DeepSeek-R1-Zero, and GPT-4o on LiveCodeBench’s results. While Hunyuan Turbo S is the clear winner in certain cases, it still falls behind DeepSeek-R1-Zero in several instances. A strong competitor in the AI race Tencent’s new Hunyuan Turbo S platform solidifies the Chinese tech giant’s position in the race to develop the fastest and most powerful AI platform. Although it’s not Tencent’s first foray into the world of generative AI tools, it is the company’s most noteworthy entry to date – and it’s certainly one to watch over the coming weeks, months, and years. The post Tencent’s New DeepSeek Competitor Looks Promising Based on Key AI Benchmarks appeared first on eWEEK.
https://www.eweek.com/artificial-intelligence/tencent-hunyuan-turbo-s-deepseek-competitor-benchmarks
Voir aussi |
56 sources (32 en français)
Date Actuelle
mer. 5 mars - 19:31 CET
|