Navigation
Recherche
|
Inception Emerges From Stealth With a New Type of AI Model
jeudi 27 février 2025, 01:20 , par Slashdot
![]() 'What we found is that our models can leverage the GPUs much more efficiently,' Ermon said, referring to the computer chips commonly used to run models in production. 'I think this is a big deal. This is going to change the way people build language models.' Inception offers an API as well as on-premises and edge device deployment options, support for model fine-tuning, and a suite of out-of-the-box DLMs for various use cases. The company claims its DLMs can run up to 10x faster than traditional LLMs while costing 10x less. 'Our 'small' coding model is as good as [OpenAI's] GPT-4o mini while more than 10 times as fast,' a company spokesperson told TechCrunch. 'Our 'mini' model outperforms small open-source models like [Meta's] Llama 3.1 8B and achieves more than 1,000 tokens per second.' Read more of this story at Slashdot.
https://slashdot.org/story/25/02/26/2257224/inception-emerges-from-stealth-with-a-new-type-of-ai-mod...
Voir aussi |
56 sources (32 en français)
Date Actuelle
jeu. 27 févr. - 05:59 CET
|