Navigation
Recherche
|
Mistral Releases Pixtral 12B, Its First-Ever Multimodal AI Model
jeudi 12 septembre 2024, 02:02 , par Slashdot
When an X user asked [Sophia Yang, the head of developer relations at the company] what makes the Pixtral 12-billion parameter model unique, she said it will natively support an arbitrary number of images of arbitrary sizes. As shared by initial testers on X, the 24GB model's architecture appears to have 40 layers, 14,336 hidden dimension sizes and 32 attention heads for extensive computational processing. On the vision front, it has a dedicated vision encoder with 1024x1024 image resolution support and 24 hidden layers for advanced image processing. This, however, can change when the company makes it available via API. Read more of this story at Slashdot.
https://slashdot.org/story/24/09/11/2241236/mistral-releases-pixtral-12b-its-first-ever-multimodal-a...
Voir aussi |
56 sources (32 en français)
Date Actuelle
jeu. 19 sept. - 18:33 CEST
|