Waymo Explores Using Google's Gemini To Train Its Robotaxis
Saturday, November 2, 2024, 01:10, by Slashdot
The paper outlines how, historically, autonomous driving systems have developed specific 'modules' for their various functions, including perception, mapping, prediction, and planning. This approach has proven useful for many years but has problems scaling 'due to the accumulated errors among modules and limited inter-module communication.' Moreover, these modules can struggle to respond to 'novel environments' because they are, by nature, 'pre-defined,' which makes them hard to adapt.

Waymo says that multimodal large language models (MLLMs) like Gemini present an interesting solution to some of these challenges for two reasons: the chatbot is a 'generalist' trained on vast sets of data scraped from the internet 'that provide rich "world knowledge" beyond what is contained in common driving logs'; and such models demonstrate 'superior' reasoning capabilities through techniques like 'chain-of-thought reasoning,' which mimics human reasoning by breaking down complex tasks into a series of logical steps.

Waymo developed EMMA as a tool to help its robotaxis navigate complex environments. The company identified several situations in which the model helped its driverless cars find the right route, such as when they encountered animals or construction in the road. But EMMA also has its limitations, and Waymo acknowledges that further research will be needed before the model is put into practice. For example, EMMA couldn't incorporate 3D sensor inputs from lidar or radar, which Waymo said was 'computationally expensive.' It could also process only a small number of image frames at a time. There are also risks to using MLLMs to train robotaxis that go unmentioned in the research paper: chatbots like Gemini often hallucinate or fail at simple tasks like reading clocks or counting objects.
https://tech.slashdot.org/story/24/11/01/2150228/waymo-explores-using-googles-gemini-to-train-its-ro...