Navigation
Recherche
|
Google rolls out Vertex AI RAG Engine
vendredi 17 janvier 2025, 01:55 , par InfoWorld
Google has formally introduced Vertex AI RAG Engine, a developer tool aimed at streamlining the complex process of retrieving relevant information from a knowledge base and feeding it to an LLM (large language model).
Introduced in a January 15 blog post as a component of the Vertex AI platform, Vertex AI RAG Engine is a managed orchestration service and data framework for developing context-augmented LLM applications. In elaborating on the Vertex AI RAG Engine, Google said generative AI and LLMs are transforming industries, but that challenges such as hallucinations (generating incorrect or nonsensical information) and limited knowledge beyond training data can hinder enterprise adoption. Vertex AI RAG Engine implements retrieval-augmented generation to empower software and AI developers to build grounded, generative AI solutions. Google noted the following key advantages of Vertex AI RAG Engine: Ease of use, with developers able to get started via an API enabling rapid prototyping and experimentation. Managed orchestration, to handle data retrieval and LLM integration. Customization and open source support, with developers able to choose from parsing, chunking, annotation, embedding, vector storage, and open source models. Developers also can customize their own components. Integration flexibility, to connect to various vector databases such as Pinecone and Weaviate, or use Vertex AI Search. In the introductory blog post, Google cited industry use cases for Vertex AI RAG Engine in financial services, health care, and legal. The post also provided links to resources including a getting started notebook, example integrations with Vertex AI Vector Search, Vertex AI Feature Store, Pinecone, and Weaviate, and a guide to hyperparameter tuning for retrieval with RAG Engine.
https://www.infoworld.com/article/3804343/google-rolls-out-vertex-ai-rag-engine.html
Voir aussi |
56 sources (32 en français)
Date Actuelle
ven. 17 janv. - 14:56 CET
|