Navigation
Recherche
|
Google Offers Its AI Watermarking Tech As Free Open Source Toolkit
jeudi 24 octobre 2024, 23:00 , par Slashdot
Google uses a version of SynthID to watermark audio, video, and images generated by its multimodal AI systems, with differing techniques that are explained briefly in this video. But in a new paper published in Nature, Google researchers go into detail on how the SynthID process embeds an unseen watermark in the text-based output of its Gemini model. The core of the text watermarking process is a sampling algorithm inserted into an LLM's usual token-generation loop (the loop picks the next word in a sequence based on the model's complex set of weighted links to the words that came before it). Using a random seed generated from a key provided by Google, that sampling algorithm increases the correlational likelihood that certain tokens will be chosen in the generative process. A scoring function can then measure that average correlation across any text to determine the likelihood that the text was generated by the watermarked LLM (a threshold value can be used to give a binary yes/no answer). Read more of this story at Slashdot.
https://news.slashdot.org/story/24/10/24/206215/google-offers-its-ai-watermarking-tech-as-free-open-...
Voir aussi |
56 sources (32 en français)
Date Actuelle
sam. 23 nov. - 11:23 CET
|