MacMusic  |  PcMusic  |  440 Software  |  440 Forums  |  440TV  |  Zicos
crawlers
Recherche

AI Crawlers Haven't Learned To Play Nice With Websites

mercredi 19 mars 2025, 18:25 , par Slashdot
AI Crawlers Haven't Learned To Play Nice With Websites
SourceHut, an open-source-friendly git-hosting service, says web crawlers for AI companies are slowing down services through their excessive demands for data. From a report: 'SourceHut continues to face disruptions due to aggressive LLM crawlers,' the biz reported Monday on its status page. 'We are continuously working to deploy mitigations. We have deployed a number of mitigations which are keeping the problem contained for now. However, some of our mitigations may impact end-users.'

SourceHut said it had deployed Nepenthes, a tar pit to catch web crawlers that scrape data primarily for training large language models, and noted that doing so might degrade access to some web pages for users. 'We have unilaterally blocked several cloud providers, including GCP [Google Cloud] and [Microsoft] Azure, for the high volumes of bot traffic originating from their networks,' the biz said, advising administrators of services that integrate with SourceHut to get in touch to arrange an exception to the blocking.

Read more of this story at Slashdot.
https://slashdot.org/story/25/03/19/1027251/ai-crawlers-havent-learned-to-play-nice-with-websites?ut...

Voir aussi

News copyright owned by their original publishers | Copyright © 2004 - 2025 Zicos / 440Network
Date Actuelle
mer. 19 mars - 23:37 CET