Because I don’t want my material used for training LLM’s.
Good luck with that. Sure, maybe OpenAI respects your robots.txt and doesn’t scan you, but I seriously doubt everyone is going to abide by the honor system here.
I seriously doubt everyone is going to abide by the honor system here.
I think that we (people in general) should be creating honeytraps to punish the ones not abiding to the honour system. For example:
create pages with text composed of random babble
link those pages in the site, somewhere that humans won’t access
exclude those pages from bot crawling in robots.txt
Bots abiding to the honour system will ignore those pages and move on. The ones not abiding to it will crawl through those pages and feed them into their owners’ AI models, that will be worse in result.
Good luck with that. Sure, maybe OpenAI respects your robots.txt and doesn’t scan you, but I seriously doubt everyone is going to abide by the honor system here.
I think that we (people in general) should be creating honeytraps to punish the ones not abiding to the honour system. For example:
Bots abiding to the honour system will ignore those pages and move on. The ones not abiding to it will crawl through those pages and feed them into their owners’ AI models, that will be worse in result.