• 𝙣𝙪𝙠𝙚@yah.lol
    link
    fedilink
    English
    arrow-up
    2
    ·
    1 year ago

    Why did I do this?

    Because I don’t want my material used for training LLM’s.

    Good luck with that. Sure, maybe OpenAI respects your robots.txt and doesn’t scan you, but I seriously doubt everyone is going to abide by the honor system here.

    • Lvxferre@lemmy.ml
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      1
      ·
      1 year ago

      I seriously doubt everyone is going to abide by the honor system here.

      I think that we (people in general) should be creating honeytraps to punish the ones not abiding to the honour system. For example:

      • create pages with text composed of random babble
      • link those pages in the site, somewhere that humans won’t access
      • exclude those pages from bot crawling in robots.txt

      Bots abiding to the honour system will ignore those pages and move on. The ones not abiding to it will crawl through those pages and feed them into their owners’ AI models, that will be worse in result.