Marketers and their bots have been using reddit to hype up brands. No wonder Reddit feels like shit these days.

    • 4am@lemmy.zip
      link
      fedilink
      English
      arrow-up
      3
      ·
      3 hours ago

      That’s the job of the web server, not of the application that runs on it.

      There is already software you can get that feeds a never-ending maze of text to AI scrapers, some of which is AI generated and/or designed to poison LLM training. The problem is that these still use up a ton of bandwidth.

    • Rimu@piefed.social
      link
      fedilink
      English
      arrow-up
      29
      ·
      7 hours ago

      A never-ending maze would mean the scrapers just hammer our servers forever. Better to lead them into a honeypot and automatically ban their IP. Like PieFed does.

      • davidgro@lemmy.world
        link
        fedilink
        English
        arrow-up
        10
        ·
        6 hours ago

        What about a maze that adds a few hundred ms to the response time with each request, so the load gets less the longer it’s trapped?

        • Rimu@piefed.social
          link
          fedilink
          English
          arrow-up
          15
          ·
          6 hours ago

          There are a lot of strategies. afaik a tar pit tries to waste the attacker’s resources by delaying our responses to their traffic? A honey pot tries to funnel bot traffic towards a place which only bots would go to. Once they go there you know they’re a bot and they can be banned.