archive.today and archive.ph (also .is, .md, .fo, .li, .vn) could be Russian assets.

  • 9 Posts
  • 442 Comments
Joined 9 months ago
cake
Cake day: March 5th, 2025

help-circle







  • The only non-lemmy.world account: Melon Husk™@sh.itjust.works
    Banned in some community because: Widely reported as a likely unmarked bot using an LLM to generate engagement bait
    (Profile not deleted)

    telokic, sededor, pali, Yecoh, vanes, henaw2, kogito @lemmy.world: gone

    All post sorta normal news stories as “YSK”.

    As others probably pointed out already, this is likely somebody playing with LLM bots (unmarked). Good riddance. Not sure about the first one though.












  • If crowdsec works for you thats great but also its a corporate product

    It’s also fully FLOSS with dozens of contributors (not to speak of the community-driven blocklists). If they make money with it, great.

    not exactly a pure self hosted solution.

    Why? I host it, I run it. It’s even in Debian Stable repos, but I choose their own more up-to-date ones.

    Allow me to expand on the problem I was having. It wasnt just that I was getting a knock or two, its that I was getting 40 knocks every few seconds scraping every page and searching for a bunch that didnt exist that would allow exploit points in unsecured production vps systems.

    • Again, a properly set up WAF will deal with this pronto
    • You should not have exploit points in unsecured production systems, full stop.

    On a computational level the constant network activity of bytes from webpage, zip files and images downloaded from scrapers pollutes traffic. Anubis stops this by trapping them in a landing page that transmits very little information from the server side.

    • And instead you leave the computations to your clients. Which becomes a problem on slow hardware.
    • Again, with a properly set up WAF there’s no “traffic pollution” or “downloading of zip files”.

    Anubis uses a weighted priority which grades how legit a browser client is.

    And apart from the user agent and a few other responses, all of which are easily spoofed, this means “do some javascript stuff on the local client” (there’s a link to an article here somewhere that explains this well) which will eat resources on the client’s machine, which becomes a real pita on e.g. smartphones.

    Also, I use one of those less-than-legit, weird and non-regular browsers, and I am being punished by tools like this.

    All the self hosters in my internet circle started adopting anubis so I wanted to try it. Anubis was relatively plug and play with prebuilt packages


    edit: I feel like this part of OP’s argument needs to be pointed out, it explains so much:

    All the self hosters in my internet circle started adopting anubis so I wanted to try it. Anubis was relatively plug and play with prebuilt packages