How to block AI Crawler Bots using robots.txt file (www.cyberciti.biz)
Cynicus Rex@lemmy.ml to Privacy@lemmy.ml · English · 1 year ago · 63 comments
asudox@lemmy.world · 1 year ago
Not sure if that is effective at all. Why would a crawler check the robots.txt if it’s programmed to ignore it anyways?
Oha@lemmy.ohaa.xyz · 1 year ago
cause many crawlers seem to explicitly crawl “forbidden” sites
Crashumbc@lemmy.world · 1 year ago
Google and script kiddies copying code…
MangoPenguin@lemmy.blahaj.zone · 1 year ago
You could also place the same page as a hidden link on your home page.
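For anyone wondering what "checking robots.txt" looks like from the crawler's side, here is a minimal sketch using Python's standard urllib.robotparser. It only illustrates the point raised above: the file is advisory, and a crawler honors it only if its code asks. The rules, the /trap/ path, the example.com URLs, and the is_allowed helper are all hypothetical, made up for illustration; GPTBot and CCBot are the user-agent tokens published by OpenAI and Common Crawl.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content (an assumption for this sketch, not from the linked article).
ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: CCBot
Disallow: /

User-agent: *
Disallow: /trap/
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if the given rules permit user_agent to fetch url.

    This is the check a well-behaved crawler performs before each request.
    Nothing forces a crawler to call it.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for agent, url in [
        ("GPTBot", "https://example.com/index.html"),
        ("SomeOtherBot", "https://example.com/index.html"),
        ("SomeOtherBot", "https://example.com/trap/page.html"),
    ]:
        print(agent, url, is_allowed(agent, url))
```

A crawler that skips this check fetches everything regardless, so a request for a disallowed path such as the hypothetical /trap/ page is at least a hint that the client is not honoring the file.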