Andy Reid@lemmy.world to Technology@lemmy.worldEnglish · 9 months agoAI companies are violating a basic social contract of the web and and ignoring robots.txtwww.theverge.comexternal-linkmessage-square200fedilinkarrow-up11.09Karrow-down115cross-posted to: technology@midwest.socialtechnology@beehaw.orgwolnyinternet@szmer.infotechnology@lemmy.zip
arrow-up11.08Karrow-down1external-linkAI companies are violating a basic social contract of the web and and ignoring robots.txtwww.theverge.comAndy Reid@lemmy.world to Technology@lemmy.worldEnglish · 9 months agomessage-square200fedilinkcross-posted to: technology@midwest.socialtechnology@beehaw.orgwolnyinternet@szmer.infotechnology@lemmy.zip
minus-squareShitpostCentral@lemmy.worldlinkfedilinkEnglisharrow-up16·9 months agoYou’re second point is a good one, but you absolutely can log the IP which requested robots.txt. That’s just a standard part of any http server ever, no JavaScript needed.
minus-squareGenderNeutralBro@lemmy.sdf.orglinkfedilinkEnglisharrow-up11·9 months agoYou’d probably have to go out of your way to avoid logging this. I’ve always seen such logs enabled by default when setting up web servers.
You’re second point is a good one, but you absolutely can log the IP which requested robots.txt. That’s just a standard part of any http server ever, no JavaScript needed.
You’d probably have to go out of your way to avoid logging this. I’ve always seen such logs enabled by default when setting up web servers.