-
How are you blocking them? You have any kind of rate limiting?
Coz these guys appear to be ignoring robot restrictions and crawling stuff anyway using IPs they're not publishing
"WIRED was able to confirm that a server at the IP address Knight observed—44.221.181.252—will, on demand, visit and download webpages when a user asks Perplexity about the webpage, regardless of what the site’s robots.txt says."
https://www.wired.com/story/perplexity-is-a-bullshit-machine/
. edited out