• interdimensionalmeme@lemmy.ml
          arrow-up
          4
          ·
          5 hours ago

          If you allow my SearXNG search scraper, then an AI scraper is indistinguishable from it.

          If you mean “Google and DuckDuckGo are whitelisted”, then Lemmy will only be searchable through those specific whitelisted hosts. And Google's search indexer is also an AI scraper bot.

    • deadcade@lemmy.deadca.de
      arrow-up
      9
      ·
      16 hours ago

      “Yes”, for any bits the user sees. The frontend UI can sit behind Anubis without issues. The API, both the user-facing one and federation, cannot. We expect “bots” to use an API, so you can’t put human verification in front of it. These “bots” also include applications that aren’t aware of Anubis or are unable to pass it, like all third-party Lemmy apps.

      That does stop almost all generic AI scraping, though it does not prevent targeted abuse.
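
A minimal sketch of that split, for illustration only. It is not Anubis's configuration format or Lemmy's routing code: the path prefixes, the `Accept` media types, and the `needs_challenge` helper are all assumptions made up for this example. The idea is simply that browser page loads get the challenge, while API calls and ActivityPub fetches pass straight through.

```python
from urllib.parse import urlsplit

# Hypothetical prefixes for machine-facing endpoints (assumed, not taken from Lemmy).
API_PREFIXES = ("/api/", "/inbox", "/.well-known/", "/nodeinfo/")

# Media types that ActivityPub clients commonly request (assumed for this sketch).
FEDERATION_ACCEPT = ("application/activity+json", "application/ld+json")


def needs_challenge(url: str, accept: str = "") -> bool:
    """Return True when a request should be sent through the human-verification page."""
    path = urlsplit(url).path
    # API and federation endpoints are expected to be used by programs: never challenge.
    if any(path.startswith(prefix) for prefix in API_PREFIXES):
        return False
    # Federation may fetch the same URLs as the UI, distinguished by the Accept header.
    if any(media_type in accept.lower() for media_type in FEDERATION_ACCEPT):
        return False
    # Everything else is treated as a page a human sees in a browser.
    return True
```

With these assumed rules, `needs_challenge("/post/123", "text/html")` is challenged, while a call under `/api/` or a fetch with `Accept: application/activity+json` is not. Anything a scraper can reach through the unchallenged paths stays reachable, which is the "targeted abuse" caveat above.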

    • seang96@spgrn.com
      arrow-up
      4
      ·
      20 hours ago

      As long as it's not configured improperly. When the Forgejo devs added it, it broke downloading images with Kubernetes for a moment. Basically, you would need to make sure the user agent header used for federation is allowed.
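
A rough sketch of what "allowing the federation user agent" could look like, assuming a simple substring allowlist. The marker strings and the `is_allowed_agent` helper are hypothetical, chosen for illustration; the real user agent strings sent by federating servers should be checked in your own logs before relying on anything like this.

```python
# Hypothetical substrings expected in federation user agents (assumed, verify in logs).
FEDERATION_AGENT_MARKERS = ("lemmy", "mastodon")


def is_allowed_agent(user_agent: str) -> bool:
    """Return True when the User-Agent header matches an allowlisted federation marker."""
    normalized = user_agent.lower()
    return any(marker in normalized for marker in FEDERATION_AGENT_MARKERS)
```

Since the header is client-supplied, any scraper can send the same string, so this only keeps well-behaved federation traffic from being blocked by accident.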