• Outwit1294@lemmy.today
    link
    fedilink
    English
    arrow-up
    9
    ·
    11 hours ago

    If I had to choose my favourite corporation, it would be Cloudflare. They at least do something good.

    • r00ty@kbin.life
      link
      fedilink
      arrow-up
      3
      ·
      8 hours ago

      There seems to be a line, so far as I can tell. If everything you need sits on the free tier, they’re really good (well tbh their R2 storage is reasonably priced too). But once you stray into needing a paid tier, it apparently (I’m not there) quickly gets expensive as you’re lured into every higher tiers.

      But yes, in general I don’t mind cloudflare so much and do use their free (and R2 paid) services.

  • fubarx@lemmy.world
    link
    fedilink
    English
    arrow-up
    11
    arrow-down
    2
    ·
    15 hours ago

    Totally understandable.

    If scanning to help send traffic to your website, that’s cool. If scanning to generate summaries that won’t send any traffic your way. No bueno.

    Ultimately, it should be whatever most benefits users.

    • Outwit1294@lemmy.today
      link
      fedilink
      English
      arrow-up
      3
      ·
      11 hours ago

      No, things should not benefit users, they should benefit the creator of the original content.

    • acosmichippo@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      ·
      edit-2
      13 hours ago

      but there also needs to be incentive for sites to host content. if it all gets hijacked by search engines that isn’t sustainable.

  • Pika@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    23
    arrow-down
    2
    ·
    18 hours ago

    I mean, with a company as large as cloudflare. I think they could /easily/ strong-arm this move by making blocking google crawlers a default setting on websites. The amount of traffic drop alone from that would make google think twice about the whole ordeal. And people who care about the google search indexers can turn them on again which will allow indexing again. but a default block would cause a lot of disruption google side and many people I don’t think would go in and fix the setting till later on down the road.

    • sbv@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      16
      arrow-down
      1
      ·
      17 hours ago

      Cloudflare’s customers probably wouldn’t be on board with that. Google’s properties provide a tonne of traffic to businesses. Doing anything to put that in jeopardy would probably have many of Cloudflare’s customers looking for a new provider.

      • Glitchvid@lemmy.world
        link
        fedilink
        English
        arrow-up
        10
        ·
        15 hours ago

        Google used to provide a ton of traffic, they hoard it all themselves now through AI and summaries of content. Eventually the balance of cost/benefit will shift and Google will suddenly see itself rejected from scraping, furthering the product deathspiral.

        • Pup Biru@aussie.zone
          link
          fedilink
          English
          arrow-up
          4
          ·
          9 hours ago

          content is only 1 category of website

          ecommerce drives all the advertising that funds content… it’s a much bigger market, and they don’t care about content scraping as long as you buy their product

          • Glitchvid@lemmy.world
            link
            fedilink
            English
            arrow-up
            2
            ·
            9 hours ago

            And the long term plan there is to strangle sites and take %100 of the adrev spend for themselves since users won’t ever leave the Google site. Either way Google as a search engine enters a death spiral, it’s already bleeding users.

      • Pika@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        4
        ·
        17 hours ago

        it would need to be advertised as a change and have it as a setting that had to be set yea, just have it default blocking abusive trackers, having Google bot or whatever it’s crawler name is as on there, with a toggle to allow it again

        • osaerisxero@kbin.melroy.org
          link
          fedilink
          arrow-up
          2
          ·
          16 hours ago

          Alternatively, you use the cloudflare money to sue the monopoly to decouple search and all other products, since blocking the AI trawlers shouldn’t have any measurable impact in search rankings

  • BaroqueInMind@piefed.social
    link
    fedilink
    English
    arrow-up
    11
    arrow-down
    6
    ·
    edit-2
    17 hours ago

    Additionally, Cloudflare’s initiative faces criticism from those who “worry that academic research, security scans, and other types of benign web crawling will get elbowed out of websites as barriers are built around more sites” through Cloudflare’s blocks and paywalls, the WSJ reported.

    The fuck? Since when is a bot designed to enumerate your network weaknesses to sell to Russian/Chinese/US hacking groups a bad thing to block? Fuck the WSJ for even putting that dumb as fuck take on the internet for other idiots to think about.

    NO , its not a good fucking idea to allow the equivalent of an incessant door-to-door salesman into your home to take notes of everything you own and sell to a random motherfucker somewhere else you don’t know.

    That behavior is fucking weird and shouldn’t be tolerated. Cloudflare arbitrarily blocking that network traffic for you is a good thing.

  • dblsaiko@discuss.tchncs.de
    link
    fedilink
    English
    arrow-up
    4
    arrow-down
    3
    ·
    18 hours ago

    Google’s bot is fine in my book, their crawler doesn’t absolutely blast your server with web requests like other AI crawlers do. (Speaking of, I need to update my list of netblocks and UAs to get iocaine-holed.)

    That said, two evil megacorps potentially fighting? I hope they kill each other.