actually awesome and fast search engine (depending on which instance you use) with no trashy AI and ADs results also great for privacy, if you don’t know which instance to use go to https://searx.space/ and choose an instance closest to you

  • notfromhere@lemmy.ml
    link
    fedilink
    arrow-up
    1
    arrow-down
    2
    ·
    1 month ago

    Is that legally binding? What happens of they catch you, ban your IPs then you’re in the same situation as now. Literally no reason to not do it IMO.

    • Mojeek Search Engine@lemmy.ml
      link
      fedilink
      arrow-up
      2
      ·
      1 month ago

      IP already hits a wall, also better to not get a reputation as a bad bot, it’s taken a while to get known for being friendly and respecting rules, to us you should follow robots

      • notfromhere@lemmy.ml
        link
        fedilink
        arrow-up
        2
        arrow-down
        1
        ·
        1 month ago

        I seem to recall creative ways to index things without robots, e.g. browser addon that users opt into to send pages and such, essentially crowdsourcing the indexing. Anyways good to see you’re taking the high road!

        • Mojeek Search Engine@lemmy.ml
          link
          fedilink
          arrow-up
          3
          ·
          1 month ago

          our preference is always to find out why the block is happening and try to convince people it should be otherwise; widespread abuse of robots.txt does no-one any good, having been crawling and indexing for so long it’s a standard that we understand and are quite fond of

          we can see some of the perils and pitfalls of it too, but web builders need to be given some tools and assurances that those tools will work for them

          • notfromhere@lemmy.ml
            link
            fedilink
            arrow-up
            2
            arrow-down
            1
            ·
            1 month ago

            That makes sense. One thing I’ve noticed with Mojeek search results compared to Google is that I do not encounter the “old web” any more on Mojeek than on Google. Are you not crawling/indexing web 1.0 blogs and sites at all?

            • Mojeek Search Engine@lemmy.ml
              link
              fedilink
              arrow-up
              2
              ·
              29 days ago

              probably a sample size issue, we crawl and index everything we are able to; have seen many of this kind of site in the past, and finding them is something that other people have said they enjoy about mojeek