SearXNG: A free internet metasearch engine

(github.com)

91 points | by theanonymousone 3 hours ago

18 comments

  • asciimoo 52 minutes ago
    Ohi, I'm the original creator of Searx, but due to the limitations of the metasearch concept I'm not involved in the development anymore. My new search project is https://github.com/asciimoo/hister (https://hister.org/).

    Hister is a full text indexer for websites and local files which automatically saves all the visited pages rendered by your browser. Storing full page content allows serving offline result previews and the full page content via MCP.

    Take a look at how the MCP can be utilized: https://hister.org/posts/give-your-ai-assistant-a-private-me...

    • kristianpaul 16 minutes ago
      Is this similar to fastcrw ?
      • asciimoo 7 minutes ago
        Both are search engines, but that's all the similarity. Hister has a traditional crawler, but its biggest strength is automatically indexing browser tabs as those are rendered. This way it bypasses authentication, CloudFlare, captchas and most of the annoying limitations of traditional crawlers. Hister also provides full offline result previews. Check out the small read-only demo: https://demo.hister.org/
    • operatingthetan 42 minutes ago
      [dead]
  • exiguus 41 minutes ago
    SearXNG is my daily internet search now +5 years; with YaCY Backends and else as fallback. I also build internal document search or RAG applications with this setup (SearXNG also support json results). However, there are some downer I accept because of privacy: 1. Its slower and the results are not that good then with others. But fast and good enough for most of my queries. 2. From time to time you get blocked on the duckduckgo, brave or whatever search and you must solve some captures. You can prevent this by getting and using API-Keys from them.

    The nice thing about using your own backend is, that you can prio it in the results and for example, if I crawl the smallweb and other site important for myself, this sites come up first in the results.

  • satvikpendem 2 hours ago
    TinySearch wraps this and works well for agents. It's better than the native SearXNG MCP because it optimizes the context before it even gets to the agent so as to not waste tokens.

    https://github.com/MarcellM01/TinySearch

    • drnick1 1 hour ago
      SearXNG did not include a built-in MCP server, last time I checked.
    • ProofHouse 2 hours ago
      Props
  • goodroot 57 minutes ago
    This appears to be a key tool for providing search to local models.

    I'm curious what setups folks use to provide this functionality.

    Since the quantized 24B parameter Gemma model came out, I've had good luck with tool calling on a 4070 Ti Super.

    Successful tool calling is what finally made the local experience useful.

    I should note this is for the general and not coding specific context.

    • gardnr 5 minutes ago
      It has a JSON mode that you need to enable in settings and then you can create a simple python script to interact with it or have the agent use `curl` and `jq` to interact with it.

      It's at the bottom of this page: https://docs.searxng.org/admin/settings/settings_search.html

    • drnick1 12 minutes ago
      I am also interested in what a full local AI stack with web search and other tools looks like. As far as I can tell, SearX does not embed an MCP server, so it can't be directly called from llama-server for example. Open WebUI does have an integration for SearX and other providers, but the results I obtained weren't particularly impressive.
  • artooro 1 hour ago
    It works well if you connect it the Brave Search API, but using it a scraper is fairly unreliable. Google stopped working a few days ago.
  • fishgoesblub 1 hour ago
    I've been using SearXNG for a few years now, however I've been trying out Degoog as a SearXNG alternative since I've had issues with engines constantly failing or being slow since day 1 of using SearXNG, but Degoog has worse results with the same engines. It's a shame since I'm having to pick between slower but better results, or very fast but worse results.
  • ManWith2Plans 2 hours ago
    I've been using this for some projects. It's exceptional and I recommend it highly.

    I actually included a recipe to deploy it to kubernetes in typekro, my TypeScript infrastructure-as-code project for kubernetes: https://typekro.run/api/searxng/

  • dexterdog 2 hours ago
    I've been self hosting this as my default engine across all of my searches for a few years now. I can't recommend it more highly.
    • viviansolide 2 hours ago
      Same experience
    • ProofHouse 2 hours ago
      I’ll have to try, I’ve only recently learned Exa pricing is a bit crazy (especially on searches where you source 30-40 sources)I just used it be default and then was like oh damn when I got hit
  • arikrahman 2 hours ago
    I have used SearXNG hosts like https://searx.be/ but stick with Brave search for the most part. Are there other good hosts people tend to use?
    • vimredo 1 hour ago
      Personally, I self-host it myself. All the hosts I tried either errored often, or gave search results that were complete garbage.
  • lucasrufkahr 1 hour ago
    Yeah, I find that searx results are way more relevant to what I’m actually looking for than a single engine. There’s so much manipulation going on that if you don’t aggregate multiple engines, it’s near impossible to get what you want.
  • rcarmo 1 hour ago
    Years of regular use here, has been great even before I started using it as an agent tool.
  • another_twist 1 hour ago
    Been a fan of searX for a while. Not sure if this is the same thing but there were plenty of hosted versions too.
  • salmonik 2 hours ago
    I prefer 4get.
  • noobcoder 1 hour ago
    how do i configure which specific search engines SearXNG pulls its results from? Can we extend it to onyl search Stack Overflow and GitHub
  • tosief 15 minutes ago
    [dead]
  • tomfow 2 hours ago
    [flagged]
  • tom6ow 1 hour ago
    [flagged]
  • tomnow 2 hours ago
    [flagged]