I am de-googling my website and would like to replace my Google site search with a selfhosted alternative. I just installed Yacy on a VPS for testing purposes and so far it looks impressive.
Now I consider using it for two of my websites and wonder: Can I use one installation of Yacy to power the site search of two different websites? The search on domain1 is supposed to provide only results for pages on domain1 while the search on domain2 is supposed to provide only results for pages on domain2.
Do I need to instances of Yacy for it or is one enough - how can I do that? I tried to find it out by myself without success, so I hope some of you experienced users or developers can help me with my noob question. Thanks a lot!
Thanks for your answer. Not sure if I was clear enough. I know that Yacy can index multiple domains. I wanted to create two search engines for two different sites: domain1 and domain2. Both sites need their own site search that only displays search results for their own domain and not for all of them. How can I do that, if possible?
PS: I looked through all settings and options, it seems one needs a seperate instance for each website search. No way to create a search for only one domain. But I might have overlooked something, very complex admin area!
Advanced setup! I don’t run my own server at home so I use a rented VPS instead, now with one instance on port 8090, the other on 8091. Next step will be reverse proxy and connect it to a domain and then integrate with my two website. Let’s see if I can make it or if I will have to go back to using Google what I would hate…
Thanks, yes, I will play a bit the coming days. Very impressive search results, and it seems the VPS with 2GB RAM with 1cpu seems to be enough for two instances, thanks a lot for your help, very much appreciated!
@TheNomad11 Be a bit careful with certain VPS’ companies regarding YaCY … There’s quite a few unserious actors out there.
E.g I recently got straightup scammed by a company called greencloudvps . com / greenkvm . com
I already had a Japan VPS at theirs which i was quite pleased with, running yacy. And was looking to establish a US based VPS, so to consolidate an existing US Linode VPS to them as well. So as to have just one company to deal with.
Thankfully, i had not signed up for longer than 1 month plan. Because they denied me refund on it. So i cancelled all services with them, and reported their Delaware branch to the U.S FTC, for breaking contract. And to DNB ASA/BankID for their scam. (i paid with VISA)
And most fortunately, i had not come so far into my plan as to actually cancel and delete the existing Linode VPS , that i had intended to replace with greencloudvps’ fcuking sh * t.
Yeah, I am following discussions lowendtalk.com - there are indeed lots of dubious actors in the market. I used Hetzner, Germany’s largest hoster and with good reputation. Ideal for testing as they have hourly billing, so I only paid 8 cent for testing Yacy. Besides that, I don’t do P2P, just a simple website search. I also heard good stuff about Linode, they are in the same league as Digital Ocean, Vultr, Hetzner etc
Take some old PC Hardware. Install Freebsd. Install YaCy. NAT Port 8090 to that box and go!
All you have to pay for is the electric energy. Backup / export your URLs as HTM from time to time and:
Enjoy watching your YaCy instance being part of the most fascinating search engine since www.
Professional hosters are good for hosting static content (which YaCy isn’t). A reasonable (dedicated) machine to run YaCy will cost you several 100 bucks per month. Everything else is a lie.
All the cloud crap is made to rip off big companies after having them locked in before. Nothing for private freaks to play around
I agree, but I don’t know about which dimentions you are takling about. All I want to say is, that if you plan to index more than some few websites using YaCy, a professionally hosted hardware will quickly consume a lot of money.
If you have different experience, pls let me know.
the multi-site-search-in-one-YaCy thing should also be doable with the collection attribute. Thats the idea behind it: every crawl run is assigned to a specific collection by the user and a search can pick out by default only results from a given collection.