How to cache pages for offline use

I’m a new YaCy user and I’d like to know whether I can cache the pages that I’ve manually crawled for offline use.

Hi, @pknessness and welcome to the community!

AFAIK, at least the text of the page is “cached”; more precisely, it is stored in the Solr database.
On the search results page, click “Parser” and you’ll see the text of the parsed page. (You must have “Parser” enabled in Search Portal Integration > Portal Design > Search Page Layout.) On the left side, there is a pull-down menu from which you can select “Plain text”, “Original from the cache”, etc.
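
If you’d rather pull the stored text out with a script than click through the UI, something like the sketch below might work. It is only a rough idea and I haven’t verified it against every YaCy version: it assumes the peer runs on http://localhost:8090, that the Solr interface is reachable at /solr/select, and that the URL and the parsed text are in the fields sku and text_t. Check the field names against your own index schema first.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def build_solr_query(page_url, base="http://localhost:8090",
                     fields=("sku", "title", "text_t")):
    """Build a Solr select URL asking for the stored text of one crawled page.

    Assumptions (not confirmed in this thread): the peer address, the
    /solr/select path, and the field names sku / title / text_t.
    """
    # Escape backslashes and quotes so the URL can sit inside a quoted Solr term.
    escaped = page_url.replace("\\", "\\\\").replace('"', '\\"')
    params = {
        "q": 'sku:"{}"'.format(escaped),
        "fl": ",".join(fields),
        "wt": "json",
        "rows": 1,
    }
    return base + "/solr/select?" + urlencode(params)


def fetch_stored_text(page_url, **kwargs):
    """Return the stored plain text for page_url, or None if it is not indexed."""
    with urlopen(build_solr_query(page_url, **kwargs), timeout=10) as resp:
        data = json.load(resp)
    docs = data.get("response", {}).get("docs", [])
    return docs[0].get("text_t") if docs else None
```

Calling fetch_stored_text("https://example.org/") against a running peer should give you the same plain text that the “Parser” view shows, provided the assumptions above hold.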

There is also a cache of HTML pages, used during crawling, but it is limited in size and its contents are deleted when it fills up.

One solution I was thinking about is combining Archive.org’s Heritrix as a crawler/archival tool with importing the resulting .warc files into YaCy for search. But I’ve never tried it.
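
Before importing anything, it may help to check what a .warc file actually contains. Here is a minimal, untested-in-production sketch that lists the URLs of the archived responses using only the Python standard library. The function names are my own, and it only handles the basic record layout (a WARC/ version line, headers, a blank line, then Content-Length bytes of payload); a dedicated WARC library would be more robust.

```python
import gzip


def iter_warc_records(stream):
    """Yield (headers, payload) for each record in a binary WARC stream."""
    while True:
        line = stream.readline()
        if not line:
            return  # end of file
        if not line.strip():
            continue  # blank separator lines between records
        if not line.startswith(b"WARC/"):
            raise ValueError("not a WARC record: %r" % line[:20])
        headers = {}
        while True:
            line = stream.readline()
            if line in (b"\r\n", b"\n", b""):
                break  # blank line ends the header block
            name, _, value = line.decode("utf-8", "replace").partition(":")
            headers[name.strip().lower()] = value.strip()
        length = int(headers.get("content-length", "0"))
        yield headers, stream.read(length)


def list_target_uris(stream):
    """Return the WARC-Target-URI of every 'response' record in the stream."""
    return [h["warc-target-uri"] for h, _ in iter_warc_records(stream)
            if h.get("warc-type") == "response" and "warc-target-uri" in h]


def open_warc(path):
    """Open a .warc or .warc.gz file for binary reading."""
    return gzip.open(path, "rb") if path.endswith(".gz") else open(path, "rb")
```

For example, `with open_warc("crawl.warc.gz") as f: print(list_target_uris(f))` would print the archived URLs, where crawl.warc.gz is a placeholder file name.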