Hi,
as the old wiki is obsolete, I started the process of batch conversion the old articles to MarkDown, so they could be managed and edited on GitHub.
First batch of pages is commited.
Since PRs had not been merged since last November, there is an updated standalone version of documentation as well.
The pages are probably heavily outdated, so anyone is warmly welcomed to check the newly converted ones and correct them.
I converted only english pages, those interested in other language can follow my path:
First, I downloaded using wget:
wget -r -l0 -np -E --restrict-file-names=unix,ascii,lowercase --convert-links https://wiki.yacy.net/index.php/En:Start;
wget -r -l0 -np -E --restrict-file-names=unix,ascii,lowercase --convert-links https://wiki.yacy.net/index.php/Dev:Start
, then converted by pandoc and cleaned a bit using sed, using this script:
#!/bin/sh
#convert html to md and place it into md dir
#create output dirs
mkdir -p md
# convert all html files to md
for f in *.html
do
echo -n "converting file $f "
#stripe out header and footer
sed -n '/<h1/,/Abgerufen/ p' $f > $f.tmp1
# convert from html to git-flavoured md
pandoc --from html --to gfm $f.tmp1 -o $f.tmp2
# clean spans and divs
sed -e 's/<span[^>]*>//g' $f.tmp2 | sed -e 's/<\/span>//g' | sed -e 's/<div[^>]*>//g' | sed -e 's/<\/div>//g' | sed -e 's/Abgerufen von/Converted from/' > $f.tmp3
echo " to md/`echo $f |sed 's/.html//'|sed 's/\:/_/'`.md"
mv $f.tmp3 md/`echo $f |sed 's/.html//'|sed 's/\:/_/'`.md
rm $f.tmp1 $f.tmp2
done
Then carefully hand-edited.
This is the first batch, mostly ‘Installation’ and all the linked ‘Operation’ section articles.
More files converted, but I got to edit them by hand and commit as the time constrains allow.