How to get the wiki for offline browsing?

Posted: Tue Apr 24, 2012 8:47 am
by Hibou57
Hi all,

My first post here (and maybe the only one, who knows :mrgreen: ). By the way, funny captcha question (the one about the source register for movsb).

I was just wondering whether there is an archive of the wiki for offline browsing. I can't get it cleanly with wget, as I would also get all the "garbage" pages, like edit, history and the like.

I know it may be updated frequently, but I don't mind, as I like it very much as it already is :D

Have a nice time all =D>

Re: How to get the wiki for offline browsing?

Posted: Tue Apr 24, 2012 10:08 am
by Hibou57
Perfect! There is a ZIP archive of the wiki, without histories and the like, just articles and talk pages (the latter being important too, after all).

Thank you.

Re: How to get the wiki for offline browsing?

Posted: Wed May 02, 2012 12:30 pm
by DirkU
But the ZIP archive at http://files.osdev.org/osdev_wiki.zip seems to be outdated:
Its date is 30-Dec-2011. To be sure, I checked some changes newer than this date.
None of the ones I checked is included in the ZIP archive.

Something wrong with archiving the wiki?

Dirk

Re: How to get the wiki for offline browsing?

Posted: Wed May 02, 2012 2:42 pm
by Kazinsal
I'd assume it's manually updated.

Can you curl excluding pages with "&action=edit" and "&action=history" in the address? If so, there's your solution.
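
A rough sketch of the filtering suggested here, assuming the unwanted pages can be recognised purely by an "action=edit" or "action=history" query parameter. The function names is_article_url and filter_urls are made up for illustration, and the list of URLs has to come from somewhere else, since curl does not follow links on its own.

```shell
#!/bin/sh
# Hypothetical helper: succeed (exit status 0) only for URLs that are
# NOT edit or history views, judged purely by the query string.
is_article_url() {
    case "$1" in
        *action=edit*|*action=history*) return 1 ;;
        *) return 0 ;;
    esac
}

# Read one URL per line from stdin and print only the ones to keep.
filter_urls() {
    while IFS= read -r url; do
        if is_article_url "$url"; then
            printf '%s\n' "$url"
        fi
    done
}
```

The kept URLs can then be handed to the downloader of your choice, for example by piping the output of filter_urls into a loop that calls curl once per line.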

Re: How to get the wiki for offline browsing?

Posted: Thu May 03, 2012 9:54 am
by Love4Boobies
To my knowledge, that archive is *much* older. The date you mentioned is the one when chase changed his hosting provider.

Re: How to get the wiki for offline browsing?

Posted: Mon May 07, 2012 2:44 am
by Solar
There's Special:AllPages and Special:Export. Between them, you should be able to grab the Wiki contents, and either load them into an offline Wiki or transform the XML into something more readable. No automated function for "grab it all" exists, to my knowledge. (I really don't like MediaWiki, and this gets another notch in the "dislike" list.)
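
One way to act on this, sketched under assumptions: MediaWiki's Special:Export accepts a page title appended after a slash and returns that page as XML. The base URL http://wiki.osdev.org, the helper names export_url and export_page, and the output file naming are all assumptions made for illustration. The list of titles still has to be collected from Special:AllPages by other means.

```shell
#!/bin/sh
# Assumed base URL of the wiki; adjust if the wiki lives elsewhere.
WIKI_BASE="http://wiki.osdev.org"

# Build the Special:Export URL for one page title.
# Spaces become underscores, as in MediaWiki page URLs; other special
# characters are NOT escaped in this sketch.
export_url() {
    title=$(printf '%s' "$1" | tr ' ' '_')
    printf '%s/Special:Export/%s\n' "$WIKI_BASE" "$title"
}

# Download one page as XML into the current directory.
# Spaces and slashes in the title are replaced by underscores so that
# the result is a valid file name.
export_page() {
    out=$(printf '%s' "$1" | tr ' /' '__').xml
    curl -sS -o "$out" "$(export_url "$1")"
}
```

With a file of titles, one per line (titles.txt is a placeholder name), a loop such as "while IFS= read -r t; do export_page "$t"; sleep 1; done < titles.txt" would fetch them one by one, and the pause keeps the load on the server low.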

Re: How to get the wiki for offline browsing?

Posted: Mon May 07, 2012 8:16 am
by Love4Boobies
Can you imagine what would happen if enough people tried to download Wikipedia at the same time? At any rate, I think wikis are not meant for offline reading; the whole point of them is that they can be in a constant state of change.

Re: How to get the wiki for offline browsing?

Posted: Mon May 07, 2012 8:25 am
by Solar
Are you seriously comparing our OSDev Wiki with Wikipedia? :shock: :wink:

(Before there was MediaWiki, we were using PhpWiki, which provided a nice download option, which could be disabled / enabled by the admin.)

Re: How to get the wiki for offline browsing?

Posted: Mon May 07, 2012 8:33 am
by Love4Boobies
Ah, no. Those were just my $0.02 regarding your comment on MediaWiki:
Solar wrote:(I really don't like MediaWiki, and this gets another notch in the "dislike" list.)
I do have some criticisms of my own, but those are off-topic.

Re: How to get the wiki for offline browsing?

Posted: Mon Jun 11, 2012 9:52 pm
by chase
The DumpHTML extension is broken, hence no more backups.

I've switched over to using wget in a weekly cron job.

wget --mirror -k -p --reject '*=*,User:*,Special:*,User_talk:*' --exclude-directories='User:*,User:*/*,User:*/*/*,User_talk:*,User_talk:*/*,User_talk:*/*/*,Special:*,Special:*/*,Special:*/*/*' --user-agent="osdev-mirror" http://wiki.osdev.org/Main_Page
It ends up getting a couple of duplicate pages (if you are on Windows), because some of the pages are archived twice with different case, but overall I think it should be a functional offline copy.

If you guys can improve the wget command or have a better Linux alternative (maybe curl), let me know. Any additional shell scripting/sed/awk/etc. to fix up any broken stuff would also be welcome.

The result is: http://files.osdev.org/osdev_wiki.zip
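
As a small contribution towards the fix-up scripting asked for above, here is a sketch that lists the paths which differ only by letter case, i.e. the ones that would collide on a case-insensitive filesystem. The function name case_duplicates is made up, and the directory name in the usage note is an assumption about where wget puts the mirror.

```shell
#!/bin/sh
# Read one path per line from stdin and print, in lower case, every
# path that occurs more than once when letter case is ignored.
case_duplicates() {
    tr '[:upper:]' '[:lower:]' | sort | uniq -d
}
```

Something like "find wiki.osdev.org -type f | case_duplicates" would then show the affected pages. It only reports them; deciding which copy to keep is left to whoever runs it.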

Re: How to get the wiki for offline browsing?

Posted: Sat Jul 21, 2012 12:35 am
by OdinVex
Any possible fix to DumpHTML? Anyone have a pre-broken dump? Can't use the new ones in their form. ;_; *should never have deleted his copy, didn't know*

Re: How to get the wiki for offline browsing?

Posted: Sat Jul 21, 2012 4:03 pm
by chase
What is broken in the current dump? (or what don't you like about it)

Re: How to get the wiki for offline browsing?

Posted: Sat Jul 21, 2012 4:37 pm
by Combuster
You can't click category links (i.e. pretty much everything on the categorized main page) in at least Firefox and Epiphany, which is rather annoying, as it breaks pretty much all of the indexing. Other than that, things work mostly fine here.

Re: How to get the wiki for offline browsing?

Posted: Sat Jul 21, 2012 7:37 pm
by OdinVex
The filesystem structure is entirely different: it uses page titles instead of HTML pages that can link properly. :( Practically no link will work.

Re: How to get the wiki for offline browsing?

Posted: Sat Aug 04, 2012 5:14 pm
by Grashwandir
Hi,
I read the entire thread and much more... but I still don't know how to READ the offline archive!
The uncompressed files have no extension, and there is no index.php or .html file.
Sorry to be a fool. I feel I'm missing something obvious, but I'm completely lost :cry:

I'm going on vacation in two days, and I really want to bring this marvellous Wiki with me, please help me!

PS: forgive my poor English, I don't speak it very well, and now I'm very tired.

Thanks folks.