How to get the wiki for offline browsing?

Hibou57
Posts: 4
Joined: Tue Apr 24, 2012 8:38 am

Post by Hibou57 »

Hi all,

My first post here (and maybe the only one, who knows :mrgreen: ). By the way, funny captcha question (the one about the source register for movsb).

I was just wondering whether there is an archive of the wiki for offline browsing. I can't get it cleanly with wget, because I would also pull in all the "garbage" pages, such as edit, history and the like.

I know it may be updated frequently, but I don't mind, as I like it very much as it already is :D

Have a nice time all =D>
Hibou57
Posts: 4
Joined: Tue Apr 24, 2012 8:38 am

Post by Hibou57 »

Perfect! There is a ZIP archive of the wiki, without histories and the like, just articles and talk pages (the latter being important too, after all).

Thank you.
DirkU
Posts: 1
Joined: Wed May 02, 2012 12:22 pm

Post by DirkU »

But the ZIP archive at http://files.osdev.org/osdev_wiki.zip seems to be outdated:
Its date is 30-Dec-2011. To be sure, I checked some wiki changes made after this date.
None of the ones I checked is included in the ZIP archive.

Is something wrong with the archiving of the wiki?

Dirk
Kazinsal
Member
Posts: 559
Joined: Wed Jul 13, 2011 7:38 pm
Libera.chat IRC: Kazinsal
Location: Vancouver

Post by Kazinsal »

I'd assume it's manually updated.

Can you use curl while excluding pages with "&action=edit" and "&action=history" in the address? If so, there's your solution.
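
One way to read this suggestion is as a filter step placed in front of whatever tool does the downloading. The sketch below only shows the filtering. The function name `filter_wiki_urls` and the sample URLs are made up for illustration, and the only patterns it knows are the two mentioned in the post.

```shell
#!/bin/sh
# Hypothetical helper: read URLs on stdin (one per line) and print only
# those that do not point at edit or history views. The two patterns are
# the ones named in the post; nothing else is filtered.
filter_wiki_urls() {
    grep -v -e 'action=edit' -e 'action=history'
}

# Illustrative input: one article URL and two "garbage" URLs.
printf '%s\n' \
    'http://wiki.osdev.org/Main_Page' \
    'http://wiki.osdev.org/index.php?title=Main_Page&action=edit' \
    'http://wiki.osdev.org/index.php?title=Main_Page&action=history' \
    | filter_wiki_urls
```

Only the first URL survives the filter. The remaining list could then be handed to curl, for instance one URL at a time through xargs.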
Love4Boobies
Member
Posts: 2111
Joined: Fri Mar 07, 2008 5:36 pm
Location: Bucharest, Romania

Post by Love4Boobies »

To my knowledge, that archive is *much* older. The date you mentioned is when chase changed his hosting provider.
"Computers in the future may weigh no more than 1.5 tons.", Popular Mechanics (1949)
[ Project UDI ]
Solar
Member
Posts: 7615
Joined: Thu Nov 16, 2006 12:01 pm
Location: Germany

Post by Solar »

There's Special:AllPages and Special:Export. Between them, you should be able to grab the wiki contents, and either import them into an offline wiki or transform the XML into something more readable. No automated function for "grab it all" exists, to my knowledge. (I really don't like MediaWiki, and this gets another notch in the "dislike" list.)
Every good solution is obvious once you've found it.
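
As a rough sketch of the Special:Export route: MediaWiki serves the XML export of a single page at `Special:Export/<title>`. The helper below only builds such URLs from a list of titles. It assumes the wiki serves pages directly under `http://wiki.osdev.org/`, as the Main_Page URL used elsewhere in this thread suggests. The function name `export_url` and the example titles are illustrative, and titles containing characters that need percent-encoding are not handled.

```shell
#!/bin/sh
# Assumption: pages live directly under this base URL.
WIKI_BASE='http://wiki.osdev.org'

# Hypothetical helper: print the Special:Export URL for one page title.
export_url() {
    # MediaWiki URLs use underscores where titles have spaces.
    title=$(printf '%s' "$1" | tr ' ' '_')
    printf '%s/Special:Export/%s\n' "$WIKI_BASE" "$title"
}

# Illustrative titles; a real list would be collected from Special:AllPages.
while IFS= read -r page; do
    export_url "$page"
done <<'EOF'
Main_Page
Bare Bones
EOF
```

Each printed URL could then be fetched with something like `curl -o page.xml "$url"`, ideally with a pause between requests to go easy on the server.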
Love4Boobies
Member
Posts: 2111
Joined: Fri Mar 07, 2008 5:36 pm
Location: Bucharest, Romania

Post by Love4Boobies »

Can you imagine what would happen if enough people tried to download Wikipedia at the same time? At any rate, I think wikis are not meant for offline reading: the whole point of them is that they can be in a constant state of change.
"Computers in the future may weigh no more than 1.5 tons.", Popular Mechanics (1949)
[ Project UDI ]
Solar
Member
Posts: 7615
Joined: Thu Nov 16, 2006 12:01 pm
Location: Germany

Post by Solar »

Are you seriously comparing our OSDev Wiki with Wikipedia? :shock: :wink:

(Before there was MediaWiki, we were using PhpWiki, which provided a nice download option, which could be disabled / enabled by the admin.)
Every good solution is obvious once you've found it.
Love4Boobies
Member
Posts: 2111
Joined: Fri Mar 07, 2008 5:36 pm
Location: Bucharest, Romania

Post by Love4Boobies »

Ah, no. Those were just my $0.02 regarding your comment on MediaWiki:
Solar wrote: (I really don't like MediaWiki, and this gets another notch in the "dislike" list.)
I do have some criticisms of my own, but those are off-topic.
"Computers in the future may weigh no more than 1.5 tons.", Popular Mechanics (1949)
[ Project UDI ]
chase
Site Admin
Posts: 709
Joined: Wed Oct 20, 2004 10:46 pm
Libera.chat IRC: chase_osdev
Location: Texas
Discord: chase/matt.heimer

Post by chase »

The DumpHTML extension is broken, hence no more backups.

I've switched over to using wget in a weekly cron job.

Code:

wget --mirror -k -p --reject '*=*,User:*,Special:*,User_talk:*' --exclude-directories='User:*,User:*/*,User:*/*/*,User_talk:*,User_talk:*/*,User_talk:*/*/*,Special:*,Special:*/*,Special:*/*/*' --user-agent="osdev-mirror" http://wiki.osdev.org/Main_Page
It ends up with a couple of duplicate pages (a problem if you are on Windows), because some pages are archived twice under names that differ only in case, but overall I think it should be a functional offline copy.
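
To see which pages would clash on a case-insensitive filesystem, the names can be lowercased and checked for duplicates. This is a minimal sketch. The function name `case_collisions` and the sample names are made up, and the real input would be a file listing of the mirror directory.

```shell
#!/bin/sh
# Hypothetical helper: read one name per line on stdin and print, in
# lowercase, every name that occurs more than once when case is ignored.
case_collisions() {
    tr '[:upper:]' '[:lower:]' | sort | uniq -d
}

# Illustrative names; in practice the list could come from something like
#   find wiki.osdev.org -type f
printf '%s\n' 'Main_Page' 'GDT' 'Gdt' 'Paging' | case_collisions
```

For the sample input this prints `gdt`, the one name that appears twice once case is ignored.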

If you guys can improve the wget command or have a better Linux alternative (maybe curl), let me know. Any additional shell scripting (sed, awk, etc.) to fix up broken stuff would also be welcome.

The result is: http://files.osdev.org/osdev_wiki.zip
OdinVex
Member
Posts: 55
Joined: Tue Sep 07, 2010 11:00 pm

Post by OdinVex »

Any possible fix to DumpHTML? Does anyone have a pre-broken dump? Can't use the new ones in their current form. ;_; *should never have deleted his copy, didn't know*
“...No rest, no peace...” ― Odin Vex
chase
Site Admin
Posts: 709
Joined: Wed Oct 20, 2004 10:46 pm
Libera.chat IRC: chase_osdev
Location: Texas
Discord: chase/matt.heimer

Post by chase »

What is broken in the current dump? (or what don't you like about it)
Combuster
Member
Posts: 9301
Joined: Wed Oct 18, 2006 3:45 am
Libera.chat IRC: [com]buster
Location: On the balcony, where I can actually keep 1½m distance

Post by Combuster »

You can't click category links (i.e. pretty much everything on the categorized main page) in at least Firefox and Epiphany, which is rather annoying because it breaks pretty much all of the indexing. Other than that, things work mostly fine here.
"Certainly avoid yourself. He is a newbie and might not realize it. You'll hate his code deeply a few years down the road." - Sortie
[ My OS ] [ VDisk/SFS ]
OdinVex
Member
Posts: 55
Joined: Tue Sep 07, 2010 11:00 pm

Post by OdinVex »

The filesystem structure is entirely different: it uses page titles instead of HTML pages that can link properly. :( Practically no link works.
“...No rest, no peace...” ― Odin Vex
Grashwandir
Posts: 4
Joined: Sat Aug 04, 2012 5:05 pm

Post by Grashwandir »

Hi,
I read the entire thread and much more... but I still don't know how to READ the offline archive!
The uncompressed files have no extension, and there is no index.php or .html file.
Sorry to be a fool. I feel I'm missing something obvious, but I'm completely lost :cry:

I'm going on vacation in two days, and I really want to bring this marvellous wiki with me. Please help me!

PS: Forgive my poor English; I don't speak it very well, and now I'm very tired.

Thanks folks.
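
The missing extensions are probably a side effect of mirroring URLs that have no extension themselves. wget has an `--adjust-extension` option that appends `.html` to saved HTML pages, and together with `-k` the links are rewritten to the new names. The sketch below is chase's command from earlier in the thread with that one option added. It is untested against the live wiki and would have to be run by whoever builds the mirror. The `WGET` variable and the `mirror_wiki` function are hypothetical conveniences so the command line can be printed without downloading anything.

```shell
#!/bin/sh
# Hypothetical override hook: set WGET=echo to print the command line
# instead of starting a download.
WGET=${WGET:-wget}

# chase's mirror command plus --adjust-extension, which makes wget save
# HTML pages with a ".html" suffix even if the URL has none.
mirror_wiki() {
    "$WGET" --mirror -k -p --adjust-extension \
        --reject '*=*,User:*,Special:*,User_talk:*' \
        --exclude-directories='User:*,User:*/*,User:*/*/*,User_talk:*,User_talk:*/*,User_talk:*/*/*,Special:*,Special:*/*,Special:*/*/*' \
        --user-agent="osdev-mirror" \
        http://wiki.osdev.org/Main_Page
}
```

Setting `WGET=echo` before calling `mirror_wiki` gives a dry run. With the default, the call starts the actual mirror, after which `Main_Page.html` inside the downloaded directory should open in a browser.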