How to get the wiki for offline browing?
How to get the wiki for offline browing?
Hi all,
My first post (and may be the single, who know ) here. By the way, funny captcha question (the one about source register for movsb).
I was just wondering if there exist an archive of the wiki, for offline browsing. I can't cleanly get it with wget, as I could get all "garbage" pages, like edit, history and the like.
I know it may be updated frequently, but I don't mind, as I like it very much as it already is
Have a nice time all
My first post (and may be the single, who know ) here. By the way, funny captcha question (the one about source register for movsb).
I was just wondering if there exist an archive of the wiki, for offline browsing. I can't cleanly get it with wget, as I could get all "garbage" pages, like edit, history and the like.
I know it may be updated frequently, but I don't mind, as I like it very much as it already is
Have a nice time all
Re: How to get the wiki for offline browing?
Perfect! There is a ZIP archive for the Wiki, without histories and others, just articles and talks pages (the latter being important too, after‑all).
I Thanks you.
I Thanks you.
Re: How to get the wiki for offline browing?
But the ZIP archive at http://files.osdev.org/osdev_wiki.zip seems to be dated:
It's date is 30-Dec-2011. To be sure I've checked some changes newer than this date.
None of them I've checked is included in the ZIP archive.
Something wrong with archiving the wiki?
Dirk
It's date is 30-Dec-2011. To be sure I've checked some changes newer than this date.
None of them I've checked is included in the ZIP archive.
Something wrong with archiving the wiki?
Dirk
- Kazinsal
- Member
- Posts: 559
- Joined: Wed Jul 13, 2011 7:38 pm
- Libera.chat IRC: Kazinsal
- Location: Vancouver
- Contact:
Re: How to get the wiki for offline browing?
I'd assume it's manually updated.
Can you curl excluding pages with "&action=edit" and "&action=history" in the address? If so, there's your solution.
Can you curl excluding pages with "&action=edit" and "&action=history" in the address? If so, there's your solution.
- Love4Boobies
- Member
- Posts: 2111
- Joined: Fri Mar 07, 2008 5:36 pm
- Location: Bucharest, Romania
Re: How to get the wiki for offline browing?
To my knowledge, that archive is *much* older. The date you mentioned is the one when chase changed his hosting provider.
"Computers in the future may weigh no more than 1.5 tons.", Popular Mechanics (1949)
[ Project UDI ]
[ Project UDI ]
Re: How to get the wiki for offline browing?
There's Special:AllPages and Special:Export. Between them, you should be able to grab the Wiki contents, and either store them into an offline Wiki or for transformation from XML into something more readable. No automated function for "grab it all" exists, to my knowledge. (I really don't like MediaWiki, and this gets another notch in the "dislike" list.)
Every good solution is obvious once you've found it.
- Love4Boobies
- Member
- Posts: 2111
- Joined: Fri Mar 07, 2008 5:36 pm
- Location: Bucharest, Romania
Re: How to get the wiki for offline browing?
Can you imagine what would happen if enough people tried to download Wikipedia at the same time? At any rate, I think wiki's are not meant for offline reading---the whole point of them is that they can be in a constant state of change.
"Computers in the future may weigh no more than 1.5 tons.", Popular Mechanics (1949)
[ Project UDI ]
[ Project UDI ]
Re: How to get the wiki for offline browing?
Are you seriously comparing our OSDev Wiki with Wikipedia?
(Before there was MediaWiki, we were using PhpWiki, which provided a nice download option, which could be disabled / enabled by the admin.)
(Before there was MediaWiki, we were using PhpWiki, which provided a nice download option, which could be disabled / enabled by the admin.)
Every good solution is obvious once you've found it.
- Love4Boobies
- Member
- Posts: 2111
- Joined: Fri Mar 07, 2008 5:36 pm
- Location: Bucharest, Romania
Re: How to get the wiki for offline browing?
Ah, no. Those were just my $0.02 regarding your comment on MediaWiki:
I do have some criticisms of my on but those are off-topic.Solar wrote:(I really don't like MediaWiki, and this gets another notch in the "dislike" list.)
"Computers in the future may weigh no more than 1.5 tons.", Popular Mechanics (1949)
[ Project UDI ]
[ Project UDI ]
- chase
- Site Admin
- Posts: 710
- Joined: Wed Oct 20, 2004 10:46 pm
- Libera.chat IRC: chase_osdev
- Location: Texas
- Discord: chase/matt.heimer
- Contact:
Re: How to get the wiki for offline browing?
The DumpHTML extension is broken, hence no more backups.
I've switched over to using wget in a weekly cron job.
It ends up getting a couple of duplicate pages (if you are on Windows) because some of the pages are archived twice with different case but overall I think it should be a functional offline-copy.
If you guys can improve the wget command or have a better linux alt (maybe curl) let me know. Any additional shell scripting/sed/awk/etc to fix up any broken stuff would also be welcome.
The result is: http://files.osdev.org/osdev_wiki.zip
I've switched over to using wget in a weekly cron job.
Code: Select all
wget --mirror -k -p --reject '*=*,User:*,Special:*,User_talk:*' --exclude-directories='User:*,User:*/*,User:*/*/*,User_talk:*,User_talk:*/*,User_talk:*/*/*,Special:*,Special:*/*,Special:*/*/*' --user-agent="osdev-mirror" http://wiki.osdev.org/Main_Page
If you guys can improve the wget command or have a better linux alt (maybe curl) let me know. Any additional shell scripting/sed/awk/etc to fix up any broken stuff would also be welcome.
The result is: http://files.osdev.org/osdev_wiki.zip
Re: How to get the wiki for offline browing?
Any possible fix to DumpHTML? Anyone have a pre-broken dump? Can't use the new ones in their form. ;_; *should never have deleted his copy, didn't know*
“...No rest, no peace...” ― Odin Vex
- chase
- Site Admin
- Posts: 710
- Joined: Wed Oct 20, 2004 10:46 pm
- Libera.chat IRC: chase_osdev
- Location: Texas
- Discord: chase/matt.heimer
- Contact:
Re: How to get the wiki for offline browing?
What is broken in the current dump? (or what don't you like about it)
- Combuster
- Member
- Posts: 9301
- Joined: Wed Oct 18, 2006 3:45 am
- Libera.chat IRC: [com]buster
- Location: On the balcony, where I can actually keep 1½m distance
- Contact:
Re: How to get the wiki for offline browing?
You can't click category links (i.e. pretty much everything on the categorized main page) in at least firefox and epiphany which is rather annoying as it breaks pretty much all of the indexing. Other than that, things work mostly fine here.
Re: How to get the wiki for offline browing?
Filesystem-structure is entirely different, uses page-titles instead of HTML pages that can properly link. No link will work practically.
“...No rest, no peace...” ― Odin Vex
-
- Posts: 4
- Joined: Sat Aug 04, 2012 5:05 pm
Re: How to get the wiki for offline browing?
hi,
I read the entire post and much more... but I still don't know how to READ the offline archive!
The uncompressed files have no extension, there is not index.php or .html
sorry to be fool, I feel I'm missing something obvious but please I'm completly lost
I go vacation in two days, and I really want to bring this marvellous Wiki with me, please help me!
PS: forgive my poor english, I don't speak it very well, and now I'm very tired.
Thanks folks.
I read the entire post and much more... but I still don't know how to READ the offline archive!
The uncompressed files have no extension, there is not index.php or .html
sorry to be fool, I feel I'm missing something obvious but please I'm completly lost
I go vacation in two days, and I really want to bring this marvellous Wiki with me, please help me!
PS: forgive my poor english, I don't speak it very well, and now I'm very tired.
Thanks folks.