?

Log in

No account? Create an account
Techie advice needed - downloading wikipedia - Synchronicity swirls and other foolishness

> Recent Entries
> Archive
> Friends
> Profile
> my rpg writing site

November 9th, 2008


Previous Entry Share Next Entry
10:53 pm - Techie advice needed - downloading wikipedia
So, I recently got a 16 GB memory card for my Nokia n810, and after filling it with as many ebooks, pdfs, and music and I can see any reason to have, I still have almost 7 GB free. So, it occurred to me that having all of wikipedia readily accessible w/o an internet connection would be darn handy. So, I found info about downloading it, and tried various sites for various previous versions (various forms of pages-articles.xml.bz2). After nearly completing the download 3 separate times (using my XP laptop) and having it end prematurely in some sort unknown download error, I'm fairly certain that this particular method won't work. Does anyone know of a method for downloading wikipedia (only the article content, including images, if possible) that does work?
Current Mood: hopefulhopeful

(8 comments | Leave a comment)

Comments:


[User Picture]
From:postrodent
Date:November 10th, 2008 07:50 am (UTC)
(Link)
If the "legit" process has failed you, I'd try a torrent search site -- nowtorrents.com or torrentz.com. Bit dodgy, obviously not something you want to do without an up to date antivirus system, but you find the damnedest things there, even _legal_ things.
[User Picture]
From:mindstalk
Date:November 10th, 2008 08:44 am (UTC)
(Link)
How's further Nokia experience been? I had to return my borrowed eee, and haven't gotten a new sub-laptop thing yet.
[User Picture]
From:heron61
Date:November 10th, 2008 09:40 am (UTC)
(Link)
Loving it. It's as good for all pda-like uses as my Clie, while also being almost as good as an ipod for music, and an excellent ebook reader, and a wonderful portable pdf reader (I have most of the bus routes I use stored in it). I've also used it for sending texts (by tethering it to my phone) which is a vast improvement over the annoying vileness of T9 texting, and once for Skype. It definitely has its limits (the gps is nearly useless and I've never bothered with the webcam), but is the best portable device I've ever used.
[User Picture]
From:alobar
Date:November 10th, 2008 09:16 am (UTC)
(Link)
Will all of wikipedia really fit in 7 gigs? I would have thought it would be much larger.
[User Picture]
From:heron61
Date:November 10th, 2008 09:40 am (UTC)
(Link)
It's 4.1 GB compressed, and most things less than double when uncompressed, so I'm betting it would work.
[User Picture]
From:slothman
Date:November 10th, 2008 03:24 pm (UTC)
(Link)
Depends what fraction of it is images; text compresses pretty well.
[User Picture]
From:mindstalk
Date:November 10th, 2008 01:23 pm (UTC)
(Link)
I'm not sure.
http://en.wikipedia.org/wiki/Wikipedia:Size_comparisons estimates 3.5 billion characters, which would be 3.5 Gb. Contrary to John's reply, a couple of my large text files compress at 3:1. I'm not sure where the HTML fits in, maybe that's all generated dynamically. Then there's the pictures, which won't compress much (being already compressed) but there's only so many of those. So perhaps 1G compressed text, 3G pictures, expanding to 3 G text and 3G pictures.

I think of normal books as about a megabyte, so 1G would be 1000 volumes. Though my 1911 Britannica volumes probably have more words than normal.

Also
http://en.wikipedia.org/wiki/Size_of_Wikipedia
http://stats.wikimedia.org/EN/Sitemap.htm
English database in 2006 was 3.4 G, presumably a lot bigger now.
[User Picture]
From:xi_o_teaz
Date:November 10th, 2008 11:58 pm (UTC)
(Link)
Wow. I hadn't even thought to download the entirety of Wiki. I've easily got 7G around here...

One of the problems would be that a downloaded version doesn't get updated, but, if it should go offline (they're far from their goal of fundraising), it might be good to just have it, current as of now.

I really gotta look into downloading this, soon...

> Go to Top
LiveJournal.com