This page is an archive of a community-wide discussion. This page is no longer live. Further comments or questions on this topic should be made in a new Senate Hall page rather than here so that this page is preserved as a historic record. Advanced Jedi Training Droid 6(Talk to my master) 01:05, December 16, 2014 (UTC)
Hi, I tried to download the database dumps from here: http://starwars.wikia.com/wiki/Special:Statistics as I would like to use them for offline reading, but none of them worked:
The "current pages" version only contains about a third of the articles (only 117 926 of the expected 365 749), and the file ends with a <page> tag open, so it looks like an unfinished xml.
The "Current pages and history" version can't even be unpacked, I get an error message saying that it's not a valid gz file. The size of this one is very suspiciously exactly 400 MB, as in 400.000000, so I suspect that it is only the first part of a bigger file that has been split, and the rest is missing.
Wikia was down partially for maintenance earlier today. You might want to try again now; might have better luck. ProfessorTofty (talk) 17:52, November 14, 2014 (UTC)
Unfortunately this didn't solve my problem, and I've found this discussion: http://starwars.wikia.com/wiki/Forum:SH:Statistics, and also tried to download other wikis like the WoW and Muppets ones, both of which seem to have the same problems. All of this suggests to me, that the dumps have been out of order for almost a year, and not just here but on all the wikia wikis. That might be quite worrisome if these are the only archived copies of wookieepedia, as if something were to happen to the servers, two thirds of the current pages couldn't be recovered. 184.108.40.206 00:19, November 15, 2014 (UTC)
I'm quite certain that Wikia maintains regular backups of the actual databases. The dumps available for public download (which is what you are reporting broken) are in XML format, which is designed to be read by client programs (i.e. bots and offline readers); they also only contain fields that bots and offline readers are likely to care about, and do not contain information about deleted pages and revisions. Those dumps are provided as a courtesy for end users and are are neither meant as nor capable of being used as true backups. The actual complete databases used by the server, with all fields and deleted stuff, are MySQL, and would be backed up privately in that format by Wikia's technical staff. —MJ—Jedi Council Chambers 01:35, November 15, 2014 (UTC)