[erlang-questions] wkipedia rendering engine
Thorsten Schuett
schuett@REDACTED
Mon Jun 30 14:34:35 CEST 2008
On Monday 30 June 2008, Joe Armstrong wrote:
> Rock and roll....
>
> can you be more explicit than http://download.wikimedia.org can you
> point me to a specific file
> that I can download that works with your dump reader?
http://download.wikimedia.org/backup-index.html contains a list of the
individual dumps. For historical reasons, we are doing our tests with the
bavarian wiki, but you can use any dump. The bavarian is nice, because I can
read it and the size is reasonable. So on
http://download.wikimedia.org/barwiki/20080612/ you will find the list of
different kinds of dumps. I downloaded
http://download.wikimedia.org/barwiki/20080612/barwiki-20080612-pages-meta-history.xml.bz2
which is a compressed XML file containing the complete history of each
article.
> bunzip2 barwiki-20080612-pages-meta-history.xml.bz2
> java -jar dumpreader.jar barwiki-20080612-pages-meta-history.xml
It will output the erlang terms to the command line. Be carefull, this dump is
already ~500MB in size.
Thorsten
More information about the erlang-questions
mailing list