[erlang-questions] wkipedia rendering engine

Thorsten Schuett schuett@REDACTED
Mon Jun 30 14:34:35 CEST 2008


On Monday 30 June 2008, Joe Armstrong wrote:
> Rock and roll....
>
> can you be more explicit than http://download.wikimedia.org can you
> point me to a specific file
> that I can download that works with your dump reader?
http://download.wikimedia.org/backup-index.html contains a list of the 
individual dumps. For historical reasons, we are doing our tests with the 
bavarian wiki, but you can use any dump. The bavarian is nice, because I can 
read it and the size is reasonable. So on 
http://download.wikimedia.org/barwiki/20080612/ you will find the list of 
different kinds of dumps. I downloaded 
http://download.wikimedia.org/barwiki/20080612/barwiki-20080612-pages-meta-history.xml.bz2 
which is a compressed XML file containing the complete history of each 
article.
> bunzip2 barwiki-20080612-pages-meta-history.xml.bz2
> java -jar dumpreader.jar barwiki-20080612-pages-meta-history.xml
It will output the erlang terms to the command line. Be carefull, this dump is 
already ~500MB in size.

Thorsten



More information about the erlang-questions mailing list