I don't know xmerl eventp, but for such large XML files it may be prudent performance-wise to interface to a fast 'C' SAX parser such as expat. The ejabberd project has an Erlang driver written to use the expat parser. It's also some pretty good code for a proven real-world system that one could learn a lot from. You may be able to adapt the code (I think it's called xml_stream.erl and the 'C' files under c_src) although if, as you write, you have only been doing Erlang for a few weeks it may be a bit of a leap to get into linked-in drivers.<br>
<br>Hope this helps.<br><br><div class="gmail_quote">On Wed, Jun 18, 2008 at 6:07 PM, Matt Harrison <<a href="mailto:matt@lummie.co.uk">matt@lummie.co.uk</a>> wrote:<br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
I have to extract information from some extremely large XML files, 6GB+<br>
as well. All signs point to xmerl_eventp but the documentation is non<br>
exactly fleshed out and as I've only been doing erlang for a few weeks I<br>
am really struggling.<br>
<br>
Did anyone go down the eventp route and did you find any documentation<br>
or can any provide a basic example of the callback module that is<br>
required.<br>
<br>
Any help so greatly appreciated...<br>
<br>
regards,<br>
<br>
Matt<br>
<br>
P.s. Great Pragmatic book Joe.<br>
<br>
<br>
_______________________________________________<br>
erlang-questions mailing list<br>
<a href="mailto:erlang-questions@erlang.org">erlang-questions@erlang.org</a><br>
<a href="http://www.erlang.org/mailman/listinfo/erlang-questions" target="_blank">http://www.erlang.org/mailman/listinfo/erlang-questions</a><br>
<br>
</blockquote></div><br>