[erlang-questions] Rant: I hate parsing XML with Erlang

Kevin A. Smith kevin@REDACTED
Tue Oct 23 15:46:49 CEST 2007


Maybe that should be packaged separately? Seems odd that you'd have  
to get the webserver just for a HTML parser....

FWIW, I tried writing a very permissive feedparser but lost interest  
partially due to the ugliness of Erlang's XML parsing APIs.

--Kevin

--Kevin
On Oct 23, 2007, at 9:30 AM, Sean Hinde wrote:

> Take a look at yaws_html.erl. That is quite a nice parser that
> doesn't produce the same bloat as xmerl
>
> Sean
>
> On 23 Oct 2007, at 14:08, Joel Reymont wrote:
>
>>
>>
>> On Oct 23, 2007, at 2:02 PM, Vlad Dumitrescu wrote:
>>> Do you try to scrape arbitrary HTML? I don't think a XML parser will
>>> help that much in such a case, because HTML is only a distant cousin
>>> of XML...
>>
>> Completely arbitrary HTML. Any web page out there. The syntax and
>> approach won't be much for HTML, assuming you had a robust parser. My
>> rant is about the syntax.
>>
>> --
>> http://wagerlabs.com
>>
>>
>>
>>
>>
>> _______________________________________________
>> erlang-questions mailing list
>> erlang-questions@REDACTED
>> http://www.erlang.org/mailman/listinfo/erlang-questions
>
> _______________________________________________
> erlang-questions mailing list
> erlang-questions@REDACTED
> http://www.erlang.org/mailman/listinfo/erlang-questions




More information about the erlang-questions mailing list