<html><head></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><div><br></div><div>Tuncer also pointed out that there is Mats Cronqvist's trane:</div><div><br></div><div><a href="https://github.com/massemanet/trane">https://github.com/massemanet/trane</a></div><div><br></div><div>Which also aims to parse "real-world" HTML. Looks interesting.</div><div><br></div><div>BR,</div><div>Ulf W</div><br><div><div>On 18 Feb 2012, at 18:21, Ulf Wiger wrote:</div><br class="Apple-interchange-newline"><blockquote type="cite"><div style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><br><div><div>On 18 Feb 2012, at 17:33, Matti Oinas wrote:</div><br class="Apple-interchange-newline"><blockquote type="cite">Xmerl could be used to parse these templates if only XHTML would be supported. We could also easily validate these templates for correct markup using XML validators.</blockquote></div><br><div><br></div><div>Like I wrote, there is an unreleased version of xmerl_sax_parser with a 'html' mode, which handles most of the usual HTML badness (i.e. non-well-formed XML).</div><div><br></div><div>I would like to suggest that the OTP team release it. It doesn't have to be perfect. When parsing html, there is no such thing as perfect.</div><div><br></div><div>There is also the yaws_html module, which I admit that I've never used.</div><div><br></div><div>BR,</div><div>Ulf W</div></div></blockquote></div><br></body></html>