[erlang-questions] html parsing in erlang?

Carlo Cabanilla <>
Wed Jan 20 23:06:21 CET 2010


On Wed, Jan 20, 2010 at 2:23 PM, Garrett Smith <> wrote:

> On Wed, Jan 20, 2010 at 7:41 AM, Carlo Cabanilla
> If your application can process the web content in batch, or using a
> disk based queue/spool, you could use this:
>
> - Grab + parse web content in Python
> - Dump your output (presumably trees, maps, etc.) to an Erlang term
> (see the erl_term module in py-interface
> http://www.lysator.liu.se/~tab/erlang/py_interface/ - or BERT
> http://bert-rpc.org/)
> - Read the terms on disk from Erlang
>
> To avoid the intermediary phase of writing to disk, you could setup
> your Python app as a port, which I've found to work very well.
>
>
 I was actually considering this design, have you implemented this before?
What's the overhead for the serialization/deserialization over the wire
like?


.Carlo


More information about the erlang-questions mailing list