I can regenerate the behavior by stopping the network interface in the far node (linux ifdown). That runs the connected Erlang node, which was receiving the messages. I wonder if this how the Erlang implementation is or local to this particular setup.<br>
<br>Also I use HIPE. I'll try what you suggested below and also without HIPE.<br><br>Thanks,<br>- Eranga<br><br><br><br><br><br><br><div class="gmail_quote">On Tue, Mar 4, 2008 at 2:08 PM, Ulf Wiger (TN/EAB) <<a href="mailto:ulf.wiger@ericsson.com">ulf.wiger@ericsson.com</a>> wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;"><br>
It sounds as if the sending node is blocked in auto-connect.<br>
<br>
Try the kernel environment variable {dist_auto_connect, once}.<br>
It will ensure that any attempt to send to a disconnected node<br>
immediately fails. If one of the nodes restarts, they will<br>
automatically reconnect, as usual. You can explicitly connect<br>
the two nodes by calling net_kernel:connect(Node).<br>
<br>
BR,<br>
Ulf W<br>
<br>
Eranga Udesh skrev:<br>
<div class="Ih2E3d">> The problem occurs when the network connectivity is broken (abnormally).<br>
> The receiving node is not receiving messages. The sending processes are<br>
> blocked, since those message delivery calls (gen_event:notify/s, etc)<br>
> are waiting for about 10 secs to return. We checked the implementation<br>
> of such calls and notice, the functions are waiting until the messages<br>
> are delivered to the receiving node. Is there's a way (a system flag may<br>
> be) to avoid such blocking and to return immediately?<br>
><br>
> BRgds,<br>
> - Eranga<br>
><br>
><br>
><br>
> On Mon, Mar 3, 2008 at 6:51 PM, Chandru<br>
> <<a href="mailto:chandrashekhar.mullaparthi@gmail.com">chandrashekhar.mullaparthi@gmail.com</a><br>
</div><div class="Ih2E3d">> <mailto:<a href="mailto:chandrashekhar.mullaparthi@gmail.com">chandrashekhar.mullaparthi@gmail.com</a>>> wrote:<br>
><br>
> On 03/03/2008, Eranga Udesh <<a href="mailto:eranga.erl@gmail.com">eranga.erl@gmail.com</a><br>
</div><div class="Ih2E3d">> <mailto:<a href="mailto:eranga.erl@gmail.com">eranga.erl@gmail.com</a>>> wrote:<br>
> > Hi,<br>
> ><br>
> > I am experiencing a high message passing delay between 2 Erlang<br>
> nodes, after<br>
> > an abnormal network disconnection. Those 2 nodes are in a WAN and<br>
> there are<br>
> > multiple Hubs, Switches, Routes, etc., in between them. If the<br>
> message<br>
> > receiving Erlang node stopped gracefully, the delay doesn't<br>
> arise. Doing<br>
> > net_adm:ping/1 to that node results no delay "pang". However<br>
> > gen_event:notify/2, gen_server:cast/2, etc. are waiting for about<br>
> 10 seconds<br>
> > to return.<br>
> ><br>
> > What's the issue and how this can be avoided?<br>
><br>
> Have you tried putting a snoop to see whether the delay is on the<br>
> sending/receiving side?<br>
><br>
> This might be useful: <a href="http://www.erlang.org/contrib/erlsnoop-1.0.tgz" target="_blank">http://www.erlang.org/contrib/erlsnoop-1.0.tgz</a><br>
><br>
> cheers<br>
> Chandru<br>
><br>
><br>
><br>
</div>> ------------------------------------------------------------------------<br>
><br>
> _______________________________________________<br>
> erlang-questions mailing list<br>
> <a href="mailto:erlang-questions@erlang.org">erlang-questions@erlang.org</a><br>
> <a href="http://www.erlang.org/mailman/listinfo/erlang-questions" target="_blank">http://www.erlang.org/mailman/listinfo/erlang-questions</a><br>
</blockquote></div><br>