[erlang-questions] Cause heartbeat timeout
Dmitry Kolesnikov
dmkolesnikov@REDACTED
Wed Nov 8 11:37:04 CET 2017
Thank you for the tips.
I have not managed to reproduce it with lists.
The node gets killed due to OOM instead.
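
For a controlled reproduction that does not depend on memory pressure, one option might be to stop the heartbeats themselves. Below is a minimal, untested sketch. It assumes the node was started with -heart, so that a process registered as heart exists; the module name heart_freeze is made up for illustration.

```erlang
%% Sketch only: stop the Erlang-side heart process from sending heartbeats,
%% so that the external heart port program eventually sees a timeout.
%% Assumption: the node was started with `erl -heart`.
%% The module name heart_freeze is hypothetical.
-module(heart_freeze).
-export([freeze/0, unfreeze/0]).

%% Suspend the registered heart process.
%% Returns {error, heart_not_running} if the node runs without -heart.
freeze() ->
    case whereis(heart) of
        undefined ->
            {error, heart_not_running};
        Pid ->
            true = erlang:suspend_process(Pid),
            {ok, Pid}
    end.

%% Resume the heart process. This must be called from the same process
%% that called freeze/0, otherwise erlang:resume_process/1 fails with badarg.
unfreeze() ->
    case whereis(heart) of
        undefined ->
            {error, heart_not_running};
        Pid ->
            true = erlang:resume_process(Pid),
            {ok, Pid}
    end.
```

Calling heart_freeze:freeze() from the shell should leave the rest of the node running while the heart port program waits out HEART_BEAT_TIMEOUT (60 seconds by default). To freeze the whole node rather than only the heart process, sending SIGSTOP to the emulator (kill -STOP with the pid reported by os:getpid()) should have a similar effect, since the heart port program is a separate OS process.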
- Dmitry.
> On 8 Nov 2017, at 11.49, Dmitry Klionsky <dm.klionsky@REDACTED> wrote:
>
> I faced a situation like this two times.
> It was on 2-CPU EC2 instances. The heartbeat timeout was 45 secs.
>
> The first offender was zlib:gzip/1 on a ~1GB file, the other was ++ over two long lists. Both operations took about 1 min to complete.
>
> My explanation is that the Erlang part of heart can't send the heartbeat to the heart port in time, because
> either the scheduler it's bound to is busy or port I/O is busy/congested.
>
> Hope this helps.
>
> On 11/08/2017 10:13 AM, Dmitry Kolesnikov wrote:
>> Hello,
>>
>> I am trying to debug an issue of node termination by heartbeat timeout and its recovery. I am looking for advice on reproducing a heartbeat timeout in a controlled environment. What is the best approach to freeze an Erlang node?
>>
>> Best Regards,
>> Dmitry
>> _______________________________________________
>> erlang-questions mailing list
>> erlang-questions@REDACTED
>> http://erlang.org/mailman/listinfo/erlang-questions
>
> --
> BR,
> Dmitry
>