[erlang-bugs] [erlang-questions] R11B-2 SMP Timer Race Condition Bug [Re: bug in timer:sleep/1 smp implementation (R11B-0)]

Toby Thain toby@REDACTED
Fri Dec 22 21:49:19 CET 2006


On 22-Dec-06, at 3:22 PM, Serge Aleynikov wrote:

> Additionally, I should say that we've been able to reproduce this  
> bug on
> several Linux platforms ...
> It happens when all the CPUs in SMP mode are over 75% loaded.  The bug
> doesn't happen immediately after starting a release, but after a  
> period
> of 5 min to 3 hours, which makes it pretty hard to diagnose.  The
> tracing method that we initially tried to use was to include printf
> statements in the emulator to stderr.  However, this prevented the bug
> from showing up.  Further it was changed to using SysV message  
> queue to
> communicate trace to an external process that dumped the trace to a
> file.  This allowed to gain further understanding of the problem,  
> but as
> Dmitry indicated any attempt to reduce the code to a minimal example
> made the problem disappear.

Could DTrace on Solaris help?

--Toby



More information about the erlang-bugs mailing list