Non-reproducible bug on a live erlang system
Thu Jan 14 04:09:50 CET 2010
Consider the following case, you have a live/busy Erlang system in
production which handles thousands of transactions per second and
millions of users, and customer reported a non-reproducible bug. The
problem is non-reproducible, or intermittent, or very hard to
reproduce in live system and in lab.
You can not turn on the debug log that will bring the system down, and
Erlang trace will not help since the problem is non-reproducible or
hard to reproduce.
How to resolve this kind problem? Can you shed light on this?
More information about the erlang-questions