Non-reproducible bug on a live erlang system

Kaiduan Xie <>
Thu Jan 14 04:09:50 CET 2010


Hi, all,

Consider the following case, you have a live/busy Erlang system in
production which handles thousands of transactions per second and
millions of users, and customer reported a non-reproducible bug. The
problem is non-reproducible, or intermittent, or very hard to
reproduce in live system and in lab.

You can not turn on the debug log that will bring the system down, and
Erlang trace will not help since the problem is non-reproducible or
hard to reproduce.

How to resolve this kind problem? Can you shed light on this?

Thanks,

kaiduan


More information about the erlang-questions mailing list