[erlang-questions] systematic global registration discrepancies
Valentin Nechayev
netch@REDACTED
Sat Dec 18 08:29:55 CET 2010
Tue, Nov 23, 2010 at 16:10:24, netch wrote about "[erlang-questions] systematic global registration discrepancies":
> we are using Erlang cluster for 20-25 nodes which all resides on different
> hosts. We are experiencing systematic problems with global registration of
> following kinds:
>
> 1. Attempt of register via global:register_name() hangs for an unlimited time
> (we could see it for a few hours until our patience is expired).
>
> 2. A name which is successfully reported as registered disappears from
> registered name lists at all nodes (including the registering one!)
>
> We had to add monitoring of global functionality which stops the node where
> registration hangs. It periodically detects registration failure and stops
> nodes, usually this is group of 7-10 nodes per one such failure. But it can't
> detect second case (silent disappearing).
>
> We use R12B5; it's planned to upgrade but is impossible for the closest next
> release.
R14B01 - the same but length of failing period seems reduced (on partial
statistics; full results will be in a few days)
> Did anybody seen this? Please suggest how to debug such problem.
Repeating the question.
-netch-
More information about the erlang-questions
mailing list