[erlang-questions] 19.x <-> 18.x wire protocol incompatibility ?
Alexandre Snarskii
snar@REDACTED
Wed Oct 12 19:11:22 CEST 2016
Hi!
Got into interesting 'does not work' situation while connecting newly
deployed c-node (compiled with otp 19.0 libraries) with older 18.3
erlang node: nodes were unable to communicate with messages
2016-10-12 18:43:28.404 [warning] emulator '<erlang node name>' got a
corrupted external term from '<c-node name>' on distribution channel ...
logged on erlang side.
After some research, root cause was isolated: C-node was initialized using
ei_connect_xinit with random() % 16 for its creation id. In 18.x, Pid of
C-node was encoded as ERL_PID_EXT and used only two lower bits of creation id.
However, in 19.x there is a new Pid wire presentation, ERL_NEW_PID_EXT, which
encodes all 32 bits of creation id. Worse yet, this presentation is
automatically selected for encoding any Pid with creation id > 3, and,
as this presentation is not known by older 18.x nodes, this leads to
connection drop.
Solution: if you expect your C-nodes to communicate with pre-19.x erlang
nodes, you may use only values 0..3 for creation id. In my case I just
replaced random() % 16 with random() % 4.
Question: may be it makes sense to document this wire protocol incompatibility
somewhere ?
More information about the erlang-questions
mailing list