[erlang-questions] Updating Myths of Erlang Performance

Sat Jun 4 14:10:35 CEST 2016

On 04 Jun 2016, at 12:52 PM, Loïc Hoguin wrote:

> On the other hand, if your input is a binary, converting + looping might be slower than just looping through the binary. Not to mention the extra garbage created.
> 
> This section should point this out, otherwise we'll end up with users converting binaries to list "because lists are faster for this".
> 
> Also I think there are too few cases where lists win, especially in that kind of comparison. You are basically comparing binaries-as-string vs strings, and when using those it's frequent to match against <<"abcdef", T/binary>> forms (vs [$a, $b, $c, $d, $e, $f|T]) and as you pointed out, in that case binaries win.
> 
> IMO the myth is that list strings are efficient at all when in fact parsing them or even just having them in memory is inefficient. They have their use cases (Unicode related mostly).
> 
> Of course if the section is about people storing lists of integers in a binary then it makes perfect sense, but I haven't seen that one yet. :-)
> 
> -- 
> Loïc Hoguin
> http://ninenines.eu
> Author of The Erlanger Playbook,
> A book about software development using Erlang

Consider the following functional equivalents (A) and (B):

(A)
twist_tripod( <<>>, <<>>, <<>>, <<Result/binary>> )  -> Result;
twist_tripod( <<A0:1, A1:1, A2:1, A3:1, A4:1, A5:1, A6:1, A7:1, RemA/binary>>,
              <<B0:1, B1:1, B2:1, B3:1, B4:1, B5:1, B6:1, B7:1, RemB/binary>>,
              <<C0:1, C1:1, C2:1, C3:1, C4:1, C5:1, C6:1, C7:1, RemC/binary>>,
              <<Result/binary>> )
->
    twist_tripod( RemA, RemB, RemC, <<A0:1, B0:1, C0:1,
                                      A1:1, B1:1, C1:1,
                                      A2:1, B2:1, C2:1,
                                      A3:1, B3:1, C3:1,
                                      A4:1, B4:1, C4:1,
                                      A5:1, B5:1, C5:1,
                                      A6:1, B6:1, C6:1,
                                      A7:1, B7:1, C7:1, Result/binary>> )
.

(B)
twist_tripod( [], [], [], Result )             -> Result;
twist_tripod( [A|LA], [B|LB], [C|LC], Result )
->
    Out_0 =     (A band 16#80)              bor     % a7
                ((B band 16#80) bsr 1)      bor     % b7
                ((C band 16#80) bsr 2)      bor     % c7
                ((A band 16#40) bsr 2)      bor     % a6
                ((B band 16#40) bsr 3)      bor     % b6
                ((C band 16#40) bsr 4)      bor     % c6
                ((A band 16#20) bsr 4)      bor     % a5
                ((B band 16#20) bsr 5),             % b5

    Out_1 =     ((C band 16#20) bsl 2)      bor     % c5
                ((A band 16#10) bsl 2)      bor     % a4
                ((B band 16#10) bsl 1)      bor     % b4
                (C band 16#10)              bor     % c4
                (A band 16#08)              bor     % a3
                ((B band 16#08) bsr 1)      bor     % b3
                ((C band 16#08) bsr 2)      bor     % c3
                ((A band 16#04) bsr 2),             % a2

    Out_2 =     ((B band 16#04) bsl 5)      bor     % b2
                ((C band 16#04) bsl 4)      bor     % c2
                ((A band 16#02) bsl 4)      bor     % a1
                ((B band 16#02) bsl 3)      bor     % b1
                ((C band 16#02) bsl 2)      bor     % c1
                ((A band 16#01) bsl 2)      bor     % a0
                ((B band 16#01) bsl 1)      bor     % b0
                (C band 16#01),                     % c0

     twist_tripod( LA, LB, LC, [Out_0, Out_1, Out_2|Result] )
.

Believe me when I tell you that (B) is considerably faster than (A) -- I know, as I had to write (B) in order to optimize (A).
However, if you look at  (A), it seems so much easier to see what is being done. Not so with (B).

As it is not always about the speed, maybe the myth is that we use binary matching because it is faster. 
I think we use it because it is so much more expressive, concise  and therefore elegant.

V/

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://erlang.org/pipermail/erlang-questions/attachments/20160604/8877d0b5/attachment.htm>