[erlang-questions] Parallel Shootout & a style question

Tue Sep 2 09:22:11 CEST 2008

I guess there should be a utility library with
some different ways of parallelizing a computation...

Does the shootout maintain different source trees
for the single-core and multicore programs?

Otherwise, we'll incur a slight penalty on the
sequential benchmarks. If it's noticeable, we
can always split the sequential and parallel
parts using a cheap test:

case erlang:system_info(schedulers) of
     1 -> seq(...);
     _ -> par(...)
end.

I don't think rpc:pmap() does quite the right
thing. It tries to spread the computations
across erlang nodes. We want a much simpler pmap.

depthLoop(D, M) ->
    L = lists:seq(D,M,2),
    F = fun(D1) ->
           check(D1,M)
        end,
    Results = pmap(F, lists:seq(D,M,2)),
    lists:foreach(
       fun(Res) ->
          io:fwrite("~w\t trees of depth ~w\t check: ~w~n",
                    Res)
       end, Results).

check(D, M) ->
    N = 1 bsl (M-D + ?Min),
    [2*N, D, sumLoop(N,D,0)].

pmap(F, L) ->
    Me = self(),
    Pids = [spawn(fun() ->
               Me ! {self(),catch F(X)}
            end || X <- L],
    [receive {P,Res} -> Res end || P <- Pids].

(I haven't tried the above, or thought too much
about whether it's exactly the right solution.
See it as an implementation sketch.)

BR,
Ulf W

Kevin Scaldeferri skrev:
> First,  I don't think it's been mentioned here, but the language  
> benchmarks shootout finally got some multi-core hardware!
> 
> http://shootout.alioth.debian.org/u64q/
> 
> At the moment, though, there are almost no submissions of parallelized  
> code, so the results are about the same as the existing hardware.
> 
> I figured (slightly spurred on by the Haskell community) that we  
> should try to submit some modified versions that actually use the  
> multiple cores.  So, for example, I made a slight change to the binary  
> trees code and got a nearly 2x speedup on my 2-core machine.  In doing  
> so, I did run into one of those little things that I've never really  
> known the preferred approach for.  My modified function looks like this:
> 
> depthLoop(D,M) when D > M -> ok;
> depthLoop(D,M) ->
>      Self = self(),
>      spawn(fun() ->
>                    N = 1 bsl (M-D + ?Min),
>                    io:fwrite("~w\t trees of depth ~w\t check: ~w~n",
>                              [ 2*N, D, sumLoop(N,D,0) ]),
>                    Self ! done
>            end),
>      depthLoop (D+2,M),
>      receive done -> done end.
> 
> 
> parallelizing would only require the addition of the spawn, except  
> that if I do that, the function finishes executing and the program  
> exits before most of the processes run at all.  So, I need to add the  
> call to self(), the send, and the receive in order to prevent  
> premature termination.  So, what I'm wondering is, is there a better  
> idiom for achieving this goal?
> 
> 
> Thanks,
> 
> -kevin
> _______________________________________________
> erlang-questions mailing list
> erlang-questions@REDACTED
> http://www.erlang.org/mailman/listinfo/erlang-questions