[erlang-questions] benchmarks game harsh criticism

Isaac Gouy igouy2@REDACTED
Fri Nov 30 21:17:53 CET 2007


--- Bengt Kleberg <bengt.kleberg@REDACTED> wrote:
-snip-
> quotes from "Timing Trials, or, the Trials of Timing" as per request:
> "* Memory-related issues and the effects of memory hierarchies are 
> pervasive: how memory is managed, from hardware caches to garbage 
> collection, can change runtimes dramatically. Yet users have no
> direct 
> control over most aspects of memory management. "
> 
> "We started to construct a table with three dimensions: task, 
> programming language, and machine. Eventually we added the size of
> the 
> problem solved by the program as a fourth dimension, and we changed
> the 
> presentation from tables to graphs. Varying the problem size helped
> us 
> to detect unusual runtime effects, while a graphical presentation 
> highlights patterns and trends in runtime instead of individual 
> performance scores."
> 
> "We designed tests whose runtime should grow linearly with the size
> of 
> the problem: runtime = mÂ?size + b. Thus, if we choose size to be
> large 
> enough to justify ignoring the fixed overhead (b), the log-log plot 
> should show a straight line of unit slope. Exceptions indicate
> anomalous 
> behavior that deserves further attention."


(Incidentally what they meant by anomalous behaviour is something like
this: "the line connecting C runtimes appears absolutely horizontal.
... This happens because the optimizer eliminates the entire loop,
replacing it by sum = n.")


Let me suggest to you that the paragraphs you quote are descriptions
not recommendations. 

Here's a recommendation: "... we advise all who want to know which
version of a program will run faster to construct test programs and
find out the truth for their language processor and machine."

Here's another: "It does seem wise to take all such experiments
­including these ­with a large grain of salt."


> Now it is your turn. could you quote the exact words where they say
> that 4 inputs and a spread of x10 is good enough?

There is no such statement, nor have I claimed that there is: I have
described the spread shown in the tests - "we can see that they varied
the problem size by < ~10x" 



      ____________________________________________________________________________________
Be a better sports nut!  Let your teams follow you 
with Yahoo Mobile. Try it now.  http://mobile.yahoo.com/sports;_ylt=At9_qDKvtAbMuh1G1SQtBI7ntAcJ



More information about the erlang-questions mailing list