I asked two direct questions; if you wish not to answer them, that's up
to you.

You keep claiming that the benchmarks game does not have sufficiently
many measuring points during benchmarking to be able to spot anomalous
behaviour.

Let me explain it to you one more time - /we know/ that the benchmarks
game's 3 measuring points /were sufficient/ to spot anomalous behaviour
in some binary-trees programs because that is how someone spotted the
anomalous behaviour!

There's no particular reason you would have known that is how the
anomalous behaviour was detected, but now you do know - you now know
that the benchmarks game measuring points have been sufficient to spot
anomalous behaviour. Your claim is untrue.

