[erlang-questions] very large key lookup

Richard A. O'Keefe ok@REDACTED
Tue Sep 26 03:05:24 CEST 2006


ke han <ke.han@REDACTED> wrote:
	We will have hundreds of millions of these 32 byte keys in the the  
	server's "DB" and each key's associated 128 byte key.
	I am looking for advice on the format for these keys along with which  
	libraries to use for balance between memory  and lookup performance.   
	The current format is 0-9 + a-z character strings.

I note that there are 36 characters in the 0-9 + a-z character set.
This means that one can pack 3 of these characters into 2 bytes.
So 32 of these characters can be packed easily in 22 bytes, not 32.
To put it another way, a key space of /[0-9a-z]{32}/ is equivalent
to a key space of about 166 bits.

The internal form of these keys could well be Erlang binaries;
there doesn't seem to be any reason why the external form should change.

I do wonder, though, why
63,340,286,662,973,277,706,162,286,946,811,886,609,896,461,828,096 
primary keys are needed; enough to give every human being now alive
10**40 different keys.  Even if we consider every human being who has
ever lived, and give each of them 3 million computers, that's still
10**32 each.  This *looks* like far more extra bits than are needed
for reliability (error detecting codes) and security.  What am I missing?




More information about the erlang-questions mailing list