Handling huge amounts of data

Chandrashekhar Mullaparthi Chandrashekhar.Mullaparthi@REDACTED
Thu Jun 5 11:10:11 CEST 2003


Have you tried using 64 bit erlang - your app would be a good stress for it


-----Original Message-----
From: Vlad Dumitrescu [mailto:vlad_dumitrescu@REDACTED]
Sent: 05 June 2003 07:41
To: erlang-questions@REDACTED; Jay Nelson
Subject: Re: Handling huge amounts of data

> What are you doing with this data?  Does it have any
> regularity that can be captured in a function?  How
> much manipulation?  Would a binary work for you
> instead of lists?

Well, the data is a list (between 500.000 - 1.500.000 items) of chunks of
data that may be lists of bytes, or binaries and are 27 bytes each. For each
of them, I need to compare it with all the others in a non-trivial way (some
1000 similar tests), and select only those that pass the test.

The problem that I see is the globality of the search, which makes that I
can't use a "divide and conquer" strategy [*]. Also there is no good
locality of data access either.
[*] More precisely, in the worst case "d&c" will fall back onto "serch all"
after much work.

I tried several ways of storing the data in memory, as list of tuples, list
of binaries, ets table and in all cases the VM becomes slower and slower and
slower until it breaks.

I try to find something to do in Erlang in order to present the great
results and maybe allow further applications. We are a Web application that
uses J2EE, and that would make a great target, but this is the third
implementation in 3 years (don't ask why) and I feel nobody will accept
another one if this works...


This email (including attachments) is confidential.  If you have received
this email in error please notify the sender immediately and delete this
email from your system without copying or disseminating it or placing any
reliance upon its contents.  We cannot accept liability for any breaches of
confidence arising through use of email.  Any opinions expressed in this
email (including attachments) are those of the author and do not necessarily
reflect our opinions.  We will not accept responsibility for any commitments
made by our employees outside the scope of our business.  We do not warrant
the accuracy or completeness of such information.

More information about the erlang-questions mailing list