[erlang-questions] Erlang for Speech Recognition

Banibrata Dutta <>
Sun Jun 19 16:45:40 CEST 2011


On Sun, Jun 19, 2011 at 2:36 PM, Ivan Uemlianin <> wrote:

<snip>


> ** Audio Preprocessor
>
> Automatic Speech Recognition (ASR) is essentially mapping a sequence of
> integers (i.e., acoustic signals) onto a sequence of linguistic symbols
> (i.e., phonemes (units of linguistic sound) or words).  The raw audio data
> (e.g. from a wav file or a microphone) is not terribly useful for this and
> the first step is to convert this data into a more useful abstract
> representation.  Each 100ms of sound is transformed into a feature vector of
> 39 features, known as Mel-Frequency Cepstral Co-efficients (MFCCs).
>

<snip>

Doesn't that remember where I'd read this, but the topic was something like
"what Erlang as a language is probably not well suited for...", and
significant amount of mathematical calculation, was probably one of those.
Has that stand changed ? Could it be a factor to consider ?

-- 
regards,
BDutta
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://erlang.org/pipermail/erlang-questions/attachments/20110619/8aeaf195/attachment.html>


More information about the erlang-questions mailing list