<div class="gmail_quote">On Sun, Jun 19, 2011 at 2:36 PM, Ivan Uemlianin <span dir="ltr"><<a href="mailto:ivan@llaisdy.com">ivan@llaisdy.com</a>></span> wrote:<br><br><snip><br><div> </div><blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">
** Audio Preprocessor<br>
<br>
Automatic Speech Recognition (ASR) is essentially mapping a sequence of integers (i.e., acoustic signals) onto a sequence of linguistic symbols (i.e., phonemes (units of linguistic sound) or words). The raw audio data (e.g. from a wav file or a microphone) is not terribly useful for this and the first step is to convert this data into a more useful abstract representation. Each 100ms of sound is transformed into a feature vector of 39 features, known as Mel-Frequency Cepstral Co-efficients (MFCCs).<br>
</blockquote><div><br><snip><br><br>Doesn't that remember where I'd read this, but the topic was something like "what Erlang as a language is probably not well suited for...", and significant amount of mathematical calculation, was probably one of those. Has that stand changed ? Could it be a factor to consider ?<br>
<br>-- <br></div></div>regards,<br>BDutta<br>