<div class="gmail_quote">On Wed, Aug 1, 2012 at 10:39 AM, Richard Carlsson <span dir="ltr"><<a href="mailto:carlsson.richard@gmail.com" target="_blank">carlsson.richard@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

<div class="im">On 08/01/2012 12:52 AM, CGS wrote:<br>

<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

Actually, try this:<br>

<br>

1. set your environment to UTF-8 (in my case, whatever Linux terminal<br>

with BASH environment, export LANG="en_US.utf8", use locale to find your<br>

environment language definition - "en_US.latin1" for LATIN-1)<br>

2. in a module:<br>

<br>

test_reverse(String) -> lists:reverse(String).<br>

<br>

3. Give as parameter the example given by yourself.<br>

4. Check the output.<br>

</blockquote>

<br></div>

Ah, but when you say "give as parameter" you mean "pass it a string literal from the shell", right?</blockquote><div><br></div><div>Yes. Sorry for the confusion.</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

 I never said anything about strings in the shell - that's a different environment from source files, and as you described, the shell nowadays detects your locale and translates UTF-8 console input into a string literal containing Unicode code points.</blockquote>

<div><br></div><div>The shell was just an example which is a part of the problem. Another part is the communication over HTTP/HTTPS protocol. And there may be some other examples. My point here is the interaction of Erlang with any environment which uses UTF-8. If Erlang compiler doesn't become aware of UTF-8, the code may be unstable without even understanding why. As you said, it seems that the compiler needs a fix.</div>

<div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"> This is exactly how it would happen in source code as well, if the compiler only knew how to detect that a source file is in a different encoding from Latin1. So the compiler is really the main thing that needs to be fixed, and then there should be no surprises on the encoding level anymore.</blockquote>

<div><br></div><div>I couldn't agree more.</div><div> </div><div>CGS</div><div><br></div></div>