[erlang-questions] Atom Unicode Support

Benoit Chesneau bchesneau@REDACTED
Wed Feb 3 18:30:54 CET 2016


> On 03 Feb 2016, at 18:25, José Valim <jose.valim@REDACTED> wrote:
> 
> I just wanted to clarify, to avoid any confusion: this patch is mostly about supporting Unicode atom forms in the compiler. This is a required step if you ever want to support Unicode in the language, but it does not imply it. There are three main milestones:
> 
> 1. Support Unicode atom forms (i.e. the form {atom, Line, Atom} can have a UTF-8 encoded atom)
> 2. Support Unicode atoms between single quotes (for example, 'ノクス', as you can already write "ノクス")
> 3. Support Unicode in the language (for example, being able to write variables in Japanese)
> 
> This discussion was originally related to 1 but I could contribute 2 if desired. Many of the concerns raised above are related to step 3 which, afaik, is not planned.
> 
> 
> 

I wasn't clear enough. Supporting UTF-8 in atoms is OK, but do we actually support collation on binaries and strings? Either way, how does "é" compare to "è" or "e"? What rule was chosen?
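
For reference, here is a minimal sketch of what plain term comparison does with these three characters. It assumes strings are lists of code points and binaries are UTF-8 encoded. The module name cmp_demo and the function demo/0 are made up for illustration. As far as I can tell, no locale-aware collation is involved: lists are compared element by element as integers, and binaries byte by byte.

```erlang
-module(cmp_demo).
-export([demo/0]).

%% Hypothetical demo: ordering of "e", "è" and "é" under plain term comparison.
demo() ->
    E      = [16#65],   % U+0065, "e"
    EGrave = [16#E8],   % U+00E8, "è"
    EAcute = [16#E9],   % U+00E9, "é"

    %% Lists of code points compare element by element, as integers.
    true = E < EGrave,
    true = EGrave < EAcute,

    %% UTF-8 binaries compare byte by byte:
    %% "è" is <<16#C3,16#A8>> and "é" is <<16#C3,16#A9>>.
    BinGrave = unicode:characters_to_binary(EGrave),
    BinAcute = unicode:characters_to_binary(EAcute),
    true = BinGrave < BinAcute,

    %% Sorting therefore follows code point order, not a language's collation.
    lists:sort([EAcute, E, EGrave]).
```

So the order here is "e", "è", "é", purely by code point value. A collation tailored to a particular language could well order the accented forms differently, which is the rule I am asking about.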

- benoit