[erlang-questions] utf8 in source files

Allan Wegan <>
Tue Nov 9 07:34:12 CET 2010


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

> From what I saw, the usage is in regular literal strings and in
> comments.

Standard list based string literals of course do not work. Each byte of
a UTF-8 encoded string becomes a separate integer in the resulting list.

Simply using UTF-8 without changing the parsing process does only work
with binary string literals. For binaries it does not matter whether the
parser treats code points encoded in multiple bytes as a whole or as
separate bytes.

- -- 
Allan Wegan
Jabber: 
ICQ:    209459114
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.14 (MingW32)

iQEcBAEBAgAGBQJM2OtjAAoJENm5axHh7AcxmW8H/jJjfaXaF0xIXDwumG7ahosA
BgEtYpGlaXsjTVPWXip0cDxg8c/jPrNHUqSsMx4bq7BVAVYH/QemaxEkS2QKHBzm
rerFewlFKfsWVTDdfvTu3g3jDdHk7ICGcZvLtsFEZY2woXc4hSukivCa9fCQo4ND
EzScAtRka2tyv37bsv/MtbgpknUASQyL51RCxXs9jIfWg8W8jE9PJEyEYGKUB1an
cLI1Lpy/D5R5XN0vA8BvtgW7PV4kJdG8/oYSiWv3mBgmd0Oe6ceXwhtwnO1FGjCu
xa6n04nSUTW2rMqvjNMvCE38+PkX5twueURe2Vo2Bx/mFGWH2W3F5pwqoV25ILc=
=tPpJ
-----END PGP SIGNATURE-----


More information about the erlang-questions mailing list