[erlang-questions] Atom Unicode Support

Felix Gallo felixgallo@REDACTED
Wed Feb 3 17:32:27 CET 2016


There's also an interesting security issue around Unicode source code.

Take, for example, the recent hack of Cryptsy, in which a guy took what
looked like an innocent and safe pull request to fix an issue in one part
of his software that, through the magic of the preprocessor, turned out to
do something else entirely:

http://earlz.net/view/2016/01/16/0717/analyzing-the-56-million-exploit-and-cryptsys-security

and then set your gaze on:

https://github.com/reinderien/mimic

and then consider that, e.g., atoms are dynamically created on mention,
without warning.
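
A minimal sketch of that last point, not taken from any real code base: the
module name mimic_demo and the function demo/0 are hypothetical, and the
sketch assumes an Erlang/OTP release in which list_to_atom/1 accepts code
points above 255 (i.e. the Unicode atom support this thread is about). It
builds two atoms that render almost identically in many fonts but are
distinct terms:

```erlang
-module(mimic_demo).   % hypothetical module name, for illustration only
-export([demo/0]).

demo() ->
    %% "admin" spelled entirely with ASCII letters.
    Latin = [$a, $d, $m, $i, $n],
    %% Same word, but the first letter is U+0430 CYRILLIC SMALL LETTER A.
    Cyrillic = [16#0430, $d, $m, $i, $n],
    %% The two strings look alike on screen but are different terms.
    false = (Latin =:= Cyrillic),
    %% Assumption: this release allows code points > 255 in atoms.
    %% Each call silently creates the atom if it does not exist yet.
    A1 = list_to_atom(Latin),
    A2 = list_to_atom(Cyrillic),
    %% Distinct atoms, so a pattern match on one never matches the other.
    A1 =/= A2.
```

Calling mimic_demo:demo() should return true under those assumptions; a
reviewer reading a diff would have a hard time telling the two spellings
apart.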

F.


On Wed, Feb 3, 2016 at 6:11 AM, Fred Hebert <mononcqc@REDACTED> wrote:

> On 02/03, Pierre Fenoll wrote:
>
>> What about re.erl character classes?
>>
>> I believe the regular expression [\s] does not match Unicode spaces, even
>> when giving the unicode atom flag to re.erl functions.
>>
>> And there are other classes that Unicode defines that would be great for
>> re.erl to support.
>>
>
> Pass in the `ucp' option:
>
> ucp
>    Specifies that Unicode Character Properties should be used when
> resolving \B, \b, \D, \d, \S, \s, \W and \w. Without this flag, only
> ISO-Latin-1 properties are used. Using Unicode properties hurts
> performance, but is semantically correct when working with Unicode
> characters beyond the ISO-Latin-1 range.